Software Engineer - GPU Inference at Baseten | San Francisco

Baseten, San Francisco
On-site Full-time

Experience Level

Entry Level

Qualifications

We are looking for individuals who possess a strong background in software engineering, particularly with experience in GPU inference. Candidates should demonstrate a passion for AI technologies, as well as a desire to innovate and contribute to a collaborative team environment. Familiarity with voice recognition systems and open-source projects will be a significant advantage. Strong problem-solving skills and the ability to work effectively across teams are essential.

About the job

Baseten develops infrastructure and tools that help AI companies deploy and scale inference. Teams at organizations like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer rely on Baseten to bring advanced machine learning models into production. The company recently secured a $300M Series E from investors including BOND, IVP, Spark Capital, Greylock, and Conviction.

Role overview

This Software Engineer - GPU Inference position joins the founding team for Baseten Voice AI in San Francisco. The team focuses on building production-ready Voice AI systems, bringing open-source voice models into real-world use for clients in productivity, customer service, healthcare conversations, and education. The work shapes how people interact with technology through voice, creating broad impact across industries.

In this role, the engineer leads the internal inference stack that powers Voice AI models. Responsibilities include guiding the product roadmap and driving engineering execution. Collaboration is a key part of the job: the engineer works closely with Forward Deployed Engineers, Model Performance Engineers, and other technical groups to advance Voice AI capabilities.

Sample projects and initiatives

About Baseten

Baseten is a dynamic and rapidly growing company dedicated to advancing AI technology. With a focus on providing mission-critical inference capabilities, we support leading AI companies in deploying their models effectively. Our innovative approach combines cutting-edge research with developer-friendly tools, enabling a transformative impact across various industries.
