About the job
Baseten creates AI inference solutions for clients such as Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. The team blends AI research, infrastructure, and developer tools to help organizations deploy advanced models. Backed by $300M in Series E funding from BOND, IVP, Spark Capital, Greylock, and Conviction, Baseten is expanding quickly and shaping the landscape for engineers building AI products.
Role overview
The Software Engineer - Voice AI role centers on building and deploying open-source voice models for real-world use. Voice is becoming a key interface across the web, and this position addresses the technical challenges of bringing production-ready Voice AI to market. The work supports applications in productivity, customer service, clinical dialogue, creator tools, education, and more, helping to change how people interact with technology across sectors.
This engineer leads Baseten’s Voice AI efforts, guiding the proprietary inference stack that powers Voice AI models. The role balances shaping the product roadmap with hands-on engineering. Collaboration is a core part of the job, working closely with Forward Deployed Engineers, Model Performance Engineers, and other technical teams to advance Voice AI capabilities.
Sample projects and initiatives
- The world's fastest Whisper, with streaming and diarization
- Canopy Labs selects Baseten for Orpheus TTS inference
- Partnering with the Core Product team to build an orchestration framework for a multi-model voice agent
- Working with the Training Platform team to support ongoing training of voice models
- Designing a developer-friendly API and SDK to encourage self-service adoption of Baseten Voice AI products
Location
San Francisco

