Software Engineer Voice jobs in San Francisco – Browse 5,516 openings on RoboApply Jobs

Software Engineer Voice jobs in San Francisco

Open roles matching “Software Engineer Voice” with location signals for San Francisco. 5,516 active listings on RoboApply Jobs.

5,516 jobs found

1 - 20 of 5,516 Jobs
Apply
Aqua Voice logo
Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is pioneering the voice input landscape for the AI era. By training our own models and creating deep OS integrations, we ensure that voice technology is executed flawlessly across the entire stack.The evolution of work is upon us; individual contributor roles are transforming as we now manage AI agents, a task that naturally aligns…

Jan 28, 2026
Apply
Aqua Voice logo
Full-time|On-site|San Francisco

Join Our TeamAqua Voice is pioneering the voice interaction layer for the AI-driven era. We develop our own models and create deep OS integrations, as mastering voice technology necessitates control over the entire stack.The landscape of work is evolving rapidly. It's no longer just about individual contributions; now, you manage AI agents, and voice interaction is ideally suited for this new paradigm.We are not in the business of crafting conversational agents. Our vision is that the most intuitive way to engage with your computer is through voice commands with text responses (VITO). We believe that voice technology should operate at a layer above applications, and as a small company, we have the potential to push boundaries in this area.With relentless dedication, we are making great strides, and we invite you to be part of this journey.Your RoleWe are seeking a talented designer who is also proficient in coding.You will take the reins on design for our desktop applications, web platforms, and marketing collateral. You will implement your designs using React, TypeScript, and CSS.This is primarily a design-focused position. We prioritize your aesthetic sense over the speed of engineering.

Jan 28, 2026
Apply
Aqua Voice logo
Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is pioneering the voice input technology for the AI era. Our unique approach involves training proprietary models and establishing deep integrations with operating systems, ensuring optimal voice interaction capabilities.The landscape of work is evolving; the traditional role of an individual contributor is being replaced by the management of AI agents, making voice interaction a natural fit for this new paradigm.Rather than developing mere conversational agents, we envision a more intuitive method of computer interaction—voice input leading to text output (VITO). We assert that voice technology should operate at a foundational level, allowing a small, agile company like ours to innovate and excel.We are passionately committed to this mission, and our initial results have been promising. Join us in shaping the future!The OpportunityWe are looking for an iOS Engineer to help us build Aqua Voice from the ground up. This greenfield project offers you the chance to shape the mobile experience entirely.Your Contributions Will Include:Developing a native iOS application with system-level voice input features (including keyboard extensions, Shortcuts, and Siri integration).Implementing real-time audio capture and streaming to our transcription backend.Creating deep iOS integrations to enhance user experience.This role requires a focus on native performance and comprehensive system integration—this is not just a web view wrapper.Technology StackSwift (SwiftUI + UIKit where applicable)Audio Technologies: AVFoundation, Audio Unit, real-time streamingNetworking: WebSockets, streaming protocolsSystem Integrations: Keyboard extensions, Shortcuts, accessibility APIs

Jan 28, 2026
Apply
Sierra logoSierra logo
Full-time|On-site|San Francisco, CA

About UsAt Sierra, we are pioneering a transformative platform that empowers businesses to enhance customer interactions through advanced AI technology. While our primary operations are in-person at our San Francisco headquarters, we are expanding our presence globally with offices in Atlanta, New York, London, France, Singapore, and Japan.Our culture is built on core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and define our collaborative spirit.Founded by industry leaders Bret Taylor and Clay Bavor, Sierra benefits from their extensive expertise in tech innovation and leadership from renowned companies such as Salesforce, Facebook, and Google.Your RoleJoin our dynamic engineering team of approximately 40 highly skilled professionals, including notable engineers like Mihai, Belinda, Arya, and Wei. You will work within small, autonomous teams focused on solving complex customer challenges. Your key responsibilities will include:Transcription: Develop robust systems for accurate speech recognition and transcription across multiple languages, ensuring low latency and high performance.Synthesis: Create advanced speech generation technologies that align with diverse brand voices and styles.Runtime Optimization: Enhance the efficiency and responsiveness of voice applications to deliver seamless user experiences.

Nov 14, 2025
Apply
Baseten logoBaseten logo
Full-time|On-site|San Francisco

Baseten creates AI inference solutions for clients such as Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. The team blends AI research, infrastructure, and developer tools to help organizations deploy advanced models. Backed by $300M in Series E funding from BOND, IVP, Spark Capital, Greylock, and Conviction, Baseten is expanding quickly and shaping the landscape for engineers building AI products. Role overview The Software Engineer - Voice AI role centers on building and deploying open-source voice models for real-world use. Voice is becoming a key interface across the web, and this position addresses the technical challenges of bringing production-ready Voice AI to market. The work supports applications in productivity, customer service, clinical dialogue, creator tools, education, and more, helping to change how people interact with technology across sectors. This engineer leads Baseten’s Voice AI efforts, guiding the proprietary inference stack that powers Voice AI models. The role balances shaping the product roadmap with hands-on engineering. Collaboration is a core part of the job, working closely with Forward Deployed Engineers, Model Performance Engineers, and other technical teams to advance Voice AI capabilities. Sample projects and initiatives The world's fastest Whisper, with streaming and diarization Canopy Labs selects Baseten for Orpheus TTS inference Partnering with the Core Product team to build an orchestration framework for a multi-model voice agent Working with the Training Platform team to support ongoing training of voice models Designing a developer-friendly API and SDK to encourage self-service adoption of Baseten Voice AI products Location San Francisco

Apr 26, 2026
Apply
Anthropic logoAnthropic logo
Full-time|Hybrid|San Francisco, CA | New York City, NY | Seattle, WA

Join our team at Anthropic as a Senior / Staff+ Software Engineer specializing in Voice Platform development. In this role, you will be instrumental in advancing our voice technology initiatives, working closely with cross-functional teams to create innovative solutions that enhance user experiences.

Apr 1, 2026
Apply
Aqua Voice logo
Full-time|On-site|San Francisco

Join Our Team at Aqua VoiceAqua Voice is at the forefront of developing the voice input layer for the AI era. Our innovative approach involves training proprietary models and establishing deep OS integrations, ensuring we master the entire voice technology stack.The nature of work is evolving, and we’re transitioning from traditional individual contributor roles to managing AI agents. This shift creates a unique opportunity for voice technology to thrive.Rather than simply creating conversational agents, we envision an interaction model where voice input leads to text output (VITO). We believe voice should operate at a higher level than applications, enabling a smaller company like ours to lead the charge.Our passionate commitment to this vision has yielded promising results, and we invite you to be a part of our journey.Your RoleAs a Sales Development Representative, you will play a critical role in building our enterprise pipeline by pinpointing ideal companies and connecting us with key decision-makers.Research and Targeting: Identify companies and teams that align with our profile.Outbound Engagement: Utilize email, LinkedIn, and creative strategies to reach out—prioritizing quality over quantity.Qualification: Assess client needs, budgets, and timelines to set Account Executives up for success.User Expansion: Leverage existing contacts to open doors at potential client companies.Iterative Learning: Analyze messaging and channel effectiveness to refine our approach.This position serves as a pathway to Account Executive roles for those interested in advancing their career.

Jan 28, 2026
Apply
Aqua Voice logo
Staff Engineer

Aqua Voice

Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is revolutionizing voice input for the AI era. We develop proprietary models and create deep operating system integrations, as mastering voice technology necessitates comprehensive control over the entire stack.The workplace landscape is evolving. Individual contributor roles are being replaced by the need to manage AI agents, and this transition is ideally suited for voice interaction.Unlike conventional conversational agents, we advocate for a voice-first approach where voice input leads to text output (VITO). We assert that voice interaction should transcend application layers, and our nimble organization is committed to pushing boundaries in this domain.Our relentless pursuit of excellence in this space has yielded promising results, and we invite you to be part of our journey.The RoleAs a Staff Engineer, you will take ownership of technical systems across the entire stack. Being a small, post-product-market-fit team, we are experiencing growth with real users facing real challenges.We favor modern technologies—opting for Bun over Node, Fly over AWS when applicable, and PyTorch over outdated ML frameworks.Your ResponsibilitiesGPU Inference: We independently run our Automatic Speech Recognition models.Real-time Transcription: Implementing WebSocket state machines, multi-provider failover, and achieving sub-100ms latency targets while tracking over 30 metrics per session.Native Applications: Utilizing Swift for macOS, C# for Windows, and Electron IPC for system-level programming.Backend Development: Employing Django for billing and analytics, Celery for background tasks, and PostgreSQL for database management.You will be involved in all aspects of our infrastructure and product development.Technology Stack- Languages: TypeScript, Python, Swift, C#- Runtime Environments: Bun, Node.js, Django, FastAPI- Machine Learning: PyTorch- Infrastructure: Fly.io, Terraform, AWS (RDS), Redis- Protocols: gRPC, WebSocket, RESTQualifications- Proven experience in building and managing systems at scale.- Strong architectural opinions that are adaptable.- Comfortable being the foremost technical expert in the room.- Capable of transforming ambiguous problems into functional systems.- Ability to write production-ready code.

Jan 28, 2026
Apply
Baseten logoBaseten logo
Full-time|On-site|San Francisco

Baseten seeks a Software Engineer to focus on Voice AI within the Inference Runtime team. This San Francisco-based role centers on building and refining AI models that power voice interaction features in Baseten products. Role overview This position involves hands-on work with AI models for voice-driven applications. The engineer will help shape how users interact with voice technology by developing and optimizing the underlying systems. What you will do Develop and optimize AI models for applications that use voice as a primary interface Work on inference runtime systems to support responsive and intelligent voice experiences Contribute directly to Baseten's products, influencing the future of voice technology for users Location This position is based in San Francisco.

Apr 23, 2026
Apply
Aqua Voice logo
Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is pioneering the voice input layer for the AI era. We develop our proprietary models and create deep operating system integrations, as mastering voice interaction necessitates comprehensive control over the entire technology stack.The landscape of work is evolving. Individual contributor roles have transformed; now you will manage AI agents, a task that is ideally suited for voice technology.We are not in the business of creating simple conversational agents. Our vision is that the most intuitive interaction with computers is through voice input with text output (VITO). We advocate for voice integration at a fundamental level, confident that a nimble company can lead innovation in this space.Our commitment to this vision is unwavering, and we have already made significant strides. We invite you to become a part of this exciting journey.The OpportunityAs we witness growing enterprise interest—teams eager to integrate Aqua into their workflows—you will lead these initiatives.Full-Cycle Sales Ownership: From initial conversation to contract signing, you will manage the entire sales process without handoffs.Land and Expand Strategy: Begin with key champions within organizations, and grow our presence across entire teams.Navigating Enterprise Processes: You will adeptly handle security reviews, procurement, and legal considerations, drawing on your previous experience.Defining Sales Methodology: As a key early-stage hire, you will help shape how enterprise sales are conducted at Aqua Voice.Collaboration with Product Teams: Your insights into enterprise needs will directly influence our product roadmap.

Jan 28, 2026
Apply
Aircall logoAircall logo
Full-time|On-site|San Francisco Office

Join Aircall as a Senior Software Engineer specializing in AI Voice Agents, where you'll play a pivotal role in designing and implementing innovative voice solutions. Your expertise in software development and artificial intelligence will help us enhance customer interactions through cutting-edge technology.

Apr 10, 2026
Apply
Deepgram logoDeepgram logo
Full-time|On-site|San Francisco, CA

Join Deepgram, a pioneering leader in voice AI technology, as a Software Engineer specializing in Voice Agents for the restaurant industry. You will play a vital role in developing innovative AI applications that enhance customer experiences and streamline restaurant operations. Your expertise in software development and passion for AI will contribute to transforming how restaurants interact with their customers.

Feb 26, 2026
Apply
Nooks logoNooks logo
Full Time|On-site|San Francisco

About Nooks.ai:Nooks is revolutionizing the sales landscape with our AI Sales Assistant Platform (ASAP), designed to eliminate mundane tasks and empower sales representatives to prioritize meaningful customer interactions. Our technology has enabled thousands of sales professionals to exceed their quotas, saved clients countless hours, and generated hundreds of millions of dollars in sales pipeline. Nooks is trusted by sales teams from leading companies such as Hubspot, Rippling, and Toast, as well as many others.As a high-performing team, we have successfully secured over $70M in funding from prestigious venture capital firms, including Kleiner Perkins, which marked its first investment in sales technology in over a decade by choosing Nooks. Over the last two years, we've achieved remarkable growth, quadrupling our annual recurring revenue (ARR) and then tripling it again, with plans to do so once more this year.To learn more about us, visit Nooks.ai. In this role, you will take ownership of our backend infrastructure for real-time audio and video calling (including Twilio, Salesfloor, A/V quality, recordings, and transcriptions), ensuring our dialer can scale reliably to handle 10x the current volume.Your Key Responsibilities:Architect and enhance our real-time voice infrastructure (WebRTC/SIP/Twilio).Guarantee that call quality and latency consistently meet SLA targets on a global scale.Develop services for advanced functionalities such as call transfers, monitoring, and recordings.Create observability and debugging tools to optimize call flows.Collaborate with Product and Support teams to implement new A/V features and ensure rapid issue resolution.Ideal Candidate Profile:6 to 10+ years of experience in backend or infrastructure engineering, particularly with real-time audio/video or telephony systems.Proven hands-on experience with technologies such as WebRTC, SIP, Twilio, or similar platforms.Strong understanding of distributed systems and low-latency infrastructure.Experience in debugging and optimizing Quality of Service (QoS) metrics including latency, jitter, and packet loss.At Nooks, our compensation package for eligible roles includes a base salary, equity options, and comprehensive benefits. The base salary is just one part of the total compensation, which may also encompass equity, health, dental, vision, life and disability insurance, hybrid work arrangements, and unlimited paid time off. Please note that benefits are subject to change and may vary based on the employment jurisdiction.

Nov 13, 2025
Apply
Nooks logoNooks logo
FullTime|On-site|San Francisco

About Nooks.ai:Nooks is revolutionizing the sales landscape with our AI Sales Assistant Platform (ASAP) that streamlines operations, allowing sales representatives to concentrate on building meaningful customer relationships and boosting their sales pipeline. Our innovative platform has empowered thousands of sales professionals to achieve their quotas, saving clients countless hours and generating hundreds of millions in revenue. Trusted by sales teams at leading companies like Hubspot, Rippling, and Toast, Nooks is at the forefront of sales technology.Backed by over $70M in funding from premier venture capitalists, including Kleiner Perkins, Nooks is experiencing exceptional growth, having increased our Annual Recurring Revenue (ARR) by 4x and then 3x in recent years. We aim to achieve another 3x growth this year, continuing our trajectory of success.For more information, visit Nooks.ai.As a Senior Software Engineer specializing in voice AI, you will take ownership of our backend infrastructure for real-time audio/video communication, ensuring it scales efficiently to handle increased usage.Responsibilities:Design and enhance real-time voice infrastructure utilizing WebRTC, SIP, and Twilio.Monitor and optimize call quality and latency to comply with global SLA targets.Develop advanced features including call transfers, monitoring, and recordings.Create observability and debugging tools to streamline call flow management.Collaborate with Product and Support teams to facilitate new A/V feature rollouts and expedite issue resolution.Qualifications:6–10+ years of experience in backend or infrastructure engineering, specifically in real-time audio/video or telephony systems.Proven expertise with WebRTC, SIP, Twilio, or similar technology stacks.Solid understanding of distributed systems and low-latency infrastructure.Experience in debugging and optimizing Quality of Service (QoS) metrics such as latency, jitter, and packet loss.Compensation at Nooks for qualified roles includes a competitive base salary, equity options, and a comprehensive benefits package. Our offerings include health, dental, vision, life, and disability insurance coverage, a flexible hybrid work environment, and unlimited paid time off. Benefits may vary based on location and are subject to change.

Oct 8, 2025
Apply
Cartesia logoCartesia logo
Full-time|On-site|*HQ - San Francisco, CA

About CartesiaAt Cartesia, our vision is to create the future of AI—an intelligent, interactive system that seamlessly integrates into your daily life. Today, even the most advanced models struggle to process and analyze vast streams of audio, video, and text data—1 billion text tokens, 10 billion audio tokens, and 1 trillion video tokens—especially on-device.We are at the forefront of developing the model architectures that will make these capabilities a reality. Our founding team, who met as PhD candidates at the Stanford AI Lab, pioneered State Space Models (SSMs), a novel approach for training efficient, large-scale foundational models. Our diverse team combines deep expertise in model innovation and systems engineering with a design-oriented product engineering team to create and deliver cutting-edge models and experiences.With support from leading investors such as Index Ventures, Lightspeed Venture Partners, Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks, and others, along with guidance from an exceptional group of advisors and over 90 angel investors across various industries, we are well-positioned to redefine the AI landscape.About the RoleWe are seeking a passionate Product Software Engineer to join our team, focusing on building and scaling the platform infrastructure that supports Cartesia's voice AI products. In this role, you will develop composable platform components that enhance feature delivery across our API and conversational AI systems.Your ImpactDevelop and scale product platform components to facilitate Cartesia's growth.Take ownership of critical product infrastructure end-to-end, including real-time systems.Collaborate with product teams to identify common needs and create platform primitives that enhance their development speed.What You BringA minimum of 3 years of experience as a backend or platform engineer with a focus on building scalable systems.A customer-centric mindset that prioritizes outcomes over technical specifications.A proactive approach to owning your work, tackling ambiguous challenges, and achieving results.Strong decision-making skills regarding when to expedite development and when to invest in scalability.Familiarity with our technology stack: TypeScript for frontend, Python and Go for backend. Expertise in all three is not required, but a willingness to learn is essential.Nice-to-HavesExperience with AI or machine learning systems.Background in working with real-time data processing.

Feb 3, 2026
Apply
Wispr Flow logo
Full-time|On-site|San Francisco

Join the Team at Wispr FlowAt Wispr Flow, we are revolutionizing the way you interact with technology, making it as simple and natural as conversing with a friend.We are proud to be the leading voice dictation platform, surpassing traditional keyboards, thanks to our exceptional understanding and contextual awareness. Our platform is designed to be personalized and accessible on any device—be it desktop or mobile.As we look to 2026, we are committed to not just dictation but also to developing native actions with an agentic framework, enhancing the reliability of our voice interface.Our diverse team of AI researchers, designers, engineers, and growth experts are passionate about reimagining human-computer interactions. We seek high-agency individuals who value open communication, prioritize user experience, and pay meticulous attention to detail. Our culture thrives on spirited discussions, pursuit of truth, and making a meaningful impact in the real world.With a remarkable 150% revenue growth each quarter for the last year and $81 million raised from top-tier venture capitalists and renowned angel investors, we are poised for continued success.

Aug 11, 2025
Apply
deepgram logodeepgram logo
Full-time|$215/yr - $215/yr|On-site|San Francisco, CA

Company OverviewDeepgram stands at the forefront of the burgeoning Voice AI economy, delivering unparalleled real-time APIs for speech-to-text (STT) and text-to-speech (TTS), while also enabling the creation of scalable, production-grade voice agents. Over 200,000 developers and more than 1,300 organizations, including prominent names like Twilio, Cloudflare, and Jack in the Box, trust Deepgram to power their voice solutions. Our voice-native foundation models are accessible via cloud APIs, as well as self-hosted and on-premises software, boasting unmatched accuracy, minimal latency, and cost-efficient solutions. With a recent Series C funding led by top-tier global investors, Deepgram has processed over 50,000 years of audio and transcribed over 1 trillion words, making us the leading authority in voice technology.Company Operating RhythmAt Deepgram, we foster an AI-first culture where leveraging AI tools is essential to our innovation and performance metrics. Every member of our team is encouraged to actively engage with advanced AI tools and incorporate them into their daily tasks. Our success hinges on the creative and effective use of cutting-edge AI capabilities. Candidates should be adaptable, eager to learn, and excited by the prospect of integrating new models into their workflows, as our fast-paced environment requires continuous evolution and experimentation.The Opportunity:We are on the lookout for a Software Engineer to become a vital part of Deepgram for Restaurants, a newly established business unit focused on addressing critical challenges in the restaurant industry, collaborating closely with Deepgram’s core research teams.This role involves direct engagement with leading restaurant enterprises and technology partners to identify and tackle the most pressing issues in the sector. Initial focus areas will include:Developing ultra-reliable ASR systems for noisy, multi-microphone environmentsCreating high-accuracy, menu-aware drive-thru and phone ordering agentsImplementing real-time analytics and operational intelligence solutionsDesigning dependable in-restaurant hardware solutions

Jan 13, 2026
Apply
Weekend logoWeekend logo
Full-time|On-site|San Francisco

About UsWeekend, formerly known as Volley, stands at the forefront of voice AI game development for smart TVs. Our innovative games are enjoyed by millions every month, featuring beloved titles such as Jeopardy!, Song Quiz, CoComelon: Sing and Play with JJ, and Wheel of Fortune.We envision a future where voice interfaces will be the primary means of accessing entertainment in homes, cars, and more. To this end, we are creating a comprehensive subscription service akin to Netflix or Spotify, dedicated to the emerging genre of “AI-powered games.”Founded in 2024 by Max Child and James Wilsterman, Weekend made its mark by participating in Y Combinator in 2018 and earning a place on the prestigious 2022 YC Top Companies List. Based in the vibrant Union Square of San Francisco, we are rapidly expanding our team.

Mar 11, 2026
Apply
Wispr Flow logo
Full-time|On-site|San Francisco

About Wispr FlowWispr Flow is revolutionizing the way we interact with technology, making it as seamless as conversing with a close friend.As the leading voice dictation platform, Wispr Flow is now preferred by users over traditional keyboards, thanks to its exceptional understanding and contextual awareness. Whether on desktop or phone, our platform delivers personalized experiences that adapt to your unique needs.Looking ahead to 2026, we aim to expand beyond dictation to develop native actions — a framework that not only understands you but operates reliably across various tasks.Our diverse team of AI researchers, designers, growth specialists, and engineers is committed to reimagining human-computer interaction from the ground up. We prioritize high-agency collaborators who communicate transparently, deeply engage with our users, and focus on the finer details. We foster an environment of spirited discussions, truth-seeking, and meaningful impact.With a remarkable revenue growth of 150% each quarter over the past year and $81 million raised from top-tier venture capital firms and well-known angel investors, Wispr Flow is poised for continued success.

Aug 4, 2025
Apply
Weekend logoWeekend logo
Full-time|On-site|San Francisco

About Weekend Weekend (formerly Volley) creates voice AI games for smart TVs, reaching millions of players every month. Popular titles include Jeopardy!, Song Quiz, CoComelon: Sing and Play with JJ, and Wheel of Fortune. The team believes voice interfaces will shape the future of home entertainment, whether in the living room, kitchen, bedroom, or on the road. Weekend is building a subscription service for AI-powered games, inspired by models like Netflix and Spotify but focused on interactive play. Founded in 2024 by Max Child and James Wilsterman, Weekend started in Y Combinator’s 2018 batch and was recognized in the 2022 YC Top Companies List. The company is growing quickly in San Francisco’s Union Square neighborhood. Role Overview: Senior Software Engineer (San Francisco) The Hub team owns Weekend’s bundled gaming experience, serving as the entry point for users on every TV platform. This group shapes features that help players discover new games, stay engaged, and keep coming back. The work spans everything from the main product experience to experiments that support business growth. This role involves close collaboration with product and design partners, working across the full stack of a consumer-facing product. It suits engineers who want real ownership and enjoy working at the intersection of engineering, product, and data. What You’ll Do Work with Product to clarify requirements, write technical specs, and deliver features from idea through launch. Partner with Design to build engaging, user-friendly experiences. Spot risks early, influence engineering decisions, and mentor teammates. Use data to guide technical decisions and measure feature impact. Join code reviews to maintain strong standards for code quality, architecture, and performance. What We’re Looking For 6+ years of experience in software engineering. Hands-on expertise with TypeScript and React in production settings. Track record of leading initiatives that span multiple teams, not just solo projects. Experience with CI/CD pipelines and building for multiple platforms. Strategic approach to solving complex problems and improving processes.

Apr 17, 2026

Sign in to browse more jobs

Create account — see all 5,516 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.