Ai Researcher In Audio And Voice Technology jobs in San Francisco – Browse 5,624 openings on RoboApply Jobs

Ai Researcher In Audio And Voice Technology jobs in San Francisco

Open roles matching “Ai Researcher In Audio And Voice Technology” with location signals for San Francisco. 5,624 active listings on RoboApply Jobs.

5,624 jobs found

1 - 20 of 5,624 Jobs
Apply
company
Full-time|On-site|San Francisco, California, United States

Join Us at Amplifier Health!We are pioneering healthcare innovations with the world's first Large Acoustic Model (LAM), a groundbreaking foundation model that utilizes human voice to identify health conditions. This is where science fiction meets reality, and we have secured substantial funding from leading investors to establish a transformative new category in healthcare.We are in search of a passionate AI researcher who is ready to break free from the traditional "publish or perish" mindset and focus on creating impactful intelligence that truly works in real-world applications.The Reality of Our WorkWe are entering an exhilarating phase of rapid growth. Our commitment to pushing the boundaries of technology is matched only by our dedication to saving lives at scale.Our team collaborates in person in San Francisco, believing that the most challenging problems are best tackled together at a whiteboard rather than through virtual meetings.We operate at a fast pace, quickly transitioning from hypothesis to code, training, and validation with immediate feedback.We enjoy our work and thrive as a close-knit team on an exciting journey, driven by our passion for what we do.Your MissionAs part of our elite AI Research team, you will elevate the state-of-the-art in acoustic modeling. Your role will involve designing innovative architectures to extract clinical-grade biomarkers from raw audio data, not just fine-tuning existing models.The Challenges Ahead:Novel Architectures: You will explore how Transformer architectures can be adapted to process complex acoustic signals and long-range dependencies.Biomarker Discovery: You will conduct experiments to identify specific acoustic features (such as jitter, shimmer, and respiratory rate) that correlate with health conditions, often uncovering new signals that have yet to be recognized by medical science.Data Efficiency: You will contribute to building a foundation model, utilizing self-supervised learning techniques to harness vast amounts of unlabeled audio data.

Jan 30, 2026
Apply
company
Full-time|On-site|San Francisco Bay Area

About Retell AI Retell AI develops advanced voice AI technology for call centers, using first-principles approaches to create intelligent voice agents. These agents help businesses manage sales, support, and logistics communications while reducing reliance on large human teams. The company has reached $36M ARR in just 18 months, backed by Y Combinator and Alt Capital. With a team of 20, Retell AI is building a comprehensive customer experience platform, aiming for AI-powered contact centers by 2026. The vision: intelligent agents that execute, monitor, and improve customer interactions with minimal human oversight. Named a top 50 AI app by a16z: https://tinyurl.com/5853dt2x Ranked #4 on Brex's Fast-Growing Software Vendors of 2025: https://www.brex.com/journal/brex-benchmark-december-2025 Featured among top startups: https://leanaileaderboard.com/ Role Overview: Research Scientist - Voice AI Innovation This role centers on advancing machine learning for human-like voice agents in real-world settings. The Research Scientist will explore new methods in large language models (LLMs) and audio models, design evaluation techniques, and prototype systems that improve reasoning, reduce latency, and enhance conversational quality. The work involves open-ended ML challenges, rapid experimentation, and direct influence on the performance and cognitive abilities of voice AI systems at scale. Location San Francisco Bay Area

Apr 14, 2026
Apply
company
Full-time|On-site|San Francisco

Join Our Innovative Team at David AIDavid AI is pioneering the audio data research landscape. We adopt a rigorous R&D methodology for developing datasets that parallels the standards upheld by leading AI laboratories. Our vision is to seamlessly integrate AI into everyday experiences, with audio serving as the perfect conduit. The evolution of audio AI is rapidly unfolding, yet the availability of high-quality training data remains a critical challenge. This is where David AI steps in.Founded in 2024 by a talented group of former engineers and operators from Scale AI, we have quickly become a trusted partner to numerous FAANG companies and AI research labs. Recently, we secured $50 million in a Series B funding round with notable investors, including Meritech, NVIDIA, and Alt Capital.Our culture is built on sharp intellect, humility, ambition, and a close-knit community. We invite exceptional minds in research, engineering, product development, and operations to join us as we advance the field of audio AI.Research Team OverviewAt David AI, we are convinced that superior model capabilities stem from high-quality, differentiated data. Our research team is dedicated to conducting ambitious, long-term studies into audio technology while collaborating with both internal and external partners to implement cutting-edge research insights into practical applications.Your Role as a Founding Audio AI Research EngineerIn this position, you will establish the research framework that influences how premier AI labs develop their audio models. You will have access to a top-tier team of human AI trainers, robust computing resources, and the autonomy to shape your research agenda.Key ResponsibilitiesCreate and implement comprehensive evaluation frameworks for assessing audio AI capabilities in areas such as speech, emotion detection, conversational dynamics, and acoustic patterns.Investigate and prototype innovative methodologies for audio quality assessment, automated labeling, and optimizing data collection processes.Design focused data collection pipelines aimed at capturing novel, high-value audio capabilities.Develop automated systems for ongoing classifier enhancement and prompt engineering evaluation.Assess cutting-edge models and formulate actionable research strategies.Publish your findings in prestigious conferences.

Jun 24, 2025
Apply
companyThinking Machines Lab logo
Audio Research Specialist

Thinking Machines Lab

Full-time|$350K/yr - $475K/yr|On-site|San Francisco

At Thinking Machines Lab, our mission is to enhance humanity's potential through the advancement of collaborative general intelligence. We envision a future where everyone can access the knowledge and tools necessary to leverage AI for their individual needs and objectives.As a team of scientists, engineers, and innovators, we have developed some of the most widely utilized AI products, such as ChatGPT and Character.ai, along with open-weight models like Mistral, and popular open-source projects including PyTorch, OpenAI Gym, Fairseq, and Segment Anything.Role OverviewAt Thinking Machines, we adopt a multimodal-first approach, where multimodality is integral to our scientific goals and infrastructure. We are seeking skilled researchers to push the boundaries of audio capabilities. In this role, you will delve into how audio models can facilitate more natural and efficient communication and collaboration by preserving information and accurately capturing user intent.This position requires strong collaboration across pre-training, post-training, and product development with top-tier researchers, infrastructure engineers, and designers. Here, you will have the chance to influence the foundational capabilities of AI systems that will be utilized by millions globally.This role marries fundamental research with practical engineering, as we do not separate these functions internally. You will be expected to write high-performance code as well as engage with technical reports. It is an ideal position for someone who enjoys both extensive theoretical research and hands-on experimentation while laying the groundwork for how AI learns.Note: This is an evergreen role that remains open continuously to gauge interest in this research area. We receive numerous applications, and there may not always be an immediate match for your skills and experience. Nevertheless, we encourage you to apply, as we routinely review applications and reach out as new opportunities arise. You are welcome to reapply if you gain additional experience, but please avoid applying more frequently than every six months. Occasionally, we also post specific roles for distinct project or team needs, in which case you are welcome to apply directly alongside an evergreen submission.

Nov 23, 2025
Apply
companyCanva logo
Full-time|On-site|San Francisco

Join Canva as a Staff Research Scientist specializing in Video & Audio Generative AI. In this pivotal role, you will leverage your expertise in AI to develop innovative solutions that enhance our platform's multimedia capabilities. Collaborate with cross-functional teams to push the boundaries of AI technology, driving impactful projects that redefine user experiences.

Dec 18, 2025
Apply
company
Full-time|On-site|San Francisco

Join David AI as a Product ManagerDavid AI is pioneering the audio data research landscape, harnessing an R&D methodology to develop high-quality datasets that support AI models. Our vision is to seamlessly integrate AI into everyday experiences, with audio serving as the perfect entry point. As the demand for advanced audio AI solutions grows, the necessity for superior training data becomes paramount, and David AI is at the forefront of this evolution.Founded in 2024 by a talented team of former engineers from Scale AI, we’ve quickly established ourselves as a trusted partner for leading FAANG companies and AI labs. Our recent $50M Series B funding round, led by top-tier investors including Meritech and NVIDIA, is a testament to our trajectory and potential.We pride ourselves on our team’s intelligence, humility, and ambition. If you are passionate about research, engineering, product development, or operations, we invite you to contribute to our mission of advancing audio AI.The Product TeamOur Product team operates at the intersection of research, engineering, and operations, crafting audio data products that empower cutting-edge machine learning models. We translate groundbreaking research into scalable and thoughtful data products that prioritize quality, coverage, and clarity for model creators.Your RoleAs a Product Manager on our Product team, you will take charge of the strategy and execution of key components within our audio data portfolio. Collaborating closely with the Research, ML, Engineering, and Operations teams, you will convert model requirements into actionable data roadmaps, quality standards, and deliverables ready for clients.Key ResponsibilitiesSpecialize in areas such as data roadmap expansion, quality metrics, and evaluation based on your interests and background.Lead product strategy and development for audio data offerings, guiding them from research to full-scale production.Convert research insights into specific requirements for data collection, quality assurance, and documentation.Work in partnership with Research, ML, Engineering, and Operations teams to operationalize new datasets and capabilities.Establish metrics and success criteria to prioritize initiatives and evaluate impact across quality, coverage, cost, and speed.

Dec 30, 2025
Apply
companyTavus logo
Full-time|On-site|San Francisco

About UsTavus is an innovative research lab at the forefront of human computing technology. Our mission is to create AI Humans—advanced interfaces that bridge the gap between individuals and machines, eliminating the friction found in current systems. Our real-time human simulation models empower machines to see, hear, respond, and appear realistic, facilitating genuine, face-to-face conversations. With AI Humans, we blend the emotional intelligence inherent in humans with the extensive reach and reliability of machines, enabling them to serve as capable and trusted agents available 24/7, capable of communicating in any language.Envision a therapist accessible to everyone, a personal trainer that tailors sessions to your schedule, or a fleet of medical assistants dedicated to providing personalized attention to every patient. Tavus enables individuals, enterprises, and developers to create AI Humans that connect, empathize, and act with understanding on a large scale.Backed by prestigious investors like Sequoia Capital, Y Combinator, and Scale Venture Partners, we are a Series A company ready to shape the future of human-machine interaction.Join us in transforming a future where humans and machines genuinely comprehend one another.The RoleWe are seeking a passionate AI Researcher to join our core AI team and advance the science of audio-visual avatar generation. If you thrive in dynamic startup environments, enjoy experimenting with generative models, and are excited to see your research translated into production, you will find a welcoming home here.Your Mission Conduct research and develop cutting-edge audio-visual generation models for conversational agents (e.g., Neural Avatars, Talking Heads).Focus on models that intricately align with conversation flows, ensuring seamless integration of verbal and non-verbal cues.Experiment with diffusion models (DDPMs, LDMs, etc.), long-video generation, and audio synthesis.Collaborate closely with the Applied ML team to transition your research into practical applications.Stay updated on the latest breakthroughs in multimodal generation and contribute to the evolution of this field.You Will Excel If You Have:A PhD (or nearing completion) in a relevant discipline, or equivalent hands-on research experience.Proficiency in applying image/video generation techniques and a solid understanding of machine learning principles.

Oct 16, 2025
Apply
company
Full-time|On-site|San Francisco

About Liquid AILiquid AI, a pioneering company spun out of MIT CSAIL, is at the forefront of developing general-purpose AI systems that operate efficiently across various platforms, from data center accelerators to on-device hardware. Our commitment to low latency, minimal memory usage, privacy, and reliability allows us to partner with some of the most esteemed enterprises in consumer electronics, automotive, life sciences, and financial services. As we experience rapid growth, we are seeking exceptional talent to join our innovative journey.The OpportunityJoin our cutting-edge Audio team, where we are developing advanced speech-language models capable of handling Speech-to-Text (STT), Text-to-Speech (TTS), and speech-to-speech tasks within a unified architecture. This pivotal role supports applied audio model development, directly collaborating with the technical lead to deliver production systems that operate on-device under real-time constraints. You will take ownership of key workstreams encompassing data pipelines, evaluation systems, and customer deployments. If you are eager to tackle unique technical challenges within a small, elite team where your contributions are impactful, this is the role for you.What We're Looking ForWe are seeking an individual who:Builds first, theorizes later: You prioritize shipping working systems over theoretical models; production-grade code is your default.Owns outcomes end-to-end: You take full responsibility for everything from data pipelines to customer deployments and don't shy away from challenges.Thrives under constraints: On-device, low-latency, memory-constrained environments motivate you. You view constraints as opportunities for innovative design.Ramps quickly on new territory: You are comfortable closing knowledge gaps swiftly and actively seek feedback to drive results.The WorkDevelop and scale data pipelines for audio model training, including preprocessing, augmentation, and quality filtering at scale.Design, implement, and maintain evaluation systems that assess multimodal performance across both internal and public benchmarks.Fine-tune and adapt audio models to cater to customer-specific use cases, taking charge from requirement gathering through to deployment.Contribute production code to the core audio repository while collaborating closely with infrastructure and research teams.Facilitate experimentation under real hardware constraints, transitioning smoothly between customer-focused projects and core development initiatives.

Dec 16, 2025
Apply
companyZyphra logo
Full-time|On-site|San Francisco

Zyphra is an innovative artificial intelligence company located in the heart of San Francisco, California.The Opportunity:Join our dynamic team as a Research Engineer - Audio & Speech Models, where you will play a pivotal role in advancing Zyphra’s Audio Team. You will be instrumental in developing cutting-edge open-source text-to-speech and audio models. Your contributions will span the full spectrum of the model training process, from data collection and processing to the design of innovative architectures and training approaches.Your Responsibilities:Conduct large-scale audio training operationsOptimize the performance of our training infrastructureCollect, process, and evaluate audio datasetsImplement architectural and methodological improvements through rigorous testingWhat We Seek:A strong research mindset with the ability to navigate projects from ideation to implementation and documentation.Proficiency in rapid prototyping and implementation, allowing for swift experimentation.Effective collaboration skills in a fast-paced research environment.A quick learner who is eager to embrace and implement new concepts.Excellent communication abilities, enabling you to contribute to both research and engineering tasks at scale.Preferred Qualifications:Expertise in training audio models, such as text-to-speech, ASR, speech-to-speech, or emotion recognition.Experience with training audio autoencoders.Solid understanding of signal processing, particularly in audio.Familiarity with diffusion models, consistency models, or GANs.Experience with large-scale (multi-node) GPU training environments.Strong understanding of experimental methodologies for conducting rigorous tests and ablations.Interest in large-scale, parallel data processing pipelines.Competence in PyTorch and Python programming.Experience contributing to large, established codebases with rapid adaptation.

Aug 28, 2025
Apply
companybland logo
Full-time|On-site|San Francisco

bland is looking for a Machine Learning Researcher with a focus on audio. This position is based in San Francisco and centers on advancing how machines process and understand sound. The team works on pushing the boundaries of audio technology for a range of platforms. Responsibilities Research and develop new machine learning techniques for audio applications Contribute to projects that improve audio processing and analysis Collaborate with colleagues to bring research ideas into real-world audio products Location This role requires working onsite in San Francisco.

Apr 20, 2026
Apply
company
Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is pioneering the voice input landscape for the AI era. By training our own models and creating deep OS integrations, we ensure that voice technology is executed flawlessly across the entire stack.The evolution of work is upon us; individual contributor roles are transforming as we now manage AI agents, a task that naturally aligns with voice interactions.We are not focused on traditional conversational agents; instead, we champion a voice-in-text-out (VITO) approach, believing that voice functionality should operate above traditional applications, and that innovative startups can lead in this domain.With an unwavering commitment to this vision, our progress has been promising, and we invite you to be a part of our journey.The RoleAs a growing team, you will have the opportunity to engage with all facets of our system.Recent projects include:Development of a real-time transcription server capable of managing thousands of concurrent audio streams.Training and deploying custom speech recognition models.Creating native integrations for macOS and Windows utilizing advanced system APIs.Technology StackFrontend: TypeScript, React, Next.js, ElectronBackend: Python, real-time server (Bun/Node.js), WebSocketsNative: Swift (macOS), C# (Windows)Machine Learning: Custom speech recognition models, inference pipelinesInfrastructure: Terraform, StripePrior experience with all these technologies is not required.Qualifications- Showcase your previous work and projects.- Adaptable in switching between programming languages and domains.- Capable of taking ownership of projects from concept to production.- Proficient in writing clean and maintainable code.- Experience with production systems is essential.- A positive attitude and a collaborative spirit are a must!

Jan 28, 2026
Apply
companyVapi logo
Full-time|$240K/yr - $240K/yr|On-site|San Francisco

About Vapi:At Vapi, we are pioneering the transition to voice as the primary interface for human interaction with technology.Our platform stands out as the most customizable solution for deploying intelligent voice agents.In just two years, we have attracted over 600,000 developers, welcoming more than 2,000 new members daily.Experience Vapi's capabilities firsthand!Role Overview:As our enterprise clients implement more sophisticated voice AI solutions, we are seeking a Senior Technical Advisor to provide the expert architectural guidance necessary for successful deployments.This position is critical for driving expansion revenue and ensuring high-quality deployments, ultimately leading to enhanced customer satisfaction and success.Responsibilities:First 30 Days:Gain an in-depth understanding of Vapi’s platform architecture, APIs, and common enterprise application scenarios.Observe customer architecture meetings and analyze current deployment strategies.Collaborate with Deployment Strategists and FDEs to familiarize yourself with existing scoping workflows and documentation practices.Start participating in architecture reviews for ongoing enterprise projects.Next 60 Days:Lead technical scoping for new use cases, including the development of new agents, workflows, and integrations.Create clear architectural diagrams and scoping documents to assist FDE teams in execution.Conduct comprehensive architecture reviews to identify optimization possibilities and scalability challenges.Work closely with SEs during complex sales processes to assess feasibility and facilitate seamless post-sale transitions.After 90 Days:Take full ownership of solution design for high-value enterprise clients.Drive expansion revenue by uncovering architectural opportunities that align with commercial objectives.Develop and share reusable architectural reference designs for future projects.

Feb 12, 2026
Apply
company
Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is pioneering the voice input technology for the AI era. Our unique approach involves training proprietary models and establishing deep integrations with operating systems, ensuring optimal voice interaction capabilities.The landscape of work is evolving; the traditional role of an individual contributor is being replaced by the management of AI agents, making voice interaction a natural fit for this new paradigm.Rather than developing mere conversational agents, we envision a more intuitive method of computer interaction—voice input leading to text output (VITO). We assert that voice technology should operate at a foundational level, allowing a small, agile company like ours to innovate and excel.We are passionately committed to this mission, and our initial results have been promising. Join us in shaping the future!The OpportunityWe are looking for an iOS Engineer to help us build Aqua Voice from the ground up. This greenfield project offers you the chance to shape the mobile experience entirely.Your Contributions Will Include:Developing a native iOS application with system-level voice input features (including keyboard extensions, Shortcuts, and Siri integration).Implementing real-time audio capture and streaming to our transcription backend.Creating deep iOS integrations to enhance user experience.This role requires a focus on native performance and comprehensive system integration—this is not just a web view wrapper.Technology StackSwift (SwiftUI + UIKit where applicable)Audio Technologies: AVFoundation, Audio Unit, real-time streamingNetworking: WebSockets, streaming protocolsSystem Integrations: Keyboard extensions, Shortcuts, accessibility APIs

Jan 28, 2026
Apply
company
Full-time|On-site|San Francisco

About Aqua VoiceAqua Voice is pioneering the voice input layer for the AI era. We develop our proprietary models and create deep operating system integrations, as mastering voice interaction necessitates comprehensive control over the entire technology stack.The landscape of work is evolving. Individual contributor roles have transformed; now you will manage AI agents, a task that is ideally suited for voice technology.We are not in the business of creating simple conversational agents. Our vision is that the most intuitive interaction with computers is through voice input with text output (VITO). We advocate for voice integration at a fundamental level, confident that a nimble company can lead innovation in this space.Our commitment to this vision is unwavering, and we have already made significant strides. We invite you to become a part of this exciting journey.The OpportunityAs we witness growing enterprise interest—teams eager to integrate Aqua into their workflows—you will lead these initiatives.Full-Cycle Sales Ownership: From initial conversation to contract signing, you will manage the entire sales process without handoffs.Land and Expand Strategy: Begin with key champions within organizations, and grow our presence across entire teams.Navigating Enterprise Processes: You will adeptly handle security reviews, procurement, and legal considerations, drawing on your previous experience.Defining Sales Methodology: As a key early-stage hire, you will help shape how enterprise sales are conducted at Aqua Voice.Collaboration with Product Teams: Your insights into enterprise needs will directly influence our product roadmap.

Jan 28, 2026
Apply
company
Full-time|On-site|San Francisco

Join Our Team at Aqua VoiceAqua Voice is at the forefront of developing the voice input layer for the AI era. Our innovative approach involves training proprietary models and establishing deep OS integrations, ensuring we master the entire voice technology stack.The nature of work is evolving, and we’re transitioning from traditional individual contributor roles to managing AI agents. This shift creates a unique opportunity for voice technology to thrive.Rather than simply creating conversational agents, we envision an interaction model where voice input leads to text output (VITO). We believe voice should operate at a higher level than applications, enabling a smaller company like ours to lead the charge.Our passionate commitment to this vision has yielded promising results, and we invite you to be a part of our journey.Your RoleAs a Sales Development Representative, you will play a critical role in building our enterprise pipeline by pinpointing ideal companies and connecting us with key decision-makers.Research and Targeting: Identify companies and teams that align with our profile.Outbound Engagement: Utilize email, LinkedIn, and creative strategies to reach out—prioritizing quality over quantity.Qualification: Assess client needs, budgets, and timelines to set Account Executives up for success.User Expansion: Leverage existing contacts to open doors at potential client companies.Iterative Learning: Analyze messaging and channel effectiveness to refine our approach.This position serves as a pathway to Account Executive roles for those interested in advancing their career.

Jan 28, 2026
Apply
company
Full-time|On-site|San Francisco

Join Our TeamAqua Voice is pioneering the voice interaction layer for the AI-driven era. We develop our own models and create deep OS integrations, as mastering voice technology necessitates control over the entire stack.The landscape of work is evolving rapidly. It's no longer just about individual contributions; now, you manage AI agents, and voice interaction is ideally suited for this new paradigm.We are not in the business of crafting conversational agents. Our vision is that the most intuitive way to engage with your computer is through voice commands with text responses (VITO). We believe that voice technology should operate at a layer above applications, and as a small company, we have the potential to push boundaries in this area.With relentless dedication, we are making great strides, and we invite you to be part of this journey.Your RoleWe are seeking a talented designer who is also proficient in coding.You will take the reins on design for our desktop applications, web platforms, and marketing collateral. You will implement your designs using React, TypeScript, and CSS.This is primarily a design-focused position. We prioritize your aesthetic sense over the speed of engineering.

Jan 28, 2026
Apply
company
Full-time|On-site|San Francisco Bay Area

About Retell AI Retell AI builds voice AI technology that helps businesses transform their call center operations. In just 18 months, thousands of companies have adopted Retell’s AI voice agents to streamline sales, support, and logistics, work that once required large human teams. Backed by investors including Y Combinator and Alt Capital, Retell has grown annual recurring revenue from $5M to $36M with a focused team of 20. The company’s goal for 2026: a modern customer experience platform where AI powers entire contact centers. Retell is developing AI “workers” that can serve as frontline agents, quality assurance analysts, and managers, handling, evaluating, and improving customer interactions on their own. Named a top 50 AI app by a16z: https://tinyurl.com/5853dt2x Ranked #4 on Brex’s Fast-Growing Software Vendors of 2025: https://www.brex.com/journal/brex-benchmark-december-2025 Featured on the Lean AI Leaderboard: https://leanaileaderboard.com/ Role Overview: Research Scientist – LLM Retell AI is hiring a Research Scientist focused on large language models (LLMs) and audio processing. This role suits machine learning researchers who want to push the boundaries of real-time AI and see their work in production. What You Will Do Investigate new approaches in large language models and audio processing for human-like voice agents Design and implement evaluation methods for complex, real-world conversational systems Prototype systems to improve reasoning, reduce latency, and enhance conversation quality Work closely with engineering and product teams to bring research advances into production Impact Research at Retell directly shapes the capabilities of voice AI agents for thousands of businesses. The work blends advanced research with practical deployment, improving how customers interact with automated systems across industries. Location This position is based in the San Francisco Bay Area.

Apr 14, 2026
Apply
companyBaseten logo
Full-time|On-site|San Francisco

Baseten creates AI inference solutions for clients such as Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. The team blends AI research, infrastructure, and developer tools to help organizations deploy advanced models. Backed by $300M in Series E funding from BOND, IVP, Spark Capital, Greylock, and Conviction, Baseten is expanding quickly and shaping the landscape for engineers building AI products. Role overview The Software Engineer - Voice AI role centers on building and deploying open-source voice models for real-world use. Voice is becoming a key interface across the web, and this position addresses the technical challenges of bringing production-ready Voice AI to market. The work supports applications in productivity, customer service, clinical dialogue, creator tools, education, and more, helping to change how people interact with technology across sectors. This engineer leads Baseten’s Voice AI efforts, guiding the proprietary inference stack that powers Voice AI models. The role balances shaping the product roadmap with hands-on engineering. Collaboration is a core part of the job, working closely with Forward Deployed Engineers, Model Performance Engineers, and other technical teams to advance Voice AI capabilities. Sample projects and initiatives The world's fastest Whisper, with streaming and diarization Canopy Labs selects Baseten for Orpheus TTS inference Partnering with the Core Product team to build an orchestration framework for a multi-model voice agent Working with the Training Platform team to support ongoing training of voice models Designing a developer-friendly API and SDK to encourage self-service adoption of Baseten Voice AI products Location San Francisco

Apr 26, 2026
Apply
companyAmbience Healthcare logo
AI Researcher in Healthcare

Ambience Healthcare

Full-time|$205K/yr - $300K/yr|Hybrid|San Francisco

About Us:At Ambience Healthcare, we are not just another scribe service; we are pioneering an AI intelligence platform designed to bring humanity back to healthcare while delivering substantial ROI for health systems nationwide.Our innovative technology empowers healthcare providers to concentrate on exceptional patient care by alleviating the administrative tasks that distract them from their vital responsibilities. We offer real-time, coding-aware documentation and clinical workflow support across ambulatory, emergency, and inpatient settings at leading health systems throughout North America.Our teams are driven by a strong sense of ownership and dedication to creating the best solutions for our partners within the health system. We prioritize honesty, positivity, and thoughtful discourse, holding each other to high standards because we understand the significance of the challenges we tackle.Ambience has received numerous accolades, including being ranked #1 for Enhancing the Clinician Experience in the KLAS Research Emerging Solutions Top 20 Report, recognized by Fast Company as one of the Next Big Things in Tech, honored as one of the best AI companies in healthcare by Inc., and selected as a LinkedIn Top Startup in both 2024 and 2025. We are proudly backed by notable investors like Oak HC/FT, Andreessen Horowitz (a16z), OpenAI Startup Fund, and Kleiner Perkins — and our journey is just beginning.The Role:Become part of a vibrant team that is transforming healthcare through AI-driven solutions that boost clinician productivity and minimize administrative workload. This position combines clinical knowledge with technical problem-solving and the application of advanced AI technologies, aiding in the development of precise and efficient medical documentation, coding, and various AI tools that empower clinicians.Preferred: Willingness to work 3 days a week from our San Francisco office.What You'll Do:Prototype and test cutting-edge models, techniques, and architectures in collaboration with clinicians and technical teams to enhance user experience and functionality.Design experiments, establish success criteria, and create roadmaps for new products and features.Assess and improve AI models to ensure clinical accuracy and dependability.Curate and audit datasets to guarantee high-quality, representative training and testing materials.Analyze feedback and address issues with AI outputs using clinical insights and analytical skills.Engage in continuous learning and development in AI technologies and clinical practices.

Mar 10, 2026
Apply
companyHex Technologies logo
AI Research Engineer

Hex Technologies

Full-time|$150.4K/yr - $285K/yr|On-site|SF or NYC

About the Role Hex Technologies is at the forefront of the AI revolution, providing an innovative platform that transforms modern Data Science and Data Analytics workflows. As an AI Research Engineer, you will collaborate with product teams to create cutting-edge AI experiences, including the Notebook Agent. Your responsibilities will include conducting experiments, fine-tuning models, deploying AI infrastructure, and developing robust experimentation tools. Your primary focus will be enhancing Hex's context engine and advancing the capabilities of our Notebook Agent, designed for professionals engaged in complex and impactful data tasks. The Notebook Agent serves as a sophisticated data copilot, capable of writing SQL and Python, crafting visually stunning reports, and collaborating with analysts to explore new data inquiries. Your efforts will help data teams within Hex deliver highly accurate and tailored data experiences for their stakeholders, empowering data-driven decision-making across the organization. If you are a passionate builder eager to amplify these capabilities for thousands of users, join us on the leading Data Science platform with unparalleled user context.

Mar 17, 2026

Sign in to browse more jobs

Create account — see all 5,624 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.