About the job
About David AI
David AI is pioneering audio data research, bringing to dataset development the same rigorous R&D approach that AI labs bring to model creation. Our goal is to integrate AI seamlessly into real-world applications, and audio is the perfect entry point because of its versatility and human connection. As the audio AI field progresses, high-quality training data becomes critical, and that's where David AI excels.
Founded in 2024 by a team of experienced engineers and operators from Scale AI, David AI has quickly gained traction, serving prominent clients among FAANG companies and AI research labs. Recently, we secured a $50M Series B funding round from esteemed investors including Meritech, NVIDIA, Jack Altman (Alt Capital), Amplify Partners, and First Round Capital.
Our team embodies intelligence, humility, ambition, and a close-knit ethos. We are on the lookout for exceptional talent in research, engineering, product, and operations to join us in advancing the frontiers of audio AI.
About Our Engineering Team
At David AI, our engineering team is responsible for constructing the pipelines, platforms, and models that convert raw audio into valuable data for top AI labs and enterprises. We pride ourselves on our collaborative environment, comprising product engineers, infrastructure specialists, and machine learning experts dedicated to leading the charge in audio data research.
We operate at a fast pace and own our projects from conception through production. Our team builds real-time processing pipelines that handle terabytes of audio data daily, and deploys innovative generative audio models.
About This Role
As a Product Engineer at David AI, you will design and implement state-of-the-art tools that enable our users to leverage audio data effectively for training their AI models. You will collaborate closely with researchers to continuously refine our data collection methodologies.
Your Responsibilities
- Deliver full-stack features used by thousands of users every day.
- Build scalable data processing pipelines that extract actionable insights from terabytes of audio data each day.
- Build, deploy, and evaluate LLM- and DSP-based solutions that help our clients understand intricate features within our datasets.
- Iterate rapidly on research hypotheses, working with researchers and the operations team to deploy improvements efficiently.

