About the job
About David AI
David AI is a pioneering audio data research company that applies a rigorous R&D framework to data development, akin to that of leading AI labs. Our mission is to seamlessly integrate AI into everyday life, recognizing that audio serves as a natural interface. As the demand for advanced audio AI grows, the need for high-quality training data becomes critical—this is where David AI excels.
Founded in 2024 by a talented team of former Scale AI engineers and operators, we have rapidly secured partnerships with major FAANG companies and AI laboratories. Recently, we raised $50M in Series B funding from prestigious investors like Meritech, NVIDIA, Jack Altman (Alt Capital), Amplify Partners, First Round Capital, and others.
Our team is composed of sharp, humble, and ambitious individuals who collaborate closely. We are eager to welcome the brightest minds in research, engineering, product, and operations to join us in advancing the frontiers of audio AI.
About our Data Operations Team
The Data Operations team is the powerhouse behind David AI's Data Factory, converting raw audio into exceptional training datasets for top AI labs. Our goal is to develop and manage new data pipelines on a grand scale.
This entails starting with a model capability we aim to unlock, experimenting with various data shapes and collection strategies, and validating these approaches with researchers. Once we identify a successful method, we industrialize it by creating reliable, efficient, and high-quality audio processing pipelines. Our team thrives in ambiguous environments, adept at both prototyping innovative workflows and managing extensive production systems.
About This Role
As the Data Product Operations Lead, you will be instrumental in propelling the Data Factory at David AI. Your responsibilities will include designing and scaling pipelines that transform raw audio into valuable datasets for leading AI labs. You will take full ownership of data products from initial prototypes to large-scale implementations, actively building workflows, validating them with researchers, and ensuring their reliability at production levels.
In This Role, You Will
- Oversee the complete success of a data pipeline, from initial experiments to scaling up production systems that produce high-quality audio data in significant volumes.
- Design and operate pipelines that maintain reliability, quality, and efficiency while processing an extensive volume of audio data.

