About the job
Who We Are
At Twelve Labs, we are on a mission to set global standards in video understanding AI! We are crafting state-of-the-art AI models that efficiently process vast amounts of video data, offering specialized features for search, analysis, summarization, and insight generation.
Our models are utilized in the world’s largest sports leagues, enabling rapid and accurate highlight selection from extensive game footage, thus providing a hyper-personalized viewing experience. In domestic integrated control centers, we collaborate with Twelve Labs to efficiently navigate CCTV footage for rapid crisis response, while major global broadcasters and studios harness our models to create content for billions of viewers.
Headquartered in San Francisco and Seoul, Twelve Labs is a Deep Tech startup recognized as one of the world's top 100 AI startups by CB Insights for four consecutive years. We have secured over $110 million in funding from renowned investors such as NVIDIA, NEA, Index Ventures, Databricks, and Snowflake. Our AI model, uniquely developed in Korea, is the only one offered through Amazon Bedrock. We thrive on innovation and collaboration, building extraordinary products alongside exceptional colleagues to grow with our global clientele.
At Twelve Labs, we uphold core values centered around:
Honesty and reflection towards oneself and the team.
Courage and humility to embrace failure and feedback.
A commitment to continuous learning and enhancing team capabilities.
If you enjoy solving challenging problems and growing through the process, we invite you to explore opportunities at Twelve Labs.
About the Team
The ML Data Team at Twelve Labs operates under the belief that data determines the performance of AI models. We are responsible for building high-quality data end-to-end for training and evaluating multimodal AI models. Our team collects, filters, processes, and labels various types of multimodal data, including video, images, and audio, while collaborating with different teams to design learning datasets that unlock new model capabilities. We also create evaluation datasets that reflect real user experiences and develop internal tools to efficiently manage this entire process.
The ML Data Team plays a critical role in the development of Twelve Labs’ world-class video understanding models through our meticulously designed data pipelines.
About the Role
The Software Engineer - Data Acquisition focuses on the collection and acquisition of data to support the training of Twelve Labs' multimodal AI models.
This position involves assisting in obtaining, organizing, and processing diverse data types, including video, audio, images, and text, while developing the necessary data pipelines and tools.
We seek individuals who understand the entire data lifecycle, especially those who can showcase engineering capabilities in effectively acquiring and preparing data.
In This Role, You Will
Build and operate data pipelines that support the collection, preprocessing, refinement, filtering, and labeling of large multimodal datasets (video, images, audio) for LLM/VLM training.
Implement and enhance data collection libraries and services that deliver real impact.
Collaborate with various teams to understand project objectives and priorities, actively participating throughout the planning, development, and operational phases.
You May Be a Good Fit if You Have
Proficiency in Python, with experience ranging from simple scripting to data processing and backend service development.
Experience with web crawling, API integration, and data management.

