About the job

Who We Are

At Twelve Labs, we are on a mission to set global standards in video understanding AI! We are crafting state-of-the-art AI models that efficiently process vast amounts of video data, offering specialized features for search, analysis, summarization, and insight generation.

Our models are utilized in the world’s largest sports leagues, enabling rapid and accurate highlight selection from extensive game footage, thus providing a hyper-personalized viewing experience. In domestic integrated control centers, we collaborate with Twelve Labs to efficiently navigate CCTV footage for rapid crisis response, while major global broadcasters and studios harness our models to create content for billions of viewers.

Headquartered in San Francisco and Seoul, Twelve Labs is a Deep Tech startup recognized as one of the world's top 100 AI startups by CB Insights for four consecutive years. We have secured over $110 million in funding from renowned investors such as NVIDIA, NEA, Index Ventures, Databricks, and Snowflake. Our AI model, uniquely developed in Korea, is the only one offered through Amazon Bedrock. We thrive on innovation and collaboration, building extraordinary products alongside exceptional colleagues to grow with our global clientele.

At Twelve Labs, we uphold core values centered around:

Honesty and reflection towards oneself and the team.
Courage and humility to embrace failure and feedback.
A commitment to continuous learning and enhancing team capabilities.

If you enjoy solving challenging problems and growing through the process, we invite you to explore opportunities at Twelve Labs.

About the Team

The ML Data Team at Twelve Labs operates under the belief that data determines the performance of AI models. We are responsible for building high-quality data end-to-end for training and evaluating multimodal AI models. Our team collects, filters, processes, and labels various types of multimodal data, including video, images, and audio, while collaborating with different teams to design learning datasets that unlock new model capabilities. We also create evaluation datasets that reflect real user experiences and develop internal tools to efficiently manage this entire process.

The ML Data Team plays a critical role in the development of Twelve Labs’ world-class video understanding models through our meticulously designed data pipelines.

About the Role

The Software Engineer - Data Acquisition focuses on the collection and acquisition of data to support the training of Twelve Labs' multimodal AI models.

This position involves assisting in obtaining, organizing, and processing diverse data types, including video, audio, images, and text, while developing the necessary data pipelines and tools.

We seek individuals who understand the entire data lifecycle, especially those who can showcase engineering capabilities in effectively acquiring and preparing data.

In This Role, You Will

Build and operate data pipelines that support the collection, preprocessing, refinement, filtering, and labeling of large multimodal datasets (video, images, audio) for LLM/VLM training.
Implement and enhance data collection libraries and services that deliver real impact.
Collaborate with various teams to understand project objectives and priorities, actively participating throughout the planning, development, and operational phases.

You May Be a Good Fit if You Have

Proficiency in Python, with experience ranging from simple scripting to data processing and backend service development.
Experience with web crawling, API integration, and data management.

About the job

Who We Are

At Twelve Labs, we uphold core values centered around:

Honesty and reflection towards oneself and the team.
Courage and humility to embrace failure and feedback.
A commitment to continuous learning and enhancing team capabilities.

If you enjoy solving challenging problems and growing through the process, we invite you to explore opportunities at Twelve Labs.

About the Team

The ML Data Team plays a critical role in the development of Twelve Labs’ world-class video understanding models through our meticulously designed data pipelines.

About the Role

The Software Engineer - Data Acquisition focuses on the collection and acquisition of data to support the training of Twelve Labs' multimodal AI models.

This position involves assisting in obtaining, organizing, and processing diverse data types, including video, audio, images, and text, while developing the necessary data pipelines and tools.

We seek individuals who understand the entire data lifecycle, especially those who can showcase engineering capabilities in effectively acquiring and preparing data.

In This Role, You Will

Build and operate data pipelines that support the collection, preprocessing, refinement, filtering, and labeling of large multimodal datasets (video, images, audio) for LLM/VLM training.
Implement and enhance data collection libraries and services that deliver real impact.
Collaborate with various teams to understand project objectives and priorities, actively participating throughout the planning, development, and operational phases.

You May Be a Good Fit if You Have

Proficiency in Python, with experience ranging from simple scripting to data processing and backend service development.
Experience with web crawling, API integration, and data management.

About the job

Who We Are

At Twelve Labs, we uphold core values centered around:

Honesty and reflection towards oneself and the team.
Courage and humility to embrace failure and feedback.
A commitment to continuous learning and enhancing team capabilities.

If you enjoy solving challenging problems and growing through the process, we invite you to explore opportunities at Twelve Labs.

About the Team

The ML Data Team plays a critical role in the development of Twelve Labs’ world-class video understanding models through our meticulously designed data pipelines.

About the Role

The Software Engineer - Data Acquisition focuses on the collection and acquisition of data to support the training of Twelve Labs' multimodal AI models.

This position involves assisting in obtaining, organizing, and processing diverse data types, including video, audio, images, and text, while developing the necessary data pipelines and tools.

We seek individuals who understand the entire data lifecycle, especially those who can showcase engineering capabilities in effectively acquiring and preparing data.

In This Role, You Will

Build and operate data pipelines that support the collection, preprocessing, refinement, filtering, and labeling of large multimodal datasets (video, images, audio) for LLM/VLM training.
Implement and enhance data collection libraries and services that deliver real impact.
Collaborate with various teams to understand project objectives and priorities, actively participating throughout the planning, development, and operational phases.

You May Be a Good Fit if You Have

Proficiency in Python, with experience ranging from simple scripting to data processing and backend service development.
Experience with web crawling, API integration, and data management.

About the job

Who We Are

At Twelve Labs, we uphold core values centered around:

Honesty and reflection towards oneself and the team.
Courage and humility to embrace failure and feedback.
A commitment to continuous learning and enhancing team capabilities.

If you enjoy solving challenging problems and growing through the process, we invite you to explore opportunities at Twelve Labs.

About the Team

The ML Data Team plays a critical role in the development of Twelve Labs’ world-class video understanding models through our meticulously designed data pipelines.

About the Role

The Software Engineer - Data Acquisition focuses on the collection and acquisition of data to support the training of Twelve Labs' multimodal AI models.

This position involves assisting in obtaining, organizing, and processing diverse data types, including video, audio, images, and text, while developing the necessary data pipelines and tools.

We seek individuals who understand the entire data lifecycle, especially those who can showcase engineering capabilities in effectively acquiring and preparing data.

In This Role, You Will

Build and operate data pipelines that support the collection, preprocessing, refinement, filtering, and labeling of large multimodal datasets (video, images, audio) for LLM/VLM training.
Implement and enhance data collection libraries and services that deliver real impact.
Collaborate with various teams to understand project objectives and priorities, actively participating throughout the planning, development, and operational phases.

You May Be a Good Fit if You Have

Proficiency in Python, with experience ranging from simple scripting to data processing and backend service development.
Experience with web crawling, API integration, and data management.

Software Engineer - Data Acquisition

Experience Level

Qualifications

About the job

About Twelve Labs

Direct Appointment Setter at Southern National Roofing | Columbia, MD

Project Superintendent

Community Support Lead Care Manager at Pacific Health Group | Remote

Physical Therapist at Performance Optimal Health | New Canaan

Part-Time In-Home Veterinarian

Sales Support Specialist at Golden Lighting | Tallahassee, FL

New Home Sales Consultant at LGI Homes | Lebanon, TN

Medical Director - Licensed Psychiatrist

Recruiting Coordinator - Join Our Innovative Team

Experienced Litigation Paralegal - Remote

Senior Director of Digital Communications

Nutritional Cook for Early Childhood Center

FMS Analyst at ACT1 Federal | Patuxent River, MD

Automotive Technician Opportunity at Citrus Kia

Software Security Analyst at TP-Link Systems Inc. | Irvine, California

Network Intrusion Detection Engineer - Active TS/SCI with CI Poly

Tax Associate - Private Client

Lead Behavior Technician - Full-Time Position

Local Roofing Sales Representative - Roof Restoration Specialist

Senior Director of Inventory and Merchandise Planning

Software Engineer - Data Acquisition

Experience Level

Qualifications

About the job

About Twelve Labs

Software Engineer - Data Acquisition

Experience Level

Qualifications

About the job

About Twelve Labs

Software Engineer - Data Acquisition

Experience Level

Qualifications

About the job

About Twelve Labs