company

Machine Learning Engineer - Embedding

Twelve LabsSeoul, South Korea
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

QualificationsResearch or development experience in computer vision, natural language processing, or multimodal learning. Strong proficiency in Python and PyTorch, with hands-on experience in large-scale distributed model training. Familiarity with embedding models, vector search systems, and advanced machine learning methodologies.

About the job

About Us

Join us in setting a global standard for AI in video understanding!

At Twelve Labs, we are developing state-of-the-art AI models specialized in video processing, enabling advanced search, analysis, summarization, and insight generation from vast amounts of video data.

Our models are utilized by the world's largest sports leagues to swiftly and accurately highlight content from extensive match footage, delivering an ultra-personalized viewing experience. Additionally, integrated control centers in South Korea leverage our technology for efficient CCTV video exploration to respond to crises swiftly. Major broadcasters and studios worldwide harness our models to create content for billions of viewers.

Twelve Labs, a deep tech startup with offices in San Francisco and Seoul, has been recognized as one of the world's top 100 AI startups by CB Insights for four consecutive years. We have secured over $110 million in investments from leading VCs and corporations such as NVIDIA, NEA, Index Ventures, Databricks, and Snowflake. Our AI model, uniquely developed in Korea, is available through Amazon Bedrock. We thrive on collaboration with exceptional peers to create innovative products and grow alongside our global clientele.

Our core values include:

  • Honesty and reflection regarding ourselves and our team.
  • A spirit of resilience and humility that doesn’t fear failure or feedback.
  • A commitment to continuous learning to enhance team capabilities together.

If you enjoy solving challenging problems and growing through collaboration, your opportunity awaits at Twelve Labs.

About the Team

Our team focuses on multimodal representation learning and production serving. We integrate various modalities like video, audio, and text into a unified embedding space, ensuring stable serving to thousands of global customers.

We conduct experiments on multimodal embedding models within a large-scale distributed learning environment, taking responsibility for the end-to-end process of transforming research findings into real-time inference systems. Leveraging top-tier GPU resources such as the NVIDIA B300, we minimize the transition cycle from research to production.

In our rapid development cycles, we collaborate closely with Research, Product, and Infrastructure teams to create significant technical impacts, delivering research outcomes to customers globally within months.

Role Overview

Key Responsibilities

  • Design and optimize large-scale distributed learning pipelines for multimodal embedding models.
  • Enhance the inference performance of embedding models in production environments (throughput, latency, cost-efficiency).
  • Design and build vector search systems and embedding serving infrastructure.
  • Improve and automate the ML pipeline covering model development, training, and serving to facilitate rapid transitions to production.
  • Address applied research challenges such as data filtering and evaluation metric design to enhance model quality and user experience.
  • Explore and experiment with AI-based development tools like Claude and Gemini to boost development productivity.
  • Collaborate closely with Research, Product, and Infrastructure teams, taking ownership of the end-to-end process to deliver models to actual customers.

Ideal Candidate Profile

  • Research or development experience in computer vision, natural language processing, or multimodal learning.
  • Proficiency in Python and PyTorch, with experience in model training within large-scale distributed environments.
  • Experience with embedding models, vector search systems, and advanced ML techniques.

About Twelve Labs

Twelve Labs is a pioneering deep tech startup dedicated to advancing AI for video understanding. With our innovative models, we empower industries across the globe to harness the power of video data effectively. Our commitment to excellence and rapid development cycles has positioned us as a leader in the AI space.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.