About the job
Who We Are
We are on the lookout for exceptional talent to help shape the global standard of video understanding AI!
At TwelvaLabs, we are creating world-leading AI models specifically designed to process vast video datasets, enabling specialized functionalities such as search, analysis, summarization, and insight generation.
Our models are utilized by the largest sports leagues globally to swiftly and accurately curate highlights from extensive game footage, providing an ultra-personalized viewing experience. In South Korea, integrated command centers rely on TwelvaLabs to efficiently navigate CCTV footage for rapid crisis response, while major broadcasters and studios worldwide leverage our models to produce content for billions of viewers.
TwelvaLabs, a deep tech startup with offices in San Francisco and Seoul, has been recognized for four consecutive years as one of the top 100 AI startups globally by CB Insights. We have secured over $110 million in investments from premier VCs and companies including NVIDIA, NEA, Index Ventures, Databricks, and Snowflake. Our AI models, uniquely developed in Korea, are exclusively serviced through Amazon Bedrock. We are committed to creating innovative products alongside outstanding colleagues and growing together with our global clientele.
At TwelvaLabs, we operate based on core values that include:
A commitment to honesty and self-reflection about ourselves and our team
Resilience and humility in the face of failure and feedback
A dedication to continuous learning to enhance the team's capabilities
If you enjoy solving challenging problems and growing through the process, the opportunity awaits you here at TwelvaLabs.
About the Team
You will be joining the team responsible for multimodal representation learning and production serving at TwelvaLabs. We develop models that integrate various modalities such as video, audio, and text into a unified embedding space, reliably serving them in production systems used by thousands of clients worldwide.
Our responsibilities include conducting experiments on multimodal embedding models within a large-scale distributed learning environment and transitioning research outcomes into real-time inference systems. With access to top-tier GPU resources like the NVIDIA B300, we minimize the transition cycles from research to production.
Our research results reach global clients within months, and we collaborate closely with the Research, Product, and Infrastructure teams to create significant technical impacts.
About the Role
As a Senior Machine Learning Engineer on the Embedding & Search team, you will play a pivotal role in developing and owning critical components of TwelvaLabs' search and retrieval platform. This platform integrates vector search, lexical retrieval, and reranking to deliver fast, accurate, and scalable search experiences for our clients.
This position demands expertise in systems-heavy ML engineering, focusing on information retrieval, ML serving, and distributed systems. We seek a strong engineer capable of deconstructing well-defined problems with moderate ambiguity into concrete milestones, delivering reliable and high-performance solutions.
Key Responsibilities
Develop and manage essential subsystems of our search platform on EKS, including vector indexing (ANN), lexical retrieval, hybrid fusion, reranking, and temporal (segment-level) search.
Enhance retrieval performance across vector and lexical paths at a scale ranging from millions to billions.

