About the job
About Us
At Twelve Labs, we are at the forefront of innovation, creating advanced multimodal foundation models that enable machines to understand videos in a human-like manner. Our groundbreaking approach to video-language modeling is setting new benchmarks, enhancing our ability to analyze and interact with diverse media formats.
With an impressive $107 million in funding from leading venture capital firms including NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, as well as visionary AI leaders like Fei-Fei Li and Alexandr Wang, we are strategically positioned to drive global innovation. Our headquarters in San Francisco, along with a significant presence in Seoul, illustrates our commitment to reshaping the future of technology.
We celebrate the unique journeys of every individual and believe that our diverse backgrounds fuel our creativity and innovation. We invite passionate individuals who share our mission to join us in transforming video understanding and multimodal AI.
Role Overview
Unlike typical video engineering roles that focus primarily on enhancing human playback, at Twelve Labs, we engineer video for machine comprehension. This unique approach prioritizes AI model performance over mere perceptual quality. As the Principal Software Engineer for Video Engineering, you will architect and implement our video processing pipelines, overseeing the entire journey from byte ingestion through to playback. Your work will be pivotal in ensuring our systems are efficient, cost-effective, and tailored for AI-driven video intelligence at scale.
Your Responsibilities
End-to-End Ownership: Lead the design and execution of video pipelines, covering ingestion, decoding, chunking, storage, and playback, applicable to both batch and streaming modes.
Codec Expertise: Guide the development of decoding strategies, container format management, and codec implementations, ensuring optimal performance for various video types.

