companyOpenAI logo

Machine Learning Engineer, Distributed Data Systems

OpenAISan Francisco
Hybrid Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Strong experience with distributed systems, large-scale infrastructure, and a keen interest in data. A detail-oriented approach with rigor in system reliability. Excellent software engineering fundamentals and organizational skills. Comfort with ambiguity and adaptability to rapid changes.

About the job

About Our Team

Join the innovative Sora team at OpenAI, where we are at the forefront of developing multimodal capabilities for our foundation models. Our hybrid research and product team is dedicated to seamlessly integrating multimodal functionalities into our AI solutions, ensuring they are dependable, user-centric, and aligned with our vision of benefiting society at large.

Role Overview

As a Machine Learning Engineer specializing in Distributed Data Systems, you will be instrumental in designing and scaling the infrastructure that facilitates large-scale multimodal training and evaluation at OpenAI. Your role will involve managing complex distributed data pipelines, collaborating closely with researchers to convert their requirements into robust, production-ready systems, and enhancing pipelines that are essential for Sora's rapid iteration cycles.

We are seeking detail-oriented engineers with extensive experience in distributed systems who thrive in high-stakes environments and excel in building resilient infrastructure.

This position is located in San Francisco, CA, and follows a hybrid work model, requiring three days in the office each week. We also provide relocation assistance for new team members.

Key Responsibilities:

  • Design, implement, and maintain data infrastructure systems, including distributed computing, data orchestration, distributed storage, streaming infrastructure, and machine learning systems, with a focus on scalability, reliability, and security.

  • Ensure our data platform can scale exponentially while maintaining high reliability and efficiency.

  • Collaborate with researchers to gain a deep understanding of their requirements, translating them into production-ready systems.

  • Strengthen, optimize, and manage critical data infrastructure systems that support multimodal training and evaluation.

You Will Excel in This Role If You:

  • Possess strong experience with distributed systems and large-scale infrastructure, coupled with a keen interest in data.

  • Exhibit meticulous attention to detail and a commitment to building and maintaining reliable systems.

  • Demonstrate solid software engineering fundamentals and effective organizational skills.

  • Thrive in environments characterized by ambiguity and rapid change.

About OpenAI

OpenAI is a trailblazing AI research and deployment organization committed to ensuring that general-purpose artificial intelligence serves humanity. We continuously push the boundaries of AI capabilities and strive to create technology that benefits everyone.

About OpenAI

OpenAI is a pioneering AI research and deployment organization dedicated to ensuring that general-purpose artificial intelligence is beneficial to all of humanity. We are committed to pushing the limits of AI capabilities and creating technologies that empower society.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.