companyOpenAI logo

Software Engineer, Distributed Data Systems (Sora)

OpenAISan Francisco
Hybrid Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Candidates should have extensive experience with distributed systems and large-scale infrastructure, strong attention to detail, excellent software engineering fundamentals, and the ability to thrive in dynamic environments.

About the job

About Our Team

Join the innovative Sora team at OpenAI, where we are at the forefront of developing multimodal capabilities for our foundation models. As a dynamic hybrid of research and product development, we focus on seamlessly integrating advanced multimodal functionalities into our AI offerings, ensuring they are not only reliable and user-friendly but also aligned with our mission to foster broad societal benefits.

About the Position

We are seeking a dedicated Software Engineer specializing in Distributed Data Systems to architect and enhance the infrastructure that supports large-scale multimodal training and evaluation at OpenAI. In this role, you will oversee distributed data pipelines and collaborate closely with our researchers to translate their requirements into robust, high-performance systems. You will play a crucial role in fortifying the pipelines that underpin Sora’s rapid innovation cycles.

We are looking for engineers with a keen eye for detail, substantial experience with distributed systems, and a proven track record of building reliable infrastructures in high-stakes environments.

This position is based in San Francisco, CA, and follows a hybrid work model requiring three days in the office each week. We also provide relocation assistance to new team members.

Key Responsibilities:

  • Design, build, and maintain data infrastructure systems including distributed computing, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, ensuring they are scalable, reliable, and secure.

  • Ensure our data platform can scale dramatically while maintaining high levels of reliability and efficiency.

  • Collaborate with researchers to deeply understand their needs and translate them into production-ready systems.

  • Harden, optimize, and maintain vital data infrastructure systems that drive multimodal training and evaluation.

Ideal Candidates Will Have:

  • Extensive experience with distributed systems and large-scale infrastructure, coupled with a strong passion for data.

  • A detail-oriented mindset and a commitment to building and maintaining dependable systems.

  • Solid software engineering fundamentals and exceptional organizational skills.

  • Comfort with ambiguity and rapid changes in a fast-paced environment.

About OpenAI

OpenAI is a pioneering AI research and deployment organization dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We strive to advance digital intelligence in a way that is safe and beneficial, pushing the boundaries of innovation and technology.

About OpenAI

OpenAI is at the cutting edge of AI research and deployment, committed to ensuring that artificial intelligence serves the broader interests of humanity. We prioritize safety, innovation, and societal impact in our work.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.