About the job
OpenAI’s networking teams design and manage high-performance systems that support the company’s training and inference infrastructure. As a Software Engineer focused on productivity, this position centers on making those teams more effective by improving the developer experience and streamlining complex workflows.
Role overview
This role supports engineers working on intricate infrastructure, with a focus on build systems, testing architecture, release pipelines, and overall development efficiency. The work involves optimizing how engineers build, test, validate, and deploy changes in environments that span multiple servers and interact closely with hardware.
What you will do
- Improve development workflows for engineers building and operating networking systems at OpenAI.
- Design and refine pipelines for continuous deployment, release, and validation.
- Create and maintain test harnesses for multi-server, networked, and hardware-backed environments.
- Increase iteration speed across codebases, particularly in C++, Python, and environments centered on build systems.
- Work with engineers to identify and resolve pain points in CI, testing, debugging, and deployment processes.
- Lead testing and reliability strategies for infrastructure components that support large-scale training and inference workloads.
- Collaborate with both centralized developer experience teams and networking engineers who work directly with these systems.

