About the job
About Our Team:
At OpenAI, we are dedicated to ensuring that artificial general intelligence (AGI) serves the greater good of humanity. Our API has emerged as the most widely embraced AI platform in the industry, catering to a diverse clientele ranging from startups and independent developers to Fortune 500 companies. By leveraging our multimodal APIs—which encompass real-time interactions, text-to-speech (TTS), speech generation, and image creation—we empower users to effectively harness the full spectrum of AI capabilities at scale.
About the Role:
We are on the lookout for an Engineering Manager to spearhead our multimodal API product suite. In this pivotal role, you will guide a talented team focused on delivering cutting-edge APIs for real-time processing, speech transcription, speech generation, and image creation. You will be instrumental in shaping the product roadmap and developing the tools that enable developers to connect with millions of end users through AI-driven audio, video, and imagery.
In this role, you will:
Lead, mentor, and cultivate a high-performing engineering team dedicated to multimodal API products, including our real-time API, Whisper transcription models, TTS speech generation models, and DALLE image generation APIs.
Collaborate with product managers, designers, and various stakeholders to articulate the strategic vision and product roadmap.
Work alongside our research teams to enhance our core multimodal models tailored for API customer use cases.
Steer technical and architectural decisions with a focus on scalability, robustness, and user experience.
Promote a culture of innovation, continuous improvement, and accountability within your team.
Qualifications:
Demonstrated experience in managing engineering teams that successfully deliver complex, high-quality products at scale.
Strong technical expertise with proficiency in modern software engineering practices and system architecture.
Exceptional collaboration and communication skills to effectively engage with diverse teams and stakeholders.
Familiarity with or a strong passion for multimodal AI, encompassing speech technologies, real-time systems, and image generation.
Adept at thriving in a fast-paced, dynamic startup environment.

