companyPostman logo

AI Reliability & Monitoring Engineering Lead at Postman | San Francisco

PostmanSan Francisco, California, United States
On-site Full-time $256K/yr - $276K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Proven experience in AI systems reliability engineering or a related field. Strong understanding of reliability metrics, observability tools, and incident response protocols. Experience with automation frameworks and real-time monitoring systems. Excellent problem-solving skills and the ability to work under pressure.

About the job

Who Are We?

Postman stands at the forefront of the API revolution, serving over 45 million developers and 500,000 organizations, including 98% of the Fortune 500. We empower developers and professionals worldwide to construct an API-first ecosystem by simplifying every aspect of the API lifecycle while enhancing collaboration and innovation.

Headquartered in San Francisco, our offices span across Boston, New York, Austin, Tokyo, London, and Bangalore—our roots. As a privately held enterprise, we have attracted investments from leading firms including Battery Ventures, BOND, Coatue, CRV, Insight Partners, and Nexus Venture Partners. To dive deeper into our vision, explore The 'API-First World' graphic novel.

The Opportunity

We are on the lookout for a skilled AI Systems Reliability Engineer who will play a pivotal role in defining, building, and maintaining the infrastructure and processes that guarantee the reliability, scalability, and performance of our AI-enhanced API and agentic systems in production. This position emphasizes monitoring, availability, incident response, and automation to support AI services and tools relied on by millions globally.

What You’ll Do

  • Develop and manage reliability metrics (SLOs) for AI-driven API services and features of the agentic AI platform.

  • Implement comprehensive observability and monitoring systems for real-time performance and fault detection.

About Postman

At Postman, we are dedicated to leading the transformation of the API landscape. Our cutting-edge platform is designed to simplify the development process, enabling teams to create and manage APIs efficiently. Join us in our mission to foster collaboration and innovation within the API community.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.