companyAnthropic logo

Staff + Senior Software Engineer, Cloud Inference

AnthropicSan Francisco, CA | New York City, NY | Seattle, WA
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Qualifications

Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field. Experience in designing and developing cloud-based infrastructure across multiple CSPs. Strong knowledge of networking, APIs, and operational paradigms specific to cloud computing. Proficiency in CI/CD automation systems and experience with validation and deployment pipelines. Ability to understand and manage cost-effective inference strategies across various platforms. Excellent problem-solving skills and the ability to work collaboratively with cross-functional teams. Familiarity with capacity planning and autoscaling methodologies.

About the job

About Anthropic

At Anthropic, our mission is to develop AI systems that are safe, interpretable, and controllable. We believe in harnessing AI for the greater good of our users and society at large. Our dynamic team comprises dedicated researchers, engineers, policy experts, and business leaders who collaborate to create beneficial AI systems.

About the Role

The Cloud Inference team is responsible for scaling and optimizing Claude to cater to a vast array of developers and enterprise clients across platforms such as AWS, GCP, Azure, and future cloud service providers (CSPs). We manage the complete lifecycle of Claude on each cloud platform—from API integration and intelligent request routing to inference execution, capacity management, and daily operations.

Our engineers wield significant influence, driving multiple key revenue streams while optimizing one of Anthropic's most valuable resources—compute power. As we expand to additional cloud providers, the intricacies of efficiently managing inference across diverse platforms with varying hardware, networking frameworks, and operational models grow substantially. We seek engineers adept at navigating these variances, developing strong abstractions that are effective across providers, and making informed infrastructure choices that keep us cost-effective at scale.

Your contributions will enhance the operational scale of our services, expedite our capacity to launch cutting-edge models and innovative features to clients across all platforms, and ensure our large language models (LLMs) adhere to stringent safety, performance, and security standards.

About Anthropic

Anthropic is at the forefront of AI development, committed to creating technology that is not only powerful but also safe and beneficial. Our innovative team is dedicated to advancing the field of AI while ensuring that our systems are interpretable and controllable, fostering trust and reliability in AI applications.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.