companyHyperbolic Labs logo

Senior AI Infrastructure Engineer

Hyperbolic LabsSan Francisco, CA
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Your QualificationsIn-depth knowledge of bare-metal provisioning and lifecycle management, including IPMI/Redfish, BMC-based remote management, PXE boot, and automated OS deployment workflows. Expertise in GPU scheduling and orchestration, with a focus on GPU type awareness, memory management, topology considerations, placement strategies for multi-GPU jobs, and minimization of fragmentation. Strong skills in infrastructure and DevOps engineering, with proficiency in Terraform or Pulumi, CI/CD for infrastructure, secrets management, configuration management, and observability stack implementation. Experience with storage and data infrastructure for AI/ML workloads, including object storage, high-IOPS block storage, and distributed file systems for training data and checkpoints. Proficient in API design and cloud-init for automated provisioning and configuration. Solid understanding of GPU architecture, CUDA, and GPU compute optimization. Highly collaborative team player with excellent communication skills.

About the job

Join Our Mission

At Hyperbolic Labs, we are dedicated to democratizing artificial intelligence by eliminating barriers to computing power through our Open-Access AI Cloud. We aggregate global computing resources to provide an innovative GPU marketplace and AI inference service, making AI affordable and accessible for everyone. As pioneers at the crossroads of AI and open-source technology, we envision a future where AI innovation is driven by imagination, not resource limitations. We invite forward-thinking individuals who share our vision of making AI universally accessible, secure, and cost-effective to join us in crafting a platform that empowers innovators to realize their groundbreaking AI projects.

As we gear up for expansion following our Series A funding, our team, led by co-founders with PhDs in AI, Mathematics, and Computer Science, is set to transform the landscape of computing.

The Role

We are on the lookout for a Senior Infrastructure Engineer to drive the development and scaling of Hyperbolic's GPU Cloud Marketplace. In this pivotal role, you will create a multi-tenancy provisioning and virtualization solution that transforms raw GPUs from diverse global suppliers into a programmable, orchestrated resource pool serving thousands of AI developers and researchers. You will work at the forefront of cloud infrastructure, building the core orchestration layer that allows our platform to deliver cost savings of up to 75% compared to traditional cloud providers.

About Hyperbolic Labs

Hyperbolic Labs is committed to transforming the AI landscape by providing an Open-Access AI Cloud that offers an innovative GPU marketplace. Our mission is to make AI accessible to everyone by harnessing global computing resources to deliver unprecedented affordability and accessibility. We are driven by the belief that innovation in AI should be limited only by imagination, and we strive to create a platform that empowers developers and researchers to realize their visions.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.