About the job
Join Our Team as a Software Engineer - AI Infrastructure
Location: North America Remote / San Francisco · Full-Time
At Andromeda Cluster, we are dedicated to democratizing access to advanced AI infrastructure that was once only available to hyperscalers. Founded by industry leaders Nat Friedman and Daniel Gross, we have evolved from a singular managed cluster to a global platform that connects top AI labs, data centers, and cloud providers around the world. Our orchestration layer efficiently manages training and inference tasks globally, enhancing flexibility and efficiency in this rapidly expanding sector. We aim to create a global marketplace for AI computing, empowering AGI with the same fluidity as global financial markets.
As we continue to grow, we are on the lookout for talented individuals in the fields of AI infrastructure, research, and engineering.
Your Role
In the position of Infrastructure Product Engineer, you will be integral in constructing the foundational framework of Andromeda’s platform. Your challenge will be to simplify complex, real-world infrastructure issues into scalable product solutions that our customers will benefit from.
Key Responsibilities
- Architect and develop essential platform components, focusing on infrastructure orchestration, provisioning, and lifecycle management solutions.
- Create robust APIs, services, and control planes that abstract diverse infrastructure types, including VMs, Kubernetes, bare metal, and schedulers.
- Convert customer usage patterns into actionable product requirements, delivering impactful features and enhancements.
- Design automation and internal tools to mitigate manual and ad-hoc operational tasks.
- Improve platform reliability, performance, and observability, focusing on sustainable enhancements rather than quick fixes.
- Collaborate with other teams to establish clear ownership boundaries between platform features and customer-specific solutions.
- Write clean, maintainable, and well-documented code with a focus on long-term sustainability.
- Engage in technical design discussions and contribute to the architectural advancements of our platform.

