Software Engineer: Machine Learning Infrastructure

Generalist
San Francisco Bay Area (San Mateo) or Boston (Somerville)
On-site, Full-time

Experience Level

Mid to Senior

Qualifications

Candidates should possess a robust understanding of machine learning frameworks, experience with high-performance computing environments, and a passion for robotics and AI technologies. Strong problem-solving skills and the ability to work collaboratively in a fast-paced environment are essential.

About the job

About the Role

At Generalist, we are at the forefront of training expansive robot foundation models, leveraging cutting-edge GPU hardware, primarily from Nvidia, to run distributed training jobs and research experiments. This work demands exceptional storage solutions and optimized data loading, which requires making full use of cloud infrastructure alongside custom-built solutions.

In this role, you will take charge of our inference infrastructure. Our robotic systems rely on a dedicated fleet of on-premises GPUs designed for demanding real-time computations and latency-sensitive applications within resource-constrained environments.

Your Responsibilities:

  • Manage and optimize our GPU compute fleets.

  • Facilitate user-friendly access to GPUs for researchers, ensuring optimal utilization.

  • Enhance ML data loading, transport, and storage systems in heavily used distributed environments.

  • Oversee the orchestration of our robot inference fleets.

You May Excel in This Position If You:

  • Have experience managing large GPU fleets for large-scale, distributed training or inference.

  • Possess significant expertise in using Slurm or Kubernetes for ML workload orchestration.

  • Have developed high-scale ML data loaders and preparation systems.

  • Understand the intricacies of ML hardware, storage, and networking systems.

  • Are familiar with the Nvidia GPU ecosystem.

About Generalist

Generalist is dedicated to transforming the future with general-purpose robotics. We envision a world where humans and machines collaborate seamlessly to enhance productivity and innovation. Our focus on developing embodied foundation models, particularly in dexterity, drives us to push the boundaries of data, modeling, and hardware, enabling robots to navigate and interact intelligently with their environments. Our team comprises experts from leading organizations such as OpenAI, Boston Dynamics, and Google DeepMind, all united by a commitment to advancing AI technologies. We have a proven track record of delivering significant AI innovations and are excited to continue this journey at Generalist.
