About the job
Our Mission
As we continue to establish ourselves as the Essential Cloud for AI, CoreWeave is on the lookout for a visionary Vice President of Research Training Infrastructure. This executive leader will be responsible for defining the product strategy and overseeing engineering execution for the services that empower the world’s most ambitious AI research labs. Your role will be to connect the technological foundation with the research community, creating a seamless, high-performance environment where groundbreaking models are developed.
The Role: Architect of the AI Factory
You will oversee the product strategy for our Research Training Stack, emphasizing specialized orchestration, evaluation, and iteration tools necessary for large-scale pre-training and post-training. This is a pivotal role at the convergence of high-performance computing (HPC) and cloud-native agility.
Core Responsibilities
- Frontier Orchestration: Lead the advancement of SUNK (Slurm on Kubernetes) to grant researchers deterministic, bare-metal performance through a cloud-native interface.
- Holistic Training Services: Beyond Slurm, spearhead the evolution of next-generation orchestrators and automated training evaluation frameworks to ensure model quality throughout the lifecycle.
- Post-Training Excellence: Establish the infrastructure necessary for advanced Reinforcement Learning (RL) and RLHF.

