Machine Learning Systems Research Engineer - Post-training Algorithms

Scale AI
San Francisco, CA; New York, NY
On-site Full-time $218.4K/yr - $273K/yr

Qualifications

1-3 years of experience in training LLMs in a production setting.
Strong passion for system optimization.
Familiarity with post-training methodologies such as RLHF, RLVR, and algorithms like PPO and GRPO.
Proven ability to operate the architecture of modern GPU clusters.
Experience in multi-node LLM training and inference.
Excellent software engineering skills, proficient in tools and frameworks like CUDA, PyTorch, transformers, and flash attention.
Exceptional written and verbal communication skills for effective collaboration in a cross-functional team environment.
Master's degree or PhD in Computer Science or a related field is preferred.

About the job

As AI continues to play a crucial role across various sectors, Scale AI is committed to accelerating the evolution of AI applications. For nearly a decade, we have been at the forefront of AI data solutions, supporting significant innovations in areas such as generative AI, defense technologies, and autonomous vehicles. With recent funding from Meta, we are intensifying our efforts to develop cutting-edge post-training algorithms essential for enhancing the performance of complex enterprise agents globally.

The Enterprise ML Research Lab is at the forefront of this AI transformation. Our team is dedicated to creating a suite of proprietary research and resources tailored for our enterprise clientele. As a Machine Learning Systems Research Engineer, you will develop algorithms for our next-generation Agent Reinforcement Learning (RL) training platform, support large-scale training operations, and integrate state-of-the-art technologies to optimize our machine learning systems. You will collaborate with other ML Research Engineers and AI Architects on the Enterprise AI team to apply these training algorithms to various client use cases, from next-gen AI cybersecurity firewalls to foundational healthtech search models. If you are passionate about shaping the future of AI, we want to hear from you!

About Scale AI

At Scale AI, we are dedicated to advancing the capabilities of artificial intelligence across various fields. Our innovative approach has established us as a leader in AI data solutions, providing essential support for groundbreaking technologies that are shaping the future.
