companyScale AI logo

Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Scale AISan Francisco, CA; New York, NY
On-site Full-time $218.4K/yr - $273K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Responsibilities:Train state-of-the-art models, developed both internally and from the broader community, for deployment to our enterprise clients. Conduct research on cutting-edge algorithms to seamlessly integrate into our training framework. Design solutions that enable complex multi-agent systems to learn directly from both process and outcome-based rewards. Preferred Qualifications:5+ years of experience in training large language models (LLMs) in a production environment. Familiarity with post-training methods such as Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning with Value Regularization (RLVR), along with algorithms like Proximal Policy Optimization (PPO) and Generalized Rate of Policy Optimization (GRPO). Recent publications in top-tier conferences such as NEURIPS, ICLR, or ICML within the last two years. A PhD or Master’s degree in Computer Science or a related field.

About the job

Artificial Intelligence is increasingly becoming a pivotal element across all sectors of society. At Scale AI, we are committed to accelerating the evolution of AI applications. For nearly a decade, we have been the premier AI data foundry, propelling groundbreaking advancements in areas such as generative AI, defense applications, and autonomous vehicles. Following our recent investment from Meta, we are intensifying our efforts to develop advanced post-training algorithms that are essential for sophisticated agents in enterprises worldwide.

The Enterprise ML Research Lab is at the forefront of this AI revolution, leveraging a suite of proprietary research, tools, and resources to support our enterprise clients. As a Staff Machine Learning Research Engineer focusing on Agent Post-training, you will be instrumental in creating our next-generation Agent Reinforcement Learning training platform. Your work will enable the training of top-tier Agents that deliver state-of-the-art results in real-world enterprise applications.

You will incorporate cutting-edge research into our training framework, empowering ML Research Engineers on the Enterprise AI team to deploy use cases ranging from next-generation AI cybersecurity firewalls to training foundational healthtech search models. If you are passionate about shaping the future of the GenAI movement, we welcome your application!

About Scale AI

Scale AI is a leading innovator in the AI data industry, committed to enhancing the development of AI applications. Our cutting-edge solutions are transforming sectors from cybersecurity to healthcare, backed by significant investments and a strong commitment to research and development.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.