companyAfterQuery logo

Reinforcement Learning Environment Engineer

AfterQuerySan Francisco
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Entry Level

Qualifications

1-4 years of experience in relevant fieldsExperience or internships with RL environment organizations or AI safety/benchmarking entities such as METR or Artificial Analysis is a major plusStrong analytical skills and a passion for improving AI modelsProficiency in data management and evaluation techniquesAbility to work collaboratively in a fast-paced environment

About the job

About AfterQuery

AfterQuery develops training data and evaluation frameworks that leading AI labs use to improve their models. The team partners with major research institutions to build datasets and run assessments that go beyond standard benchmarks. As a post-Series A company based in San Francisco, AfterQuery values contributions from every team member. Work here directly shapes the next generation of AI models.

Role Overview

The Reinforcement Learning Environment Engineer designs datasets and evaluation systems that influence how advanced AI models learn and improve. This role involves close collaboration with research teams, hands-on experimentation with new data collection methods, and the creation of metrics to track model progress. Work moves from theoretical analysis to practical experiments, feeding directly into large-scale model training efforts.

What You Will Do

  • Develop data segments that expose key failure modes in sectors such as finance, software engineering, and enterprise operations.
  • Refine reward signals for Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from Value Reinforcement (RLVR) systems.
  • Define quantitative metrics for dataset quality, diversity, and their effects on model alignment and capability.
  • Work closely with research teams to translate training objectives into concrete data requirements and evaluation criteria.

This position is based in San Francisco.

About AfterQuery

At AfterQuery, we specialize in developing the training data and evaluation systems that empower the world’s leading AI labs to enhance their models. Our mission is to push the boundaries of AI research through the creation of high-quality datasets and rigorous evaluation processes, enabling significant advancements in artificial intelligence technology.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.