Research Infrastructure Engineer at Thinking Machines | San Francisco

On-site | Full-time | $350K/yr - $475K/yr

Experience Level

Mid to Senior

Qualifications

Minimum qualifications:

  • Bachelor's degree or equivalent experience in computer science, engineering, machine learning, or a related field.
  • Strong software engineering fundamentals with a proven track record of building reliable, maintainable systems.
  • Proficiency in at least one backend programming language (we frequently use Python or Rust).
  • Ability to operate effectively across the tech stack and manage projects end-to-end.
  • Experience in highly collaborative environments, working with diverse cross-functional teams and subject matter experts.

Preferred qualifications: We encourage you to apply even if you meet some but not all of these qualifications.

About the job

At Thinking Machines Lab, we are on a mission to empower humanity by advancing collaborative general intelligence. Our vision is to create a future where everyone has access to the knowledge and tools necessary to harness AI for their unique needs and objectives.

We are a diverse team of scientists, engineers, and builders who have helped create some of the most widely used AI products, including ChatGPT and Character.ai. Our members' prior contributions also include open-weight models like Mistral and popular open-source projects such as PyTorch, OpenAI Gym, Fairseq, and Segment Anything.

About the Role

We are seeking talented engineers to join our team and develop the libraries and tools that will accelerate research efforts at Thinking Machines. You will take charge of our internal infrastructure—creating evaluation libraries, reinforcement learning training libraries, and experiment tracking platforms—while building systems that enhance research velocity over time.

This position emphasizes collaboration. You will work closely with researchers to identify bottlenecks and pain points, ensuring that they trust your systems to function seamlessly and find them enjoyable to use.

What You'll Do

  • Design, build, and manage research infrastructure, including evaluation frameworks, RL training systems, experiment tracking platforms, visualization tools, and shared utilities.
  • Develop high-throughput, scalable pipelines for distributed evaluation, reward modeling, and multimodal assessment.
  • Establish systems for reproducibility, traceability, and robust quality control across research experiments and model training runs, implementing effective monitoring and observability.
  • Collaborate directly with researchers to identify bottlenecks and unlock new capabilities, treating research tools as products: proactively seeking feedback and tracking adoption.
  • Work alongside infrastructure, data, and product teams to integrate tools across the technical stack.

About Thinking Machines Lab

Thinking Machines is at the forefront of AI innovation, dedicated to enhancing human capabilities through advanced collaborative general intelligence. Our team is passionate about creating cutting-edge technologies that democratize access to AI tools and knowledge.
