Thinking Machines Lab

Pre-Training Research Scientist

On-site Full-time $350K/yr - $475K/yr

Qualifications

Minimum qualifications: PhD in Computer Science, Machine Learning, or a related field. Demonstrated experience in machine learning research, with a focus on model training and optimization. Proficiency in programming languages such as Python and experience with AI frameworks. Strong analytical skills and the ability to work collaboratively in a research-oriented environment.

About the job

At Thinking Machines Lab, we are dedicated to empowering humanity through the advancement of collaborative general intelligence. Our vision is to create a future where everyone can harness the power of AI to meet their individual needs and aspirations.

Our team is composed of passionate scientists, engineers, and innovators who have developed some of the most influential AI technologies, such as ChatGPT and Character.ai, as well as cutting-edge open-weight models like Mistral and acclaimed open-source projects including PyTorch, OpenAI Gym, Fairseq, and Segment Anything.

About the Role

The role of Pre-Training Researcher is pivotal to our strategic roadmap, focused on enhancing our understanding of how large models learn from data. You will investigate novel pre-training methodologies, architectures, and learning objectives aimed at making model training more efficient, robust, and aligned with human values.

This position combines fundamental research with practical engineering, as we seamlessly integrate both disciplines within our team. You will be expected to produce high-performance code and engage with technical literature. This is an ideal opportunity for individuals who thrive on theoretical exploration as well as hands-on experimentation, and who aspire to influence the foundational methods by which AI learns.

This is an evergreen role, meaning we keep this position open to welcome expressions of interest in this research field. We receive numerous applications, and while there may not always be an immediate fit, we encourage you to apply. We consistently review applications and will reach out as new opportunities arise. If you gain additional experience, you are welcome to reapply, but please limit your applications to once every six months. We may also post specific openings for project or team needs, where direct applications are welcome in addition to this evergreen role.

What You’ll Do

  • Research and develop new pre-training methodologies.
  • Work in areas such as scaling, architecture, algorithms, or optimization of large-scale training runs, depending on your research interests and expertise.
  • Design data curricula and sampling strategies that enhance learning dynamics and model generalization.
  • Collaborate with infrastructure and data teams to conduct large-scale experiments in an efficient and reproducible manner.
  • Publish and present research that propels the entire community forward, sharing code, datasets, and insights to accelerate progress across both industry and academia.

About Thinking Machines Lab

Thinking Machines Lab is at the forefront of AI innovation, committed to democratizing access to advanced technologies and fostering human-centered applications of AI. As part of our mission, we focus on building intelligent systems that empower individuals and organizations alike.
