About the job
Our Mission
At Reflection AI, our goal is to create open superintelligence and ensure its accessibility for everyone.
We are pioneering open weight models for various users, including individuals, enterprises, and even nation-states. Our talented team comprises AI researchers and industry veterans from leading organizations such as DeepMind, OpenAI, Google Brain, Meta, Character. AI, and Anthropic.
Role Overview
Develop systems that convert robust pre-trained models into aligned and versatile agents.
Lead research and engineering efforts to advance post-training practices, focusing on data curation and large-scale optimization.
Create data generation frameworks, reward models, reinforcement learning algorithms, and techniques for inference-time scaling.
Collaborate with both pre-training and post-training teams to achieve significant enhancements in model capabilities.
Help refine our understanding of how large models learn to reason, follow instructions, and evolve through reinforcement learning.
Your Profile
Solid grasp of machine learning principles with hands-on experience in large-scale LLM training.
Proficient engineering skills, with the ability to navigate intricate ML codebases and distributed systems.
Experience in enhancing model performance through data, reward modeling, or reinforcement learning techniques.
Track record of leading ambitious research or engineering projects resulting in measurable improvements.
Thrives in a dynamic, high-agency startup atmosphere; oriented towards action and clarity in execution.
Ability to work seamlessly across research and infrastructure boundaries.
Excellent communication skills and a collaborative mindset.
Driven by a passion for pushing the boundaries of intelligence.
What We Provide:
At Reflection AI, we believe that to truly build open superintelligence, it must be rooted in a strong foundation. By joining us, you will contribute to building from the ground up within a compact, highly skilled team. Together, we will shape the future of our company and the landscape of open foundational models.
We aim for you to accomplish the most impactful work of your career, with the assurance that you and your loved ones are well-supported.

