About the job
Join Arena Intelligence as a Machine Learning Scientist
At Arena Intelligence, we are revolutionizing how AI models are evaluated in real-world scenarios. Founded by innovative researchers from UC Berkeley’s SkyLab, our mission is to push the boundaries of AI evaluation and ensure its practical application.
With millions of users engaging with our platform each month, we prioritize community feedback to develop transparent, rigorous, and human-centered model evaluations. Our leaderboards serve as the benchmark for AI performance, gaining the trust of leading enterprises and AI labs to understand the reliability, alignment, and impact of AI systems.
Our diverse team comprises experts from esteemed institutions such as UC Berkeley, Google, Stanford, DeepMind, and Discord. We foster a culture that values truth, agility, craftsmanship, curiosity, and impact over hierarchy. We are committed to creating an environment where talented individuals from all backgrounds can excel in their work.
Role Overview
We are seeking a passionate Machine Learning Scientist to spearhead our open-source research initiatives, including the development of open datasets and code releases. You will be instrumental in advancing how AI models are evaluated and understood globally.
In this position, you will operationalize our dedication to openness by curating impactful datasets, developing innovative methodologies, and establishing reproducible benchmarks. Your contributions will enhance our public leaderboards, empower community tools, and promote transparency in AI evaluation on a global scale.
This interdisciplinary role involves collaboration with engineers, product teams, marketing, and the broader research community to refine model comparisons, analyze preference data, and explore dimensions like style, reasoning, and robustness. You will also work closely with our go-to-market teams to advocate for our open research initiatives, strengthen research partnerships, and encourage community engagement.
If you are excited by complex challenges, rigorous evaluation processes, and scientific outreach, we invite you to apply!

