Machine Learning Scientist - Open Source Lead

On-site, Full-time

Experience Level

Mid to Senior

Qualifications

PhD or Master's degree in Computer Science, Data Science, or related field.

Strong experience in machine learning algorithms and evaluation methodologies.

Proficient in programming languages such as Python, R, or similar.

Experience with open-source projects and community engagement.

Excellent communication skills, both verbal and written.

Ability to work collaboratively in a fast-paced, interdisciplinary environment.

About the job

Join Arena Intelligence as a Machine Learning Scientist

At Arena Intelligence, we are revolutionizing how AI models are evaluated in real-world scenarios. Founded by innovative researchers from UC Berkeley’s SkyLab, our mission is to push the boundaries of AI evaluation and ensure its practical application.

With millions of users engaging with our platform each month, we prioritize community feedback to develop transparent, rigorous, and human-centered model evaluations. Our leaderboards serve as the benchmark for AI performance, and leading enterprises and AI labs trust them to understand the reliability, alignment, and impact of AI systems.

Our diverse team comprises experts from esteemed institutions such as UC Berkeley, Google, Stanford, DeepMind, and Discord. We foster a culture that values truth, agility, craftsmanship, curiosity, and impact over hierarchy. We are committed to creating an environment where talented individuals from all backgrounds can excel in their work.

Role Overview

We are seeking a passionate Machine Learning Scientist to spearhead our open-source research initiatives, including the development of open datasets and code releases. You will be instrumental in advancing how AI models are evaluated and understood globally.

In this position, you will operationalize our dedication to openness by curating impactful datasets, developing innovative methodologies, and establishing reproducible benchmarks. Your contributions will enhance our public leaderboards, empower community tools, and promote transparency in AI evaluation on a global scale.

This interdisciplinary role involves collaboration with engineers, product teams, marketing, and the broader research community to refine model comparisons, analyze preference data, and explore dimensions like style, reasoning, and robustness. You will also work closely with our go-to-market teams to advocate for our open research initiatives, strengthen research partnerships, and encourage community engagement.

If you are excited by complex challenges, rigorous evaluation processes, and scientific outreach, we invite you to apply!

About Arena Intelligence

Arena Intelligence is at the forefront of AI evaluation, creating an open platform that allows users to assess the performance of AI models in real-world applications. Our innovative leaderboards and evaluation processes are trusted by the global AI community and continue to shape the conversation around AI reliability and progress.
