About the job
Join Our Innovative Team
Adaptive ML is an emerging leader in AI technology, dedicated to creating a cutting-edge Reinforcement Learning Operations (RLOps) platform. Our mission is to empower enterprises to customize and implement large language models (LLMs) that deliver tangible results.
We offer the foundational infrastructure necessary for tuning, assessing, and deploying specialized models at scale, driving advancements in task-specific LLM development while managing production workflows that handle millions of requests efficiently and cost-effectively across distributed systems.
Our close-knit team comprises experts who previously contributed to the development of state-of-the-art open-access large language models. With a successful $20M seed funding round led by Index Ventures and ICONIQ in early 2024, we are already operational, serving clients like Manulife, AT&T, and Deloitte in various sectors, including travel and finance.
Our Technical Staff is at the heart of Adaptive ML, crafting the essential technology that supports our objectives while closely collaborating with our Commercial and Product teams. We are committed to developing resilient and efficient technologies and conducting impactful research that aligns with our strategic goals and enhances customer value.
Internship Overview
This internship position is open within our Technical Staff. If you find any of the following aspects appealing, we encourage you to apply.
As a Technical Intern, you will play a vital role in developing components of the core technology that drives Adaptive ML, specifically within our internal LLM stack, Adaptive Harmony. We believe that generative AI thrives on the fusion of robust engineering and thoughtful experimentation, and our interns gain exposure to both realms.
In this role, you will collaborate with seasoned engineers and researchers, receive mentorship, and contribute to meaningful projects that support production systems and ongoing research initiatives. This position is designed for proactive students or early-career engineers eager to gain hands-on experience in applied machine learning systems.
Please note: This is a 6-month in-person internship based in our Paris office or NYC office.
Typical Responsibilities of the Technical Team include:
- Design and implement robust software in Rust, seamlessly connecting user-friendly Python scripts with high-performance distributed training algorithms operating on hundreds of GPUs.
- Analyze and enhance GPU inference kernels using Triton or CUDA, pinpointing memory constraints and optimizing latency while determining effective benchmarking methodologies for inference services.
- Collaborate on additional technical tasks that contribute to the overall success of our projects.
