
Senior Software Engineer - Model Training

Baseten, San Francisco
On-site Full-time

Experience Level

Senior

Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field

  • Proven experience in designing and implementing distributed systems

  • Strong proficiency in Python and deep learning frameworks (e.g., TensorFlow, PyTorch)

  • Familiarity with GPU programming and optimization techniques

  • Experience with CI/CD pipelines and version control systems

  • Ability to work collaboratively in a fast-paced environment

  • Excellent problem-solving skills and a passion for AI technology

About the job

ABOUT BASETEN

At Baseten, we are at the forefront of enabling transformative AI solutions for some of the world's leading companies, including Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our innovative platform combines cutting-edge AI research, adaptable infrastructure, and developer-friendly tools to facilitate the production of advanced models. Recently, we celebrated our rapid growth with a successful $300M Series E funding round from notable investors like BOND, IVP, Spark Capital, Greylock, and Conviction. We invite you to join our dynamic team and contribute to the evolution of AI product deployment.

THE ROLE

As a Senior Software Engineer specializing in Model Training at Baseten, you will play a pivotal role in constructing the infrastructure essential for the large-scale training and fine-tuning of foundational AI models. Your responsibilities will include designing and implementing distributed training systems, optimizing GPU utilization, and establishing scalable pipelines that empower Baseten and our clientele to adapt models with efficiency and reliability. This role demands a high level of technical expertise and hands-on involvement: you will be responsible for critical components of our training stack, collaborate with product and infrastructure teams to identify customer needs, and drive advancements in scalable training infrastructure.

EXAMPLE WORK:

  1. Training open-source models that surpass GPT-5 capabilities for a leading digital insurer

  2. Exploring specialized, continuously learning models as the future of AI

  3. Overview of our training documentation

  4. Research initiatives we've undertaken

RESPONSIBILITIES

  • Design, construct, and sustain distributed training infrastructures for large foundation models

  • Develop scalable pipelines for fine-tuning and training across diverse GPU/accelerator clusters

  • Enhance training performance through optimization of algorithms and infrastructure

  • Collaborate closely with cross-functional teams to align technical solutions with business objectives

  • Stay abreast of advancements in the field of machine learning and AI to continually improve our training processes

About Baseten

Baseten is a pioneering company dedicated to revolutionizing the AI landscape by providing essential tools and support for companies at the cutting edge of artificial intelligence. With a strong commitment to innovation and excellence, we empower our partners to deploy state-of-the-art AI models that drive their businesses forward. Join us, and be part of a vibrant community that thrives on creativity and technical challenge.
