companyDatabricks logo

Senior Software Engineer, Model Serving

DatabricksSan Francisco, California
On-site Full-time $166K/yr - $225K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Key Responsibilities:Design and implement fundamental systems and APIs that drive Databricks Model Serving, ensuring scalability, reliability, and operational excellence. Guide architectural choices and trade-offs to enhance performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads. Directly contribute to critical components across serving infrastructure—from model container builds and deployment workflows to runtime systems such as routing, caching, observability, and intelligent autoscaling—ensuring seamless and efficient operations at scale. Work collaboratively with product, platform, and research teams to translate client needs into reliable and high-performing systems. Lead technical initiatives focused on enhancing latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers. Establish best practices for code quality, testing, and operational readiness, mentoring fellow engineers through design reviews and technical guidance.

About the job

At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging issues of our time—from realizing the future of transportation to speeding up medical innovations. We achieve this by developing and maintaining the premier data and AI infrastructure platform, allowing our clients to leverage profound data insights to enhance their operations.

 

Our Model Serving product equips organizations with a cohesive, scalable, and governed platform for deploying and overseeing AI/ML models, spanning traditional ML to specialized large language models. It provides real-time, low-latency inference, governance, monitoring, and lineage capabilities. With the rapid rise of AI adoption, Model Serving stands as a fundamental component of the Databricks platform, enabling clients to operationalize models efficiently and cost-effectively at scale.

 

As a Senior Engineer, your role will be pivotal in transforming both the product experience and the underlying infrastructure of Model Serving. You will design and create systems enabling high-throughput, low-latency inference across CPU and GPU workloads, influence architectural strategies, and work closely with platform, product, infrastructure, and research teams to deliver an exceptional serving platform.

About Databricks

Databricks is committed to revolutionizing how organizations utilize data and AI to solve complex challenges. Our innovative platform and dedicated teams are at the forefront of the data and AI landscape, helping enterprises unlock value and accelerate breakthroughs across various industries.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.