Staff Software Engineer, Model Serving

DatabricksSan Francisco, California

On-site Full-time $192K/yr - $260K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

The impact you will have: Design and implement essential systems and APIs that drive Databricks Model Serving, ensuring scalability, reliability, and operational excellence. Collaborate with product and engineering leadership to outline the technical roadmap and long-term architecture for serving workloads. Guide architectural decisions to enhance performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads. Contribute directly to critical components across the serving infrastructure — from model container builds and deployment workflows to runtime systems such as routing, caching, observability, and intelligent autoscaling — ensuring seamless operations at scale. Work cross-functionally with product, platform, and research teams to convert customer needs into dependable and high-performing systems. Lead initiatives that enhance latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers. Establish best practices for code quality, testing, and operational readiness, while mentoring fellow engineers through design reviews and technical guidance. Represent the team in cross-organizational technical discussions, influencing the broader AI platform strategy at Databricks.

About the job

Our Model Serving product equips organizations with a cohesive, scalable, and governed solution for deploying and managing AI/ML models — ranging from traditional machine learning to intricate proprietary large language models. It ensures real-time, low-latency inference, governance, monitoring, and lineage. As the adoption of AI surges, Model Serving stands as a fundamental component of the Databricks platform, allowing customers to operationalize models at scale with robust SLAs and cost efficiency.

In the role of Staff Engineer, you will significantly influence both the product experience and the core infrastructure of Model Serving. Your responsibilities will include designing and constructing systems that facilitate high-throughput, low-latency inference across CPU and GPU workloads, steering architectural strategies, and collaborating extensively with platform, product, infrastructure, and research teams to create an exceptional serving platform.

About Databricks

Databricks is at the forefront of data and AI innovation, committed to equipping data teams with the tools needed to solve pressing global challenges. Our platform allows organizations to leverage comprehensive data insights to drive significant business improvements.

Similar jobs

1 - 20 of 5,991 Jobs

Search for Infrastructure Staff Software Engineer

5,991 results

Select all on this page (20)

Apply

Staff Software Engineer, Infrastructure

Check

Full-time|$180K/yr - $247.5K/yr|Remote|San Francisco or Remote

Join the Revolution at CheckAt Check, we are transforming the payroll landscape. Our mission goes beyond just building a successful business; we collaborate with our partners to innovate payroll solutions. As pioneers of embedded payroll, we are reshaping the payment process, enabling payroll businesses to launch, expand, and succeed with ease. Discover our journey | Listen in.Check is more than an API; we are the catalyst for developing and scaling payroll operations.Our TeamThe payroll system is in dire need of innovation. We invite you to join a passionate team dedicated to making an impactful change! At Check, you will leverage creative problem-solving and critical thinking to influence every business we partner with. We view challenges as opportunities for improvement, valuing the unique contributions of each team member in our collective mission.If you're ready to dive in and transform payroll, let's collaborate to simplify complexity and enhance the future for businesses of all sizes.Your RoleAt Check, engineering is our foundation. We believe that payroll should resemble modern financial software; achieving this requires a comprehensive understanding of systems and reliable infrastructure that our partners can trust. Every product we deliver relies on scalable and secure systems that ensure timely payments and payroll processing.We are seeking a Staff Software Engineer who possesses strong software design capabilities coupled with hands-on infrastructure experience. In this position, you will focus on the essential systems that drive payroll operations, enhancing our service scalability, production operations, and empowering engineers with the tools to deliver software confidently and securely.You will collaborate across product and platform areas to enhance our cloud infrastructure, fortify our deployment and monitoring strategies, and streamline the architecture that supports embedded payroll services. The challenges you will address often intersect infrastructure, product, and operational domains.This opportunity is perfect for someone who has managed complex systems end-to-end in a dynamic environment and takes pride in developing resilient, comprehensible infrastructure that is vital to our operations.

Mar 12, 2026

Apply

Staff Software Engineer - Node Infrastructure

Anthropic

Full-time|On-site|San Francisco, CA | New York City, NY | Seattle, WA

Anthropic is hiring a Staff Software Engineer to focus on Node Infrastructure. This position is based in San Francisco, New York City, or Seattle. Role overview This role centers on designing, building, and maintaining the core systems that support Anthropic’s services. The work directly affects the reliability and scalability of the company’s AI offerings. Collaboration Work closely with a skilled engineering team to develop infrastructure that supports high-quality AI solutions. The team values input and hands-on problem solving from every member. Impact Efforts in this role help ensure Anthropic’s services remain stable and can grow as demand increases. The systems you help create will play a key part in the company’s ability to deliver dependable AI products.

Apr 29, 2026

Apply

Senior Staff Software Engineer, Data Infrastructure

Watney Robotics

Full-time|$200K/yr - $275K/yr|On-site|San Francisco

About Watney RoboticsAt Watney Robotics, we are pioneers in developing autonomous robotic solutions aimed at enhancing critical infrastructure. Recently securing $21 million in seed funding from leading investors such as Conviction, Abstract, and A*, we are collaborating with the world’s largest hyperscalers to propel the expansion of data centers and streamline maintenance processes.This is an extraordinary opportunity to join our team at a pivotal stage as we transition from prototype to large-scale production. Be part of a team that not only ships cutting-edge systems but also plays a crucial role in shaping the operational framework of an innovative robotics company.

Oct 5, 2025

Apply

Staff Software Engineer, Core Infrastructure

Harvey

Full-time|$236K/yr - $290K/yr|On-site|San Francisco

Harvey builds a secure, enterprise-grade platform for legal and professional services, powered by advanced agentic AI. The company serves more than 1,000 clients in over 60 countries and is backed by top investors. Harvey’s team emphasizes speed, ownership, and high standards, working closely with customers to address real-world needs. This Staff Software Engineer role is based in San Francisco and requires in-person work. Relocation support is available for those moving to the area. Role overview The Core Infrastructure team at Harvey designs and maintains the systems that support every user interaction on the company’s global legal AI platform. These systems process billions of prompt tokens and millions of daily requests for leading law firms and professional service providers around the world. The position combines new infrastructure development with a focus on operational reliability. The work has a direct effect on the platform’s scalability, security, and resilience as Harvey grows into new regions and serves more customers. Key responsibilities Design and implement scalable, fault-tolerant infrastructure systems for Harvey’s AI platform across multiple cloud regions. Own and enhance multi-cloud infrastructure (Azure, GCP), with emphasis on Kubernetes orchestration, networking, and container management. Lead technical initiatives in observability, incident response, and performance tuning.

Apr 27, 2026

Apply

Staff Software Engineer - Machine Learning Infrastructure

Decagon

Full-time|Remote|San Francisco

Join Decagon as a Staff Software Engineer specializing in Machine Learning Infrastructure. In this role, you will play a crucial part in enhancing and optimizing our machine learning systems. You will collaborate with a talented team of engineers to build scalable and efficient infrastructure that supports our AI-driven initiatives.As a key contributor, you will leverage your expertise in software engineering and machine learning to solve complex challenges and drive innovation. Your work will impact various projects and help shape the future of our technology.

Feb 24, 2026

Apply

Staff Software Engineer, Inference Infrastructure

Cohere

Full-Time|On-site|San Francisco

Who are we?At Cohere, our mission is to elevate intelligence to benefit humanity. We specialize in training and deploying cutting-edge models for developers and enterprises focused on creating AI systems that deliver extraordinary experiences such as content generation, semantic search, retrieval-augmented generation, and intelligent agents. We view our work as pivotal to the broad acceptance of AI technologies.We are passionate about our creations. Every team member plays a vital role in enhancing our models' capabilities and the value they provide to our customers. We thrive on hard work and speed, always prioritizing our clients' needs.Cohere is a diverse team of researchers, engineers, designers, and more, all dedicated to their craft. Each individual is a leading expert in their field, and we recognize that a variety of perspectives is essential to developing exceptional products.Join us in our mission and help shape the future of AI!Why this role?Are you excited about architecting high-performance, scalable, and reliable machine learning systems? Do you aspire to shape and construct the next generation of AI platforms that enhance advanced NLP applications? We are seeking talented Members of Technical Staff to join our Model Serving team at Cohere. This team is responsible for the development, deployment, and operation of our AI platform, which delivers Cohere's large language models via user-friendly API endpoints. In this role, you will collaborate with multiple teams to deploy optimized NLP models in production settings characterized by low latency, high throughput, and robust availability. Additionally, you will have the opportunity to work directly with customers to create tailored deployments that fulfill their unique requirements.

Jan 12, 2026

Apply

Staff Software Engineer, Database Infrastructure

Gusto, Inc.

Full-time|$200K/yr - $270K/yr|On-site|Denver, CO;San Francisco, CA;New York, NY;Los Angeles, CA;Seattle, WA

About GustoAt Gusto, we are dedicated to empowering small businesses by managing essential services like payroll, health insurance, 401(k)s, and HR, allowing owners to focus on their passions and customers. With offices in Denver, San Francisco, and New York, we proudly support over 400,000 small businesses nationwide, fostering a workplace that reflects and celebrates the diverse customers we serve. Explore our Total Rewards philosophy. About the Role:We are seeking a seasoned engineer with extensive knowledge in distributed data systems to help shape the future of Gusto's storage architecture. In this impactful role, you will oversee intricate migrations, design high-scale systems, and establish benchmarks for automation, resilience, and security. Your work in implementing distributed database solutions will facilitate Gusto's ongoing growth and scalability.About the Team:The Datastores Infrastructure Engineering team is responsible for designing, building, and maintaining the data platforms that drive Gusto's products, including MySQL, Postgres, Redis, Kafka, and S3. We are committed to ensuring that our infrastructure is consistent, dependable, and equipped to support Gusto's expanding requirements. As we transition to self-hosted distributed databases, our focus lies in minimizing the blast radius, enhancing operational resilience, and enabling sustainable scalability.Here’s what you’ll do day-to-day:Architect, deploy, and manage the complete lifecycle of distributed database systems (TiDB) on Kubernetes at scale, ensuring high availability, data consistency, and operational excellence.Coordinate complex, zero-downtime migrations from monolithic to distributed architectures, including vertical sharding to isolate Product Services.Define and implement efficiency enhancements across the storage infrastructure through query optimization, caching strategies, and workload management.Establish standards and develop reliable automation to maintain data consistency, integrity, and security across distributed systems.Continuously enhance operational excellence by decreasing on-call burdens with sustainable, long-term solutions.Collaborate with product engineering teams and technical partners to enable rapid and reliable product development.

Jan 27, 2026

Apply

Staff Software Engineer, Compute

fal

Full-time|$180K/yr - $250K/yr|On-site|San Francisco

Join our innovative team at fal as a Staff Software Engineer specializing in large-scale computation platforms. We are seeking a seasoned software engineer with extensive experience in developing backend systems that efficiently orchestrate workloads and manage resource constraints. Your expertise in foundational cloud infrastructure and Linux provisioning will be crucial as you work towards achieving high reliability and scalability with minimal operational overhead.

Dec 16, 2025

Apply

Senior/Staff Software Engineer, Infrastructure

Ivo

Full-time|On-site|San Francisco, California

Join the Crew of Ivo!At Ivo, we are more than just engineers; we are the pioneers of the digital seas! Our crew has set sail with groundbreaking innovations that have reshaped the landscape of legal tech:• An AI agent that seamlessly integrates with MS Word to enhance your documents [2023]• Transitioning from traditional embedding models to agentic RAG for superior performance [2023]• Advancing large-scale LLM-driven legal fact extraction [2024]• A legal assistant capable of accurately searching vast contract databases [2024]• Clustering legal documents from the same lineage [2025]• Implementing automatic deviation analysis to uncover hidden risks in extensive contract databases [2025]• Merging contracts with amendments to create comprehensive “composite” contracts (one of our clients shed tears of joy upon seeing this) [2025]The Role of an Infrastructure EngineerAs an Infrastructure Engineer, you will be the architect of Ivo's platform, ensuring its robustness and scalability.Your mission includes:• Taking ownership of our environment's future, with ample room for creative system design.• Managing numerous customer deployments—every client deserves a unique setup, from containers to databases.• Instrumenting our systems to identify performance bottlenecks and errors.• Aggregating metrics, logs, and health checks into user-friendly dashboards and alerts.• Leading the charge during infrastructure incidents.• Accelerating our CI/CD system (currently a sluggish ~12 minutes—let's speed that up!).If you share our passion for LLMs and thrive in a dynamic environment, we want you to help us push the boundaries of DevOps:• Innovating real-time LLM evaluations to ensure the accuracy of our outputs.• Building upon our existing infrastructure to enhance performance and reliability.Set sail with us at Ivo, where your technical skills will help chart the course for the future of legal technology!

Mar 5, 2026

Apply

Infrastructure Staff Software Engineer

Ivo, Inc.

Full-time|$325K/yr - $405K/yr|On-site|San Francisco

About Ivo, Inc. Ivo, Inc. is based in San Francisco and builds advanced tools for the legal and document management space. The team has delivered recent projects such as: An AI agent for MS Word that streamlines document editing (2023) Agentic RAG for improved embedding model precision (2023) Large-scale LLMs for legal fact extraction (2024) A legal assistant for searching extensive contract databases with accuracy (2024) Clustering techniques for related legal documents (2025) Automatic deviation analysis to uncover risks in large contract sets (2025) Innovative contract merging to create composite contract series for clients (2025) Role Overview: Infrastructure Staff Software Engineer This role shapes the foundation of Ivo’s platform. The Infrastructure Engineer will design, build, and maintain the systems that power our products and support our engineering team. What You Will Do Design and build scalable infrastructure for Ivo’s platform Manage multiple customer deployments, ensuring each client has dedicated containers, databases, and VPCs Instrument systems to identify and resolve performance bottlenecks and errors Aggregate metrics, logs, and health checks into dashboards and alerting systems Lead response to infrastructure incidents and participate in on-call rotations as needed Optimize CI/CD pipelines to reduce deployment times from approximately 12 minutes DevOps and LLM Innovation Ivo values engineers who are eager to experiment and improve. Areas of exploration include: Building real-time LLM evaluation tools to monitor output accuracy Developing autonomous agents to detect and fix production issues before they escalate Contributing new ideas that advance our mission and platform reliability

Apr 14, 2026

Apply

Senior Staff Software Engineer, Storage

Crusoe

Full-time|$240K/yr - $310K/yr|On-site|San Francisco, CA - US

At Crusoe, we are dedicated to accelerating the abundance of energy and intelligence. As a pioneering AI infrastructure company, we control every aspect of our operations — from energy generation to the digital tokens that power the world’s most ambitious AI workloads. Joining Crusoe means being part of a team that is shaping the future at an unprecedented pace.We are amid a transformative industrial revolution. The endless demand for AI computing power poses significant challenges, particularly concerning energy supply. Our energy-first strategy not only enhances AI infrastructure but also contributes positively to the environment, empowering innovators in the AI sector.We seek proactive, problem-solving team members who recognize the scale of our mission and are eager to navigate uncharted territories. If you aspire to advance your career alongside experts in energy, manufacturing, data center construction, and cloud services, we invite you to become part of our dynamic team.If you are ready to engage in the most impactful work of your career, assist our customers and partners in elevating their AI strategies, and contribute to a high-performing, supportive team, we welcome you to build the future with us at Crusoe.About This RoleThe Cloud Storage team at Crusoe is searching for a Senior Staff Software Engineer to act as the principal architect for our storage strategy. Unlike a Staff Engineer who leads feature development, a Senior Staff Engineer will define the long-term technical roadmap essential for our AI-scale infrastructure. You will play a crucial role in establishing the architectural strategy, ensuring the integrity and global scalability of our specialized storage services. Your work will focus on the underlying physics of the stack, bridging high-performance NVMe hardware with globally distributed object storage solutions that compete with S3.Your ResponsibilitiesArchitectural Vision & Strategy: Lead the development and execution of the long-term technical strategy for Crusoe's storage engine, while identifying and integrating industry trends such as CXL and NVMe-oF into a unified roadmap.System Programming Expertise: Utilize your extensive experience in system programming with languages such as C, C++, Go, and Rust to lay the groundwork for our V2 storage re-architecture.Storage Protocols: Design and implement solutions employing industry-standard storage protocols, including NFS, SMB, iSCSI, and NVMe/TCP.

Apr 8, 2026

Apply

Senior Staff Infrastructure Engineer

TwelveLabs

Full-time|On-site|San Francisco

Who We Are:TwelveLabs is at the forefront of developing innovative multimodal foundation models that enable video comprehension akin to human understanding. Our groundbreaking models have set new benchmarks in video-language modeling, enhancing our capabilities and revolutionizing how we engage with and analyze diverse media formats.With an impressive $107 million in Seed and Series A funding, we're supported by premier venture capital firms including NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, alongside influential AI pioneers like Fei-Fei Li, Silvio Savarese, and Alexandr Wang. Our headquarters in San Francisco, complemented by a significant presence in Seoul, highlights our dedication to fostering global innovation.We celebrate the individuality of every team member’s journey, believing that the diverse cultural, educational, and life experiences of our employees fuel our ability to challenge the status quo. We seek passionate individuals who resonate with our mission and are eager to make a significant impact as we advance technology to reshape the world. Join us in redefining video understanding and multimodal AI.About the RoleAs a Senior Staff Infrastructure Engineer at TwelveLabs, you will leverage your technical expertise and leadership skills to construct the systems that drive our multimodal foundation models. Your focus will be on designing and enhancing a scalable, secure, and high-performance infrastructure that accommodates extensive AI workloads across both cloud-based and on-premises environments.This position demands strong technical acumen, an eagerness to delve into low-level systems when necessary, and the capability to influence infrastructure strategy through hands-on contributions and operational improvements. Your impact will be felt through your technical expertise and the results you deliver, rather than through hierarchical status, in a dynamic and fast-paced environment.In this role, you will:Architect and advance cloud and hybrid infrastructure, blending hands-on execution with technical leadership.Guide the development of AI/ML infrastructure components, engaging directly in critical tasks when necessary.Define infrastructure standards and abstractions while maintaining close interaction with production systems.Collaborate closely with Machine Learning Engineers, Data Scientists, Backend Developers, and other key stakeholders to ensure system alignment and efficiency.

Oct 10, 2025

Apply

Senior / Staff Infrastructure Engineer

Apiphany

Full-time|$160K/yr - $300K/yr|On-site|San Francisco

About ApiphanyApiphany is a trailblazing AI company focused on revolutionizing physical product development. We empower innovators across automotive, aerospace, medtech, and energy sectors to convert vast unstructured technical data into real-time, actionable insights. Supported by elite investors including Markforged, Databricks, GM, and Character, our mission is to transform engineering decision-making, turning complexity into simplicity for leading manufacturers worldwide.Our advanced models are designed to address the intricacies of engineering and manufacturing, comprehending physics principles, design specifications, and program constraints. Our small, elite team consists of builders hailing from prestigious institutions such as Stanford, Berkeley, MIT, UW, and CMU, along with industry veterans from GM, Ford, and Genesis Therapeutics. We are committed to advancing hard-tech and establishing a market-leading company together.About the RoleIn the role of Senior / Staff Infrastructure Engineer at Apiphany, you will architect, build, and manage the infrastructure that underpins our intelligence platform. Your responsibilities will encompass secure, reliable, and scalable cloud deployments, including the unique challenge of deploying across both internal and customer-managed cloud environments.You will ensure our systems adhere to stringent requirements for latency, availability, and compliance within data-intensive environments. Additionally, you will shape our security strategy, implement infrastructure-as-code practices, and establish a solid foundation enabling engineering teams to deliver with assurance.

Oct 23, 2025

Apply

Software Engineer, Infrastructure

Sierra

Full-time|On-site|San Francisco, CA

About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.

Oct 15, 2025

Apply

Infrastructure Software Engineer

Exa

Full-time|On-site|San Francisco, California

At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.

Sep 3, 2025

Apply

Senior Software Engineer, Infrastructure

Serval

Full-time|On-site|San Francisco

Who We AreServal is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer—a system of agents that integrates across business functions, maintaining the momentum of modern companies.We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.Role OverviewAs a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.What You'll DoDesign, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.Provide technical support and guidance to enterprise customers during installation and deployment phases.

Jan 29, 2026

Apply

Staff Software Engineer, Model Serving

Databricks

Full-time|$192K/yr - $260K/yr|On-site|San Francisco, California

At Databricks, we are dedicated to empowering data teams to tackle the most challenging problems in the world — from realizing the future of transportation to fast-tracking medical innovations. We accomplish this by developing and operating the premier data and AI infrastructure platform, enabling our customers to harness profound data insights for business enhancement. Our Model Serving product equips organizations with a cohesive, scalable, and governed solution for deploying and managing AI/ML models — ranging from traditional machine learning to intricate proprietary large language models. It ensures real-time, low-latency inference, governance, monitoring, and lineage. As the adoption of AI surges, Model Serving stands as a fundamental component of the Databricks platform, allowing customers to operationalize models at scale with robust SLAs and cost efficiency. In the role of Staff Engineer, you will significantly influence both the product experience and the core infrastructure of Model Serving. Your responsibilities will include designing and constructing systems that facilitate high-throughput, low-latency inference across CPU and GPU workloads, steering architectural strategies, and collaborating extensively with platform, product, infrastructure, and research teams to create an exceptional serving platform.

Jan 30, 2026

Apply

Software Engineer - Technical Staff Member

Mithril

Full-time|$170K/yr - $230K/yr|On-site|Palo Alto / San Francisco Bay Area

Mithril is building AI infrastructure to make GPU computing accessible for enterprises, AI startups, and research organizations. The company’s customers include LG AI Research, Saronic, and the Broad Institute. Mithril was founded by a former Google DeepMind research scientist and a Stanford CS PhD, and has raised $80 million in seed and Series A funding from Sequoia Capital, Lightspeed Venture Partners, and others. Platform revenue has grown more than sixfold in the past year. Fast Company recognized Mithril as the 8th Most Innovative Company in Artificial Intelligence for 2026. The team is transitioning from bare-metal operations to a cloud-native, multi-provider platform, introducing an auction and flexibility model. This is an opportunity to help shape the platform from its early stages. Role overview The Software Engineer - Technical Staff Member will work across three main areas: Consumption: Developer-facing product, billing, and API Platform: Orchestration and marketplace solutions Supply: Cloud provider integrations and capacity management Engineers at Mithril take on significant ownership, building features end-to-end that support critical customer workloads and drive revenue. The scope includes backend systems, marketplace logic, and customer interfaces. Architectural decisions here have a direct impact on Mithril’s growth and scalability. What makes this role unique This position blends deep systems work with product-facing challenges. Engineers contribute to the orchestration engine that manages GPU capacity across providers, as well as the interfaces customers use to reserve, bid, and utilize resources. The systems built in this role handle financial transactions, real workloads, and market mechanisms such as spot auctions, reservation pricing, and capacity allocation. For those interested in the mechanics of GPU infrastructure markets and building the technology behind them, this role offers direct involvement. Location This role is based in Palo Alto or the San Francisco Bay Area.

Apr 22, 2026

Apply

Software Engineer, Infrastructure

Imprint

Full-time|On-site|San Francisco

About UsAt Imprint, we are revolutionizing the world of co-branded credit cards and innovative financial solutions, focusing on smarter, more rewarding, and brand-first experiences. We collaborate with renowned brands such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to establish modern credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our robust platform integrates advanced payment technologies, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the complexities of becoming a bank.Co-branded credit cards represent over $300 billion in U.S. annual spending, yet many are still managed by outdated banking systems. Imprint stands as the modern alternative—flexible, technology-driven, and tailored for today’s consumers. Supported by notable investors like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a world-class team dedicated to reshaping payment methods and driving brand growth. If you thrive in fast-paced environments, enjoy tackling complex challenges, and aspire to make a significant impact, we would be delighted to meet you.Discover more about us on Imprint's Technology Blog.The TeamThe Tech Platform Engineering Team at Imprint is pioneering the democratization of access to advanced technologies, empowering teams across our organization to innovate and excel. Our commitment to redefining the Fintech landscape drives us to build secure, highly available infrastructures while equipping our engineers with comprehensive development tools, allowing them to rapidly create world-class products.Your RoleDesign, build, and manage cloud and web infrastructure with a strong emphasis on security, reliability, and scalability.Implement and maintain infrastructure components across computing, networking, and data platforms.Adhere to security best practices in cloud infrastructure, ensuring proper access control, network isolation, and secure communication between services.Monitor system health and engage in incident response, root cause analysis, and reliability enhancements.Collaborate with platform, security, and product engineers to deliver safe and efficient infrastructure solutions.

Jan 16, 2026

Apply

Backend & Infrastructure Software Engineer

vooma

Full-time|On-site|San Francisco Office

About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.

Jul 1, 2025

Create account — see all 5,991 results