Data Infrastructure Engineer jobs in San Mateo – Browse 471 openings on RoboApply Jobs

Data Infrastructure Engineer

Zaimler | San Mateo, CA

On-site Full-time

Qualifications

Proficient in programming languages such as Python, Java, or Scala. Experience with distributed data processing frameworks (e.g., Apache Kafka, Apache Spark). Strong understanding of database technologies (SQL and NoSQL). Ability to design scalable data architectures. Knowledge of data modeling and ETL processes. Excellent problem-solving skills and attention to detail. Strong communication skills and ability to work in a team-oriented environment.

About the job

About Zaimler

In a world where AI agents struggle to reason over fragmented data, Zaimler emerges as the solution. Our mission is to unify disparate enterprise data across countless systems, providing a shared context, meaning, and structure. This transformation is essential as we transition from traditional copilots to fully autonomous agents, necessitating a new infrastructure layer that we are dedicated to building.

At Zaimler, we are pioneering context infrastructure for the agentic era—a platform that autonomously discovers domain knowledge, maps intricate relationships, and equips AI agents with the semantic understanding required for precise and scalable operations. Envision knowledge graphs that facilitate real-time inference, tailored for systems that need to reason rather than merely retrieve data.

Founded by industry veterans Biswajit Das (former VP Engineering at Truera and Chief Architect at Visa) and Sofus Macskassy (ex-Director of Engineering at LinkedIn), who notably built one of the largest knowledge graphs in production, Zaimler is a small, senior team at the seed stage, collaborating with major enterprises in sectors like insurance, travel, and technology. If you are passionate about creating the infrastructure that will support the next decade of AI advancements, we are eager to connect with you.

The Role

We are in search of a talented Data Infrastructure Engineer to establish the foundational distributed data layer that will power our semantic platform. In this role, you will be responsible for designing, building, and scaling systems that enable high-throughput data ingestion, transformation, and real-time processing.

About Zaimler

Zaimler is at the forefront of transforming AI capabilities by integrating fragmented enterprise data into a cohesive system. Our innovative approach fosters the development of intelligent agents capable of autonomous reasoning, setting the stage for the future of AI technology.

Similar jobs

1 - 20 of 471 Jobs

Data Infrastructure Engineer

Zaimler

Full-time|On-site|San Mateo, CA

Sep 3, 2025

Software Engineer - Data Infrastructure

Maxima

Full-time|On-site|San Mateo

Join Our Team at Maxima

At Maxima, we are pioneering an innovative AI platform designed to automate enterprise accounting processes. Our solution manages vast amounts of financial data and intricate accounting workflows, and guarantees robust execution that is both precise and dependable.

We are addressing some of the most challenging issues in enterprise automation, and we have attracted a stellar engineering team composed of industry experts from renowned companies like Robinhood, Glean, Google, Netflix, and Meta. Backed by prestigious investors such as Kleiner Perkins and Redpoint Ventures, we are proud to serve leading clients like Scale AI and Rippling.

Your Role

As a Software Engineer specializing in Data Infrastructure, you will be instrumental in developing and scaling systems that efficiently ingest and process large financial datasets. This foundational platform supports all accounting workflows and the agentic system. Your responsibilities will include designing our data lakes, constructing multi-tenant relational databases, and exploring optimal search engine and vector database configurations to enhance various workflows.

Nov 7, 2025

Senior Software Engineer - Data Infrastructure, Safety

Roblox Corporation

Full-time|On-site|San Mateo, CA, United States

Roblox Corporation seeks a Senior Software Engineer focused on Data Infrastructure and Safety in San Mateo, CA. This position plays a key part in maintaining the reliability and performance of the Roblox platform, with a strong focus on user protection and a secure environment.

Role overview

This engineer will design and build scalable data infrastructure to support Roblox’s continued growth. The work centers on improving data quality and reliability, ensuring the platform remains robust as user numbers increase. Collaboration with teams from various disciplines is essential to identify, investigate, and resolve safety-related issues. System architecture decisions made in this role will directly influence user safety and experience.

Responsibilities

Develop and implement scalable data infrastructure solutions for the Roblox platform.
Enhance data quality and reliability across systems.
Work with cross-functional teams to address and resolve safety issues.
Contribute to architectural decisions that impact user safety and overall experience.

Requirements

Significant experience in software development, data management, and system architecture.
Proven ability to design solutions that scale with the platform’s growth.
Strong collaboration skills, especially for addressing safety concerns across teams.

This role directly influences the safety and experience of millions of Roblox users, supporting the company’s ongoing commitment to a secure and engaging platform.

Apr 27, 2026

Software Engineer: Infrastructure

Generalist

Full-time|On-site|San Francisco Bay Area (San Mateo) or Boston (Somerville)

About the Role

This position is pivotal in overseeing infrastructure across our entire tech stack. If it exists in the cloud, it falls under your purview. In the world of robotics, data is essential, and we require robust, scalable infrastructure to manage, store, and process vast amounts of this data. The APIs, services, and monitoring systems you will manage are critical to our operations.

Your Responsibilities Include:

Managing compute resources (both CPU and GPU) to efficiently process petabytes of data at high throughput.
Overseeing the infrastructure required for data processing and storage.
Ensuring the security and integrity of our infrastructure and data.

You Will Excel in This Role If You Have:

A minimum of 5 years of experience in managing large-scale cloud infrastructure using tools such as Kubernetes and Terraform, with a primary focus on Python services.
Deep understanding of AWS services (or their equivalents) and their permission models.
Strong perspectives on the effective use of coding agents within an infrastructure context.

Feb 12, 2026

Senior Data Engineer, Ads

Roblox

Full-time|$242.1K/yr - $293.8K/yr|On-site|San Mateo, CA, United States

Join the vibrant community of Roblox, where millions of users come together daily to explore, create, play, learn, and connect through immersive 3D digital experiences crafted by our global network of developers and creators.

At Roblox, we’re dedicated to creating the tools and platform that empower our users to bring their imaginative experiences to life. Our vision is to revolutionize how people connect, regardless of their location or device. Our mission is to unite a billion people in a spirit of positivity and civility, and we are seeking extraordinary talent to help us achieve this goal.

A career at Roblox means you will be at the forefront of shaping the future of human interaction, tackling unique technical challenges at scale, and contributing to the creation of safer, more respectful shared experiences for all.

About the Role:

As Roblox continues to experience unprecedented growth in daily active users, we are looking for a Senior Data Engineer to help build the next generation of Ad Data and ML feature infrastructure. Our team plays a pivotal role in the core advertising systems at Roblox, driving high-scale, high-throughput data systems that enhance ad personalization, ranking, and measurement. In this position, you will design and construct foundational data pipelines, real-time streaming systems, and scalable feature computation frameworks. Your contributions will facilitate rapid experimentation, empower ranking models, and support personalized ad delivery on a massive scale. You will work closely with engineers across machine learning, backend, and analytics teams to innovate the future of ads data infrastructure at Roblox. This role is ideal for individuals who possess a strong proficiency in managing large-scale systems and data-driven applications and are passionate about developing impactful solutions.

Feb 10, 2026

Senior Infrastructure Software Engineer

Skydio

Full-time|$170K/yr - $170K/yr|On-site|San Mateo, California, United States

Skydio, a premier drone manufacturer based in the United States, stands at the forefront of autonomous flight technology, paving the way for the future of drones and aerial mobility. Our diverse team merges profound expertise in artificial intelligence with top-tier hardware and software development, operational excellence, and a relentless focus on customer satisfaction. We empower a wide array of drone users, from utility inspectors to first responders and military personnel, to leverage our cutting-edge technology in various scenarios.

About the Team:

The Skydio Cloud Infrastructure team is dedicated to ensuring the Skydio Cloud platform is consistently available to our users at critical moments, whether conducting routine inspections or supporting rescue missions during emergencies. With a global fleet of thousands of drones, we are committed to continuous improvement, emphasizing robust delivery and testing pipelines as vital components of our operations.

About the Role:

As a Senior Infrastructure Engineer focused on an innovative product, you will play a pivotal role in maintaining our Kubernetes fleet and enhancing the core product software to meet evolving use cases. This position blends software engineering and infrastructure management, allowing you to address product deficiencies directly rather than solely relying on automation. We seek a professional who thrives on the autonomy to influence architecture, security, and functionality across the entire stack.

Your Impact:

Re-engineer and sustain the expanding requirements of our Kubernetes fleet and its underlying infrastructure.
Enhance and broaden the continuous delivery processes for our product.
Collaborate across teams (hardware to cloud) to introduce new capabilities to the platform.
Engage directly with security teams to refine practices and controls that safeguard our customers' data and drones.
Lead cost-saving initiatives early in the product lifecycle to ensure scalability.

Mar 4, 2026

Infrastructure Software Engineer

Skydio

Full-time|$140K/yr - $140K/yr|On-site|San Mateo, California, United States

Skydio builds autonomous drones for a wide range of users, from utility inspectors and first responders to military personnel in the field. Based in San Mateo, California, Skydio combines artificial intelligence expertise with advanced hardware and software development, always focused on customer needs.

About the Cloud Infrastructure Team

The Cloud Infrastructure group keeps Skydio’s platform available whenever and wherever it’s needed, whether for routine inspections or urgent disaster response. With thousands of drones deployed worldwide, the team continually improves how infrastructure is delivered and updated.

Role Overview

The Infrastructure Software Engineer manages and evolves Skydio’s Kubernetes fleet, making key software changes to support new and changing requirements. This hybrid role spans both infrastructure and software, offering the chance to shape product architecture, security, and performance. The position suits someone who enjoys working across the stack and tackling a mix of challenges.

What You’ll Do

Redesign and maintain a growing Kubernetes fleet and its supporting systems.
Improve and expand the continuous delivery pipeline for Skydio’s products.
Work with teams from hardware to cloud to introduce new platform features.
Partner with security experts to strengthen data and drone protection measures.
Introduce cost-saving strategies early in the product lifecycle to support long-term growth.

What We’re Looking For

At least 2 years of experience in infrastructure or software engineering.
Hands-on knowledge of Kubernetes and cloud platforms.
Strong analytical and problem-solving skills, with a collaborative approach.
Drive for innovation and a high standard of quality in your work.

Location: San Mateo, California, United States

Apr 17, 2026

Data Engineer at Tarro | San Mateo, CA

Tarro

Full-time|$180K/yr - $350K/yr|On-site|San Mateo, CA

About Us

At Tarro, we create innovative solutions that empower small brick-and-mortar restaurants by alleviating the operational challenges of managing their businesses. Our comprehensive ecosystem connects these restaurants with their customers through AI-driven order processing, seamless delivery solutions, integrated payment systems, and advanced point-of-sale software. We blend cutting-edge technology with the human touch to address the pressing issues faced by small business owners.

Our customer-centric approach drives everything we do. When our clients thrive, we thrive. The U.S. restaurant industry is a massive $1 trillion market, yet it remains largely underserved by technology. While large chains can afford sophisticated tools that give them a competitive edge, we believe that small restaurant owners should also have access to top-tier technology at a reasonable cost.

With nearly a decade of profitability and a remarkable 5x revenue growth over the past four years, Tarro was valued at $450M in our latest fundraising round in mid-2022. We have experienced substantial growth in customer acquisition, product innovation, and team expansion. Thousands of dedicated restaurants trust Tarro to help them succeed, and we are proud to have served nearly 20 million customers. Recognized as one of Built In’s top companies to work for in 2023, we invite you to learn more about our culture and values. We are committed to helping restaurants not just survive, but thrive.

What We’re Looking For

We are in search of a talented Data Engineer to join our team and help us build and scale Tarro’s data infrastructure as we expand our product offerings and customer base. In this pivotal role, you will significantly influence how data is gathered, structured, validated, and utilized throughout the organization.

This is a unique opportunity to make a substantial impact in a dynamic, profitable startup environment where you will have genuine ownership of your work, allowing you to quickly see the effects of your contributions on product and business outcomes.

What You’ll Accomplish

Develop and enhance Tarro’s core data infrastructure and essential datasets.
Design, implement, and manage scalable data pipelines that cater to analytics, machine learning, experimentation, and product applications.
Guarantee data integrity and reliability through robust audits, alert systems, and anomaly detection mechanisms at scale.

Jan 20, 2026

Core Infrastructure Software Engineer

Genesis Therapeutics

Full-time|On-site|San Mateo, CA

At Genesis Therapeutics, we are on a mission to revolutionize drug discovery by harnessing the power of machine learning, biophysical simulation, and computational chemistry. We are assembling a top-tier computational team and seek a passionate Infrastructure Engineer to contribute to the development of innovative medicines while playing a pivotal role in enhancing our AI platform.

Your Responsibilities

Collaborate with the infrastructure team to sustain and expand our multi-cloud compute infrastructure that underpins ML model training, computational chemistry research, and ongoing drug discovery initiatives.
Develop configuration and procedures for monitoring, resource allocation, and deployment automation to scale our autoscaling compute clusters for larger workloads.
Enhance our orchestration scheduling framework to boost execution throughput, reliability, and compute utilization across diverse pipelines.

Your Qualifications

A minimum of 5 years of experience in building and maintaining large-scale cloud infrastructure, preferably in AWS or GCP.
Strong proficiency in Python, Bash, Terraform, Ray, and Kubernetes.
Experience in constructing and maintaining compute clusters for distributed ML training jobs utilizing 1,000+ GPUs is highly desirable.
Hands-on experience with physical hardware and datacenter management is a plus.

What We Offer

An opportunity to work on impactful infrastructure that accelerates the discovery of new medicines.
Join a world-class, close-knit team of dedicated professionals across software, machine learning, computational chemistry, medicinal chemistry, and biology.
Competitive salary and equity, along with comprehensive medical, dental, and vision insurance, and a 401(k) program.

Jan 27, 2026

Infrastructure Software Engineer at Genesis Molecular AI | San Mateo

Genesis Therapeutics

Full-time|On-site|San Mateo, CA

At Genesis Therapeutics, we are at the forefront of revolutionizing drug discovery by harnessing the power of machine learning, biophysical simulations, and computational chemistry. We are actively seeking a passionate Infrastructure Engineer to join our elite computational team. In this role, you will contribute to the development of groundbreaking medicines and play a pivotal part in the expansion of our advanced AI platform.

Your Role:

Collaborate with our infrastructure team to enhance and maintain our extensive multi-cloud compute infrastructure, which is vital for ML model training, computational chemistry research, and drug discovery initiatives.
Develop and implement configurations and procedures for monitoring, resource allocation, and deployment automation to adapt to the growing demands of our autoscaling compute clusters.
Contribute to the orchestration scheduling framework to boost execution throughput, increase reliability, and optimize compute utilization across diverse pipelines.

Your Qualifications:

Minimum of 5 years of experience in building and maintaining scalable cloud infrastructure, particularly with AWS or GCP.
Proficient in Python, Bash, Terraform, Ray, and Kubernetes.
Experience with distributed ML training jobs on compute clusters with over 1,000 GPUs is highly desirable.
Hands-on experience with physical hardware and data center management is a plus.

What We Offer:

An opportunity to engage with impactful infrastructure that accelerates the discovery of new medicines.
Be part of a world-class, close-knit team of dedicated professionals across software, machine learning, computational chemistry, medicinal chemistry, and biology.
Competitive salary and equity options, alongside comprehensive medical, dental, and vision coverage, plus a 401(k) retirement plan.

Jan 27, 2026

Machine Learning Infrastructure Software Engineer

Genesis Therapeutics

Full-time|On-site|San Mateo, CA

Join Our Innovative Team

At Genesis Therapeutics, we are a dynamic and passionate group of drug discovery experts, deep learning researchers, and software engineers dedicated to revolutionizing biochemistry through AI. Our mission is clear: to uncover and develop transformative therapies for patients with severe medical conditions.

Our AI team is at the forefront of creating foundational models for small molecule drug discovery. We conduct cutting-edge research that bridges machine learning, physics, and computational chemistry, while building resilient software systems capable of executing large-scale simulations and training advanced generative and predictive AI models utilizing our powerful cluster of thousands of GPUs and tens of thousands of CPUs.

Your Role

We are on the lookout for skilled ML infrastructure engineers to propel our machine learning research initiatives, particularly in generative modeling of molecular systems, which is vital to our overarching goals.

In this position, you will spearhead the rapid advancement of our AI platform and infrastructure, enhancing performance, efficiency, and scalability to unprecedented levels. You will construct expansive distributed training and inference pipelines, essential MLOps tools and frameworks, and fine-tune GPU operations to accelerate ML model performance.

Genesis fosters a collaborative and interdisciplinary environment, allowing you to work closely with our talented engineers, researchers, and scientists.

Your Responsibilities

Drive engineering initiatives aimed at the continuous enhancement of our AI platform, focusing on the rapid development of scalable and robust distributed infrastructures for ML training, inference, and evaluation.
Facilitate model training and deployment across various clusters and cloud environments, optimizing for throughput and cost-effectiveness.
Maximize the efficiency of ML models and other workloads in terms of latency, throughput, and memory usage, particularly through GPU performance engineering, pushing the boundaries of current hardware capabilities.
Contribute to the long-term strategic vision for Genesis’ infrastructure platform.

Your Profile

A strong engineering background with a focus on machine learning infrastructure.

Nov 24, 2025

Platform Infrastructure Software Engineer

Verkada

Full-time|$120K/yr - $220K/yr|On-site|San Mateo, CA, United States

About Us

At Verkada, we are revolutionizing the way organizations safeguard their people and property through an integrated, AI-powered platform. As a frontrunner in cloud physical security, Verkada empowers more than 30,000 organizations globally (including over 100 Fortune 500 companies) to enhance their safety and operational efficiency via a unified software platform that offers solutions for video surveillance, access management, air quality monitoring, alarms, intercoms, and visitor management.

Founded in 2016, Verkada has experienced rapid growth, boasting 15 offices and a dedicated team of over 2,200 employees.

The Role

Join our innovative cloud infrastructure team, where you will play a crucial role in designing, building, and maintaining highly scalable, reliable systems that power Verkada’s services. You will have the chance to work on exciting projects such as scaling microservice clusters, automating serverless deployments, adopting a full service mesh, and enhancing system observability. Take charge of a subdomain and lead collaborative efforts across teams.

This position requires your presence at our headquarters located in San Mateo, CA, as we are dedicated to fostering a vibrant in-office culture.

Feb 9, 2026

Founding Cloud Infrastructure Engineer

Zaimler

Full-time|On-site|San Mateo, CA

About Zaimler

Zaimler is at the forefront of transforming the way enterprise data is utilized in the era of AI. Our mission is to eliminate the fragmentation of data across disparate systems, providing AI agents with the contextual understanding necessary to operate efficiently and effectively. We are pioneering an innovative infrastructure layer that will redefine the capabilities of autonomous agents, enabling real-time inference through advanced knowledge graphs.

Founded by industry veterans Biswajit Das and Sofus Macskassy, Zaimler is a seed-stage startup focused on delivering cutting-edge solutions to major enterprises across various sectors, including insurance, travel, and technology. If you're passionate about building the foundational infrastructure for the next generation of AI, we invite you to join our small, experienced team.

Your Role

As our Founding Cloud Infrastructure Engineer, you will take the lead in designing, constructing, and managing the cloud infrastructure that underpins Zaimler’s semantic platform. This is not a maintenance position; it’s an opportunity to create a robust system from the ground up that will shape the future of our operations.

Feb 14, 2025

Software Engineer, AI Infrastructure

Fireworks AI

Full-time|On-site|New York, NY; San Mateo, CA

About Us:

At Fireworks AI, we are at the forefront of creating next-generation generative AI infrastructure. Our cutting-edge platform is recognized for delivering the highest-quality models with unparalleled speed and scalability in inference. Independently benchmarked as a leader in LLM inference speed, we drive significant advancements through innovative projects, including our proprietary function calling and multimodal models. As a Series C company valued at $4 billion and backed by leading investors such as Benchmark, Sequoia, Lightspeed, Index, and Evantic, we are a dynamic team of builders, comprised of veterans from Meta PyTorch and Google Vertex AI.

The Role:

We are seeking a talented Software Engineer to join our AI Infrastructure team. In this pivotal role, you will contribute to designing and developing the foundational systems that power Fireworks AI’s generative AI platform. Your focus will be on building robust infrastructure and tools that guarantee the reliability, performance, quality, and availability of our AI systems.

Our mission is to establish Fireworks AI as the most dependable and user-friendly generative AI platform globally. You will collaborate closely with our cloud infrastructure, product, and performance teams to create infrastructure solutions that connect our customers with the high-performance proprietary Fireworks inference engine.

Key Responsibilities:

Design and develop scalable backend infrastructure supporting distributed training, inference, and data pipelines.
Build and maintain essential backend services, including LLM CI/CD pipelines, control planes, and model serving systems.
Enhance performance optimization, cost efficiency, and reliability across compute, storage, and networking layers.
Create frameworks and safeguards to ensure Fireworks AI maintains the highest model quality in the industry.
Work alongside performance, training, and product teams to translate research and product requirements into effective infrastructure solutions.
Engage in code reviews, technical discussions, and continuous integration and deployment processes.

Mar 5, 2026

Data Platform Engineer - Member of Technical Staff

Fireworks AI

Full-time|$175K/yr - $220K/yr|On-site|San Mateo, CA

About Us:

At Fireworks, we are pioneering the future of generative AI infrastructure. Our platform is recognized for delivering the highest-quality models with the industry's fastest and most scalable inference capabilities. Independently benchmarked as the leader in LLM inference speed, we are at the forefront of innovation, working on advanced projects including our proprietary function calling and multimodal models. As a Series C company valued at $4 billion, we are backed by esteemed investors such as Benchmark, Sequoia, Lightspeed, Index, and Evantic. Our team is made up of ambitious builders, including veterans from Meta PyTorch and Google Vertex AI.

The Role

We are seeking a skilled Data Platform Engineer specializing in Order-to-Cash (OTC) Revenue Transformation and AI Application Enablement. This role involves owning and evolving the full billing, revenue, and business data pipeline, from usage metering and invoice generation to revenue recognition and financial reporting. You will sit at the intersection of Engineering, Finance, and Data, ensuring that every dollar generated across our five revenue streams is precisely captured, billed, recognized, and reconciled.

This impactful, cross-functional role requires hands-on engagement with our billing platform (e.g., Orb), accounting systems, data warehouse (BigQuery), and cloud marketplaces (AWS, GCP). Ultimately, you will contribute to the design of AI-enabled workflow agents that automate reconciliation, anomaly detection, and revenue operations once the core data infrastructure is fortified.

Apr 7, 2026

Software Engineer: Machine Learning Infrastructure

Generalist

Full-time|On-site|San Francisco Bay Area (San Mateo) or Boston (Somerville)

About the Role

At Generalist, we are at the forefront of training expansive robot foundation models, leveraging cutting-edge GPU hardware, primarily from Nvidia, to execute distributed training tasks and experimental research. Our operations demand exceptional storage solutions and optimized data loading processes, necessitating the full utilization of cloud infrastructure alongside custom-built solutions.

In this role, you will take charge of our inference infrastructure. Our robotic systems rely on a dedicated fleet of on-premises GPUs designed for demanding real-time computations and latency-sensitive applications within resource-constrained environments.

Your Responsibilities:

Manage and optimize our GPU compute fleets.
Facilitate user-friendly access to GPUs for researchers, ensuring optimal utilization.
Enhance ML data loading, transport, and storage systems in extensively utilized distributed environments.
Oversee the orchestration of our robot inference fleets.

You May Excel in This Position If You:

Have experience managing large GPU fleets for large-scale, distributed training or inference.
Possess significant expertise in using Slurm or Kubernetes for ML workload orchestration.
Have developed high-scale ML data loaders and preparation systems.
Understand the intricacies of ML hardware, storage, and networking systems.
Are familiar with the Nvidia GPU ecosystem.

Feb 12, 2026

Senior Hardware Engineer - GPU & AI Infrastructure

Roblox

Full-time|$242.1K/yr - $293.8K/yr|On-site|San Mateo, CA, United States

Join the dynamic world of Roblox, where millions engage daily in exploring, creating, learning, and connecting through immersive 3D experiences crafted by a global community of developers and creators.

At Roblox, we are committed to building innovative tools and platforms that empower our community to realize their creative visions. Our mission is to transform how individuals connect, regardless of geographical boundaries, and across any device. We strive to foster connections among a billion users with positivity and respect, and we are actively seeking exceptional talent to help us achieve this goal.

A career at Roblox is an opportunity to influence the future of human interaction, tackle unique technical challenges at scale, and contribute to creating safer, more respectful shared experiences for everyone.

As a vital member of our Infrastructure Foundation Hardware Engineering team, you will lead the charge in delivering a reliable, high-performance, and cost-effective infrastructure that supports the world’s play. In this specialized role, you will act as the technical lead for our GPU and AI accelerator ecosystem, managing the entire lifecycle of GPU hardware, from architectural evaluation and firmware qualification to large-scale fleet integration and performance optimization. Your expertise will ensure that Roblox's extensive rendering and machine learning workloads operate on the most efficient and stable hardware available.

Your Responsibilities Will Include:

Architect & Prototype: Develop next-generation GPU-accelerated hardware platforms, ensuring seamless integration between high-density compute nodes, high-speed interconnects (NVLink/PCIe Gen5/6), and system firmware.
GPU Optimization: Lead the integration, performance testing, and debugging of GPUs within our fleet, focusing on hardware-level optimizations, driver tuning, and thermal/power management.
Validation & Certification: Create and implement comprehensive evaluation and stress-testing strategies for GPU-centric server platforms to meet Roblox's unique requirements for real-time rendering and low-latency AI inference.
Firmware & Systems: Spearhead firmware qualification (BIOS/BMC) and troubleshooting, along with implementing automation systems to monitor GPU health and manage firmware updates.
Vendor Collaboration: Collaborate with technology partners to enhance our GPU and AI infrastructure.

Feb 10, 2026

Engineering Manager - Networking Infrastructure at Roblox

Roblox

Full-time|$293.8K/yr - $343.3K/yr|On-site|San Mateo, CA, United States

Join Roblox, where millions of users engage daily to explore, create, play, learn, and connect within 3D immersive digital environments fueled by our global community of innovators.

At Roblox, we’re committed to developing tools and a platform that empower our community to turn their creative visions into reality. Our mission is to rethink how people connect from anywhere in the world, on any device. We aim to unite a billion people with a spirit of optimism and civility, and we are on the lookout for exceptional talent to help us achieve this goal.

A role at Roblox means you will contribute to shaping the future of human interaction, address unique technical challenges at scale, and play a part in fostering safer, more respectful shared experiences for all.

Roblox is redefining how individuals come together to connect, create, and express themselves. To support our extensive scale, we leverage microservices architecture. The Application Networking team is responsible for connecting and securing these services.

Within this team, the Gateway Team acts as the "Front Door" of Roblox, overseeing critical infrastructure that manages all traffic entering Roblox (Ingress) and facilitates traffic across significant architectural boundaries.

You Will: Steer the Gateway team towards delivering top-tier traffic management infrastructure. Lead the design and execution of ambitious "Moonshot" initiatives. Streamline our ingress stack and "Platformize" the gateway to enhance extensibility for developer teams. Oversee the reliability of Tier-0 systems. Guide engineers and team leads, ensuring sustainable on-call rotations and promoting continuous career advancement.

Feb 10, 2026

Research Engineer - Generative AI Infrastructure

Fireworks AI

Full-time|$175K/yr - $240K/yr|On-site|San Mateo, CA

About Us:

At Fireworks AI, we are pioneering the future of generative AI infrastructure. Our innovative platform provides top-tier models with unmatched inference speed and scalability. Recognized as a leader in LLM inference, we are committed to driving transformative advancements through projects like our proprietary function calling and multimodal models. As a Series C company valued at $4 billion, we are proudly supported by esteemed investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. Our dynamic team, composed of experts from Meta PyTorch and Google Vertex AI, is united in our ambition to redefine AI technology.

The Role:

As a Member of the Technical Staff within our Research team, you will explore the frontiers of generative AI, enhancing LLMs and multimodal systems through foundational research. Your contributions will improve model efficiency, accuracy, and scalability, directly influencing our high-performance AI infrastructure. Collaborating with leading experts in deep learning, distributed systems, and optimization, you will translate groundbreaking research into practical applications. Your work will empower some of the world's foremost companies in their AI development efforts.

Mar 5, 2026

Data Engineer Intern

Skydio Inc.

Intern|On-site|San Mateo, California, United States

Skydio is at the forefront of the drone revolution, recognized as the leading U.S. drone manufacturer and a pioneer in autonomous flight technology. Our innovative team is dedicated to harnessing artificial intelligence, alongside cutting-edge hardware and software development, to enhance aerial mobility. We are passionate about making drone technology accessible to diverse users, including utility inspectors, first responders, and military personnel in various scenarios.

About the Position:

As a member of Skydio’s Go-to-Market (GTM) AI team, you will be instrumental in developing data systems and AI-enhanced tools that drive our market strategies. This role bridges the fields of revenue operations, AI, and data engineering, allowing you to deliver production-ready solutions that enhance data quality, yield valuable insights, and facilitate intelligent automation. You will focus on crafting structured datasets and internal applications to support AI workflows and enhance the overall GTM technology stack.

Your Contributions:

AI Data Infrastructure: Design and develop structured datasets to empower Large Language Model (LLM) workflows and AI agents. Conduct data transformations, cleanse CRM data, and establish reliable schemas within GTM systems.

Data Pipelines and Automation: Construct pipelines for data ingestion, transformation, and validation from core GTM systems. Enhance data integrity and develop reusable data models for analytics and AI applications.

Internal Tools and Applications: Create user-friendly internal tools utilizing TypeScript and React, enabling non-technical teams to interact with structured data, initiate workflows, and retrieve insights.

GTM Systems Optimization: Assist in improving data quality, enhancing workflows, and promoting operational reliability within Salesforce and related platforms.

Applied AI Enablement: Develop structured object sets and feature tables to enhance LLM performance and reliability, contributing to the establishment of practical AI-driven workflows for GTM teams.

Ideal Candidate Profile:

Passionate about Data: You thrive on working with complex datasets and take pride in transforming disorganized data into structured, reliable systems.

Curiosity in AI: You are eager to explore the full spectrum of applied AI, from data modeling to LLM reasoning and action orchestration.

Innovative Mindset: You have a proactive approach to problem-solving and enjoy building robust solutions.

Feb 27, 2026

1 - 20 of 471 Jobs

Data Infrastructure Engineer

Zaimler

Full-time|On-site|San Mateo, CA

Sep 3, 2025

Software Engineer - Data Infrastructure

Maxima