Experience Level
Senior
Qualifications
Proven experience in software engineering with a focus on cloud infrastructure. Strong programming skills in languages such as Java, Python, or C++. Deep understanding of distributed systems and cloud computing paradigms. Experience with container orchestration tools like Kubernetes and Docker. Excellent problem-solving abilities and a passion for technology. Strong communication skills and the ability to work in a team environment.
About the job
As a Senior Staff Software Engineer specializing in Compute Infrastructure, you'll play a pivotal role in designing, building, and optimizing our cloud infrastructure systems. Your expertise will help drive performance and scalability, ensuring our platforms support the ever-increasing demand for data processing and storage. You will work collaboratively with cross-functional teams to develop innovative solutions that enhance our technology stack.
About LinkedIn
LinkedIn is the world's largest professional network, dedicated to connecting talent and opportunity. Our mission is to empower every member of the global workforce to achieve more through our innovative solutions and services.
Join MatX in Shaping the Future of AI Infrastructure
At MatX, we are pioneering the development of vertically integrated full-stack solutions that span from silicon to sophisticated systems for artificial general intelligence compute platforms. Our unique blend of hardware and software is complemented by a robust infrastructure team that manages essential systems (build systems, CI/CD pipelines, cloud environments, security measures, and developer tooling), enabling our nimble team to deliver results at the speed of a much larger organization.
We are seeking a proactive and experienced Infrastructure and IT Manager who will take ownership of our infrastructure and lead our talented team. In this role, you will engage in writing infrastructure-as-code, debugging build systems, triaging CI failures, and pushing code, while also focusing on hiring, mentoring, and defining the technical direction for our expanding platform team.
Join Commure, Inc. as a Senior Operations Manager specializing in Infrastructure. In this pivotal role, you will oversee the development and execution of operational strategies that enhance our infrastructure capabilities. You will lead cross-functional teams to ensure operational excellence, drive efficiency, and foster a culture of continuous improvement.
Your responsibilities will include managing infrastructure projects from conception to delivery, optimizing processes, and collaborating with various departments to align operations with business objectives.
Full-time|$228.6K/yr - $314.3K/yr|On-site|Mountain View, California
Databricks is seeking a skilled Senior Manager in Infrastructure Data Science to revolutionize our infrastructure through the power of data science. In this role, you will address intricate challenges associated with capacity planning, performance enhancement, reliability engineering, infrastructure optimization, and customer satisfaction. You will lead a talented team of data scientists and collaborate closely with engineering leaders to provide them with actionable, data-driven insights and solutions.
At Databricks, our mission is to empower data teams to tackle the world's most pressing issues, from detecting security threats to advancing cancer treatments. We achieve this by creating and managing the leading data and AI infrastructure platform, allowing our clients to concentrate on the high-impact challenges that are vital to their missions.
Founded in 2013 by the original creators of Apache Spark, Databricks has evolved from a small office in Berkeley, CA to a global powerhouse with over 7,000 employees. Numerous organizations, ranging from startups to Fortune 100 companies, depend on Databricks for their critical workloads, positioning us as one of the fastest-growing SaaS firms worldwide.
Our engineering teams develop highly technical products that meet significant, real-world needs. We continuously push the limits of data and AI technology, all while maintaining the resilience, security, and scalability that are essential for ensuring our customers' success on our platform.
Join LinkedIn as a Staff Technical Program Manager in our Physical Infrastructure team, where you will spearhead critical projects that enhance our infrastructure capabilities. You'll collaborate closely with engineering teams to ensure seamless execution of large-scale initiatives while driving strategic alignment and operational efficiency.
As the Engineering Manager for the Machine Learning Infrastructure team, you will lead the charge in developing a state-of-the-art platform that underpins Moveworks' conversational AI capabilities. This pivotal role is essential for the long-term scalability of our flagship AI product and the overall success of the company.
Your primary responsibility will be to oversee a team of skilled engineers dedicated to constructing, optimizing, and expanding the comprehensive systems that facilitate the entire ML/LLM lifecycle. This encompasses our infrastructure for distributed training and inference, model evaluation frameworks, and LLM latency optimization. You will shape the technical direction of the team, harmonizing the operational requirements of our core ML infrastructure with innovative research to create the next generation of LLMs utilizing advanced generative AI.
The frameworks your team creates will form the backbone for all ML models deployed in production, serving hundreds of millions of enterprise employees. Your efforts will significantly influence the evolution of the Moveworks Enterprise Copilot platform and the trajectory of AI-driven employee services.
About Moveworks
At Moveworks, our mission is to transform language into the universal user interface. We empower enterprises with a conversational interface that seamlessly integrates with all systems, from Microsoft to Workday to Salesforce. Utilizing advanced GPT-class machine learning models, our platform adapts to the unique language of each organization, effectively addressing thousands of use cases. Esteemed brands such as Databricks, Broadcom, DocuSign, and Palo Alto Networks rely on Moveworks’ proprietary enterprise data, ready-to-use solutions, and user-friendly developer tools to implement conversational automation across their businesses. Founded in 2016 and backed by leading investors including Kleiner Perkins and Lightspeed, Moveworks has raised $315 million in funding and reached a valuation of $2.1 billion. We are proud to be recognized as a part of the Forbes AI 50 list for five consecutive years and to have received accolades such as the 2023 Edison Awards for AI Optimized Productivity and the Best Bot Solution at the 2022 AI Breakthrough Awards. With over 500 employees across six global offices, we invite you to be a part of one of the most innovative teams in the industry!
Your Role
As the Senior Engineering Manager for our core infrastructure team, you will spearhead the development of the Moveworks AI infrastructure. As our company grows rapidly, your leadership will be crucial in designing and operating highly reliable and resilient foundational services and frameworks that enable seamless scalability and swift feature development for our engineering teams. You will collaborate closely with teams in machine learning, search, product management, data, and full-stack engineering to identify, define, and implement elegant solutions to complex product and engineering challenges. This is an exceptional opportunity to play a pivotal role at the fastest-growing AI startup in its sector.
Lead the evolution of Moveworks infrastructure architecture and foundational services with a focus on high reliability, scalability, performance, and security. Collaborate with machine learning, search, product, data, and frontend teams to understand their infrastructure needs, influence the infrastructure roadmap, and oversee the execution of various projects. Architect and enhance the core infrastructure and foundational services to support our rapid growth and customer needs.
Full-time|$230K/yr - $292K/yr|Hybrid|Mountain View, CA, USA; San Francisco, CA, USA
Waymo is at the forefront of autonomous driving technology, driven by a mission to become the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has relentlessly pursued the development of the Waymo Driver (The World’s Most Experienced Driver™), aimed at enhancing mobility access while significantly reducing traffic-related fatalities. The Waymo Driver underpins our fully autonomous ride-hailing service and is adaptable across various vehicle platforms and use cases. With over ten million rides successfully completed, our technology has driven over 100 million miles on public roads and tens of billions of miles in simulation across more than 15 U.S. states.
This position follows a hybrid work schedule and will report to a Manager of Technical Program Management.
Responsibilities:
Collaborate with engineering and product management teams to oversee and optimize the use of simulation compute resources.
Formulate program frameworks and execution strategies that guarantee project success.
Maintain a strong focus on decision-making while effectively communicating trade-offs to stakeholders.
Exhibit the ability to influence without direct authority to ensure alignment and communication across all levels for your program.
Requirements:
Proven experience in large-scale program management, adept at handling multiple projects, diverse stakeholders, and complex team dynamics.
A minimum of 5 years in infrastructure, software, or resource and capacity management.
Educational background in computer science or a related technical field.
A strong track record of delivering impactful strategic communications.
Contract|$156K/yr - $229K/yr|On-site|Mountain View, California, US
About the Position
Join DeepMind as a Program Manager for our AI Platform and become integral to the operational success of a transformative program that fuels the Gemini and GenAI serving stack. This fixed-term contract role spans 12 months and is essential for delivering exceptional program support while driving operational excellence. Your primary focus will be on managing processes and execution to ensure seamless functioning of our technical infrastructure initiatives across various global time zones, all while establishing a structured framework that empowers our engineering teams to excel.
Your Role
This impactful Program Manager position emphasizes operational efficiency and program support in a mission-critical technical setting. You will oversee the operations of our GenAI serving stack, acting as a pivotal link between technical design and execution. This dynamic role demands a leader adept at thriving in an agile, fast-paced environment where priorities can shift rapidly. Key responsibilities include managing essential workflows, expediting technical design reviews, enhancing cross-functional collaborations, and ensuring that program objectives are achieved with accuracy and efficiency.
Key Responsibilities
Define, lead, and own comprehensive program charters addressing scope, timeline, OKRs, resources, and risks/dependencies for complex programs.
Facilitate design reviews and technical steering discussions to effectively vet and accelerate engineering productivity.
Collaborate closely with engineers to identify and resolve operational bottlenecks, supporting the timely delivery of infrastructure projects.
Organize and lead program reviews, summits, and other collaborative events to enhance team cohesion.
Identify and implement opportunities to revamp existing processes, transitioning from chaotic workflows to structured, efficient frameworks.
Full-time|$105K/yr - $155K/yr|On-site|Mountain View, CA
Join Our Team as a Software Engineer, Core Infrastructure
As a pivotal member of the Core Infrastructure team at Moveworks, you will play a crucial role in designing and implementing the next generation of our AI infrastructure. With Moveworks experiencing rapid growth, our infrastructure team is dedicated to creating and maintaining reliable, resilient foundational services and frameworks that enable our products to scale efficiently and support our engineering teams in delivering customer-facing features swiftly.
Collaborate closely with machine learning, search, product, data, and frontend teams to assess their infrastructure requirements, influence the infrastructure roadmap, and lead various projects from conception to execution.
Design and construct core infrastructure components and foundational functionalities, including distributed key-value stores, schema-less data stores, authentication and authorization systems, event streaming, distributed configuration management, rate limiting, circuit breaking, feature flag systems, A/B testing, and traffic capture and replay.
Enhance the observability and reliability of Moveworks systems by building and refining distributed logging, tracing, monitoring, and alerting infrastructures.
Establish methodologies and metrics to evaluate the performance of microservices and product functionalities, identify bottlenecks, and enhance the overall performance and scalability of Moveworks applications.
Consistently deliver time-sensitive work that is interconnected with other engineering teams.
Full-time|$235K/yr - $352.3K/yr|On-site|Mountain View, California (HQ)
Join Our Mission at Nuro
Nuro is pioneering the realm of self-driving technology with a vision to make autonomy accessible to everyone. Established in 2016, we are engineering the world’s most scalable driving solution by merging advanced AI with resilient automotive-grade hardware. Our flagship technology, the Nuro Driver™, is licensed for a variety of applications, from robotaxis to commercial fleets and private vehicles. With years of successful deployments, Nuro offers automakers and mobility platforms a streamlined pathway to commercial-scale autonomous vehicles, cultivating a safer, more integrated future.
Role Overview
Nuro is in search of a seasoned Technical Lead Manager with robust experience in large-scale infrastructure and workload orchestration, along with proficiency in batch and streaming data processing systems, to enhance our ML Infrastructure team. In this pivotal role, you will spearhead the development of our core platform, guaranteeing our researchers and engineers have uninterrupted access to the compute and data resources vital for advancing autonomous driving technology.
As a Technical Lead Manager, you will define the strategy for automated resource provisioning, high-performance workload scheduling, and efficient feature management. You will strike a balance between hands-on technical leadership and effective people management, guiding a talented team while collaborating closely with ML Research and Autonomy teams to remove infrastructure bottlenecks and expedite the Nuro Driver™ development process.
Key Responsibilities
As the Technical Lead Manager for ML Platform Infrastructure, you will construct the backbone that fuels Nuro’s model development journey from experimentation to production, including:
Technical Strategy Development: Crafting a roadmap for a cohesive ML platform that simplifies complex cloud infrastructure.
Resource Provisioning & IaC: Expanding our automated infrastructure-as-code (IaC) pipelines to oversee thousands of GPU/CPU nodes across multiple environments.
Intelligent Scheduling: Engineering and fine-tuning workload orchestration to optimize hardware use, minimize job wait times, and manage extensive distributed training.
Data Processing & ETL: Creating robust pipelines for the extraction and transformation of petabyte-scale sensor and telemetry data into machine learning-ready formats.
Feature Management: Implementing effective feature caching and storage solutions to diminish redundant computations and guarantee quick access to pre-computed features.
Team Leadership: Fostering a high-performance culture while mentoring team members.
Join the dynamic team at MatX as an Infrastructure Engineer, where you will play a pivotal role in designing, implementing, and maintaining cutting-edge infrastructure solutions. We are looking for innovative thinkers who thrive in a fast-paced environment and are passionate about leveraging technology to drive efficiency and performance.
Join Databricks as a Senior Manager in Engineering, where you will lead innovation in Cloud Intelligence and Infrastructure Economics. This pivotal role involves strategizing and executing engineering solutions that drive efficiency and enhance cloud performance.
Be Part of the Revolution in Home Robotics
At sunday, we are pioneering the development of personal robots designed to help you reclaim valuable time lost to mundane tasks. Our ambitious mission is to make versatile robots accessible to everyone, allowing households to enjoy more quality moments together.
After 18 months of assembling a skilled team, securing funding, and validating our innovative technology, we are in search of dedicated individuals to join us in this exciting phase of growth. If you're eager to leverage your skills at the cutting edge of robotics, we want to hear from you!
Your Role
As an Infrastructure & Release Engineer, you will take ownership of the
CENTRL is a rapidly expanding technology firm located in Silicon Valley, specializing in third-party risk management, due diligence, cyber risk, and security solutions. With offices spanning the San Francisco Bay Area, New York, Australia, and India, CENTRL serves a global clientele that includes numerous Fortune 500 companies. Our leadership team brings extensive experience and is supported by top-tier investors such as Providence Strategy Growth and Susquehanna Growth Equity. We are looking for a talented and experienced DevOps / Infrastructure Engineer to join our team. This role is pivotal in guiding the strategic direction, planning, and execution of our cloud and infrastructure operations, ensuring our IT systems maintain high availability, scalability, and performance. CENTRL's SaaS platform provides essential oversight and decision-making support for the investment and asset management sectors.
Full-time|$163K/yr - $286K/yr|On-site|Mountain View, CA
Your Role at Moveworks
Develop and enhance core infrastructure services and microservices that support our machine learning, frontend, and platform teams.
Implement critical functionalities such as distributed configuration management, rate limiting, feature flagging, A/B testing, and traffic capture and replay.
Optimize the performance, scalability, and observability of Moveworks cloud infrastructure.
Consistently deliver time-sensitive projects in collaboration with other engineering teams.
Take full ownership of features from inception to deployment while actively influencing the infrastructure roadmap.
Engage in a highly collaborative, in-person role, working across various teams including Core and ML engineering.
Full-time|$123K/yr - $190K/yr|On-site|Mountain View, CA
Your Role
As a key member of the Core Infrastructure team at Moveworks, you will play a pivotal role in designing and implementing the next evolution of our AI infrastructure. With Moveworks experiencing rapid growth, the infrastructure team is charged with creating and maintaining robust foundational services and frameworks that empower our products to scale effortlessly while enabling our engineering teams to rapidly develop customer-facing features.
Collaborate closely with teams in machine learning, search, product development, data, and frontend engineering to assess infrastructure requirements, shape the infrastructure roadmap, and spearhead various projects.
Design and develop core infrastructure and essential functionalities including distributed key-value stores, schema-less data storage, authentication and authorization mechanisms, event streaming, distributed configuration management, rate limiting, circuit breaking, feature flagging systems, A/B testing, and traffic capture and replay.
Enhance the observability and reliability of Moveworks systems by improving distributed logging, tracing, monitoring, and alerting capabilities.
Establish methodologies and metrics for assessing the performance of microservices and product functionalities, identify and resolve bottlenecks, and enhance the performance and scalability of Moveworks applications.
Consistently deliver time-sensitive work that is interdependent with other engineering teams.
Full-time|$180K/yr - $260K/yr|On-site|Mountain View, CA
Join Gatik: Pioneering Autonomous Logistics
At Gatik, we are at the forefront of transforming the B2B supply chain through our innovative autonomous transportation-as-a-service (ATaaS) platform. Our commitment to safe and efficient freight movement is evident in our partnerships with Fortune 500 retailers, including the launch of the world's first fully driverless commercial service with Walmart in 2021. Operating our Class 3-7 autonomous trucks across key markets such as Texas, Arkansas, and Ontario, Canada, we are dedicated to driving forward the future of freight transportation.
Our proprietary Level 4 technology, Gatik Carrier™, is designed to securely and effectively manage freight logistics between pick-up and drop-off locations on the middle mile. Integrating cutting-edge software and hardware, Gatik Carrier™ seamlessly enhances our customers' logistics operations.
Role Overview
We are in search of a Staff Infrastructure Engineer to expand our data collection, monitoring, and vehicle infrastructure, enhancing support for our expanding fleet of autonomous vehicles. This hands-on role requires close collaboration with our platform and autonomy teams to develop infrastructure solutions that are critical to autonomy software development, issue resolution, and deployment of autonomous vehicles to new customer sites. Your contributions will be pivotal in establishing reliable data pipelines, scalable CI systems, and efficient production deployments.
This position is based at our Mountain View, CA office with an on-site requirement of 5 days a week!
Full-time|$204K/yr - $259K/yr|On-site|Mountain View, CA, USA; San Francisco, CA, USA
Waymo is a pioneering company in autonomous driving technology, dedicated to becoming the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has committed to developing the Waymo Driver (The World’s Most Experienced Driver™) to enhance mobility access and save countless lives lost in traffic accidents. Our technology powers a fully autonomous ride-hail service and can be adapted to various vehicle platforms and applications. With over ten million rider-only trips and extensive experience driving more than 100 million miles on public roads and tens of billions of simulation miles across 15+ U.S. states, we are at the forefront of this transformative journey.
The Simulation ML Infrastructure team is focused on creating scalable AI/ML infrastructure that accelerates the Simulator team in developing state-of-the-art realistic simulations for testing and training the Waymo Driver. To enhance the realism and steerability of these simulations, we leverage large foundation models trained on vast datasets to accurately represent the real world, including realistic agents (vehicles, pedestrians, cyclists, motorcyclists), road systems, traffic control measures, and environmental conditions.
We are looking for a seasoned senior individual contributor to spearhead the advancement of sophisticated AI/ML infrastructure for multi-billion parameter foundation models within ML accelerator-friendly simulations. Your expertise in massive model scaling, ML accelerators, and large-scale distributed systems will be essential in designing and scaling our systems.
This position reports to an Engineering Manager.
Your Responsibilities:
Join a top-tier, high-performing research engineering team to push the boundaries of ultra-realistic multi-agent simulations using foundation models.
Collaborate closely with the Waymo Realism Modeling team located in London and the Waymo Oxford team to utilize large foundation models for enhancing simulation realism.
Operate at the intersection of data engineering, model development, and simulations, making architectural decisions. Take ownership of large, complex systems, and ensure that architectures and designs align with both technical and business objectives.
Design and scale extensive distributed systems that encompass the entire ML lifecycle, facilitating planet-scale dataset generation, model training, and evaluation.
Work cross-functionally to derive performance and system-level requirements for large ML systems. Convert product and business goals into measurable technical deliverables, ensuring alignment of system components.
Be Part of the Future of Home Robotics
At Sunday Robotics, we are pioneering the development of personal robots that alleviate the burden of repetitive household tasks. Our mission is to democratize access to advanced robotics, allowing families to reclaim precious time.
After an intensive 18 months of assembling a talented team, securing funding, and validating our innovative technology, we are eager to welcome passionate individuals to join us as we embark on the next exciting chapter of our journey. If you are enthusiastic about contributing your skills to the cutting edge of robotics, we want to hear from you!
The Role
As a Machine Learning Infrastructure Engineer at Sunday Robotics, you will play a pivotal role in shaping the future of home robotics. You will develop end-to-end machine learning models for robotic manipulation, creating foundational systems that will expedite our efforts to introduce robots into everyday homes.
This versatile position can be customized to align with your specific expertise, whether it be in data pipelines, training infrastructure, or inference. Your contributions will span the entire robot learning pipeline: from ingesting and processing multimodal data to scaling distributed training, optimizing real-time inference, and developing research tools.
What You Will Accomplish
Enhance the research codebase for optimal ergonomics and rapid iteration.
Oversee model training infrastructure, including job scheduling, checkpointing, metrics, and logging.
Facilitate distributed training across GPU clusters with minimal friction for researchers.
Enable the training of larger models through techniques such as sharding and memory optimization.
Profile and enhance GPU utilization, memory efficiency, and training throughput.
Create low-latency inference pipelines for real-time robot control, employing techniques to optimize performance.
Collaborate closely with researchers and roboticists to transform research requirements into robust software and infrastructure.
Data Pipelines and Research Tools
Architect high-throughput pipelines for the ingestion, validation, and transformation of multimodal robot data such as video and proprioception.
Develop efficient storage systems and metadata indexing for seamless data retrieval.
Feb 11, 2026