Senior Software Engineer for Large Model Evaluation

Waymo LLCMountain View, California, USA; San Francisco, California, USA

On-site Full-time $204K/yr - $259K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Over 5 years of industry experience in a quantitative software engineering domain. Proficient in navigating complex technical and product landscapes, setting technical strategies, and developing roadmaps. Fundamentals of Software Engineering:Expertise in programming languages such as Python or C++. Familiar with software design principles, coding best practices, testing methodologies, and version control systems.

About the job

The Large Model Evaluation team plays a crucial role in advancing Waymo's AI ambition. As we leverage advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs), our goal is to create cutting-edge AI systems that navigate the complexities of real-world driving scenarios. Our progress hinges on our ability to measure performance effectively. With robust evaluation being essential for deploying large models, the challenges we face are particularly intricate and safety-critical. We are seeking engineers with a quantitative mindset to explore and establish innovative methods for assessing the machine learning models utilized in the Waymo Driver.

About Waymo LLC

Waymo is a leader in autonomous driving technology aiming to provide safe and accessible mobility for everyone. With a history rooted in innovation since 2009, we continue to redefine transportation through advanced AI and machine learning systems.

Similar jobs

1 - 20 of 794 Jobs

Search for Software Engineer Perception Evaluation And Test Automation

794 results

Select all on this page (20)

Apply

Software Engineer - Perception Evaluation and Test Automation

Waymo LLC

Full-time|$170K/yr - $216K/yr|Hybrid|Mountain View, CA, USA; San Francisco, CA, USA

Waymo is at the forefront of autonomous driving technology, driven by the mission to become the world’s most trusted driver. Born from the Google Self-Driving Car Project in 2009, Waymo has dedicated itself to developing the Waymo Driver—The World’s Most Experienced Driver™—to enhance mobility and save lives that are often lost in traffic accidents. The Waymo Driver powers our fully autonomous ride-hailing service, which can be adapted for various vehicle platforms and use cases. With over ten million rider-only trips completed and experience accumulated from driving over 100 million miles on public roads, alongside tens of billions of miles in simulation across more than 15 U.S. states, Waymo is redefining transportation.The Perception team is integral to the Waymo Driver's success, crafting cutting-edge technology that enables our autonomous systems to accurately understand their environment, make informed decisions in real-time, and transport passengers safely to their destinations. We engage in research to tackle real-world challenges and collaborate with other innovative teams at Alphabet. Our access to extensive driving data collected through diverse sensors empowers software engineers like you to create and scale multi-modal models and techniques.Our objective is to establish a robust foundation for a high-level Perception pipeline, which is a fundamental component of the self-driving system. We serve as the essential link between Waymo's hardware teams and the broader engineering organization focused on self-driving technology, defining sensor specifications, providing vital feedback, and simplifying system complexities.In this hybrid role, you will report directly to a Technical Lead Manager.

Feb 10, 2026

Apply

Software Engineer - Perception Evaluation

Waymo

Full-time|$170K/yr - $216K/yr|Hybrid|Mountain View, CA, USA

Waymo is leading the way in autonomous driving technology with a mission to be the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo is dedicated to developing the Waymo Driver—The World’s Most Experienced Driver™—to enhance accessibility to transportation while preventing countless lives lost in traffic accidents. Our innovative Waymo Driver underpins our fully autonomous ride-hailing service and is adaptable to various vehicle platforms and applications. With over ten million rider-only trips facilitated by our technology, we have driven more than 100 million miles on public roads and logged tens of billions of miles in simulation across more than 15 U.S. states.The Perception Evaluation team at Waymo is pioneering advancements in autonomous driving, prioritizing the safety and reliability of our self-driving systems. Our team leverages state-of-the-art tools and methodologies to rigorously evaluate the performance of our perception systems, which are essential for the safe and effective operation of autonomous vehicles. We are currently seeking a Software Engineer to be instrumental in shaping the future of transportation and enhancing the quality and reliability of Waymo's autonomous vehicles.In this hybrid role, you will report directly to an Engineering Manager.

Feb 12, 2026

Apply

Senior Software Engineer - Perception Verification

Waymo LLC

Full-time|On-site|Mountain View, CA, USA; San Francisco, CA, USA

As a Senior Software Engineer specializing in Perception Verification at Waymo, you will play a crucial role in enhancing the safety and performance of autonomous driving technology. Your expertise will guide the development of innovative algorithms and systems that validate perception capabilities, ensuring our vehicles can navigate the complexities of real-world environments.This position offers an exciting opportunity to work at the forefront of technology, collaborating with a talented team of engineers and researchers dedicated to revolutionizing transportation.

Mar 12, 2026

Apply

Software Engineer - Perception Data Infrastructure

Aeva

Full-time|On-site|Mountain View, CA

Aeva is seeking a talented Software Engineer to join our Perception Data Infrastructure team. In this role, you will be responsible for designing and developing scalable systems to manage and process perception data efficiently. You will work collaboratively with cross-functional teams to innovate and enhance our perception capabilities, contributing to the overall success of our cutting-edge technology.

Mar 10, 2026

Apply

Senior Software Engineer - Perception Data Infrastructure

Nuro

Full-time|On-site|Mountain View, California (HQ)

Join Nuro as a Senior Software Engineer specializing in Perception Data Infrastructure. In this pivotal role, you will be responsible for developing and optimizing systems that process perception data, enabling our autonomous vehicles to navigate safely and efficiently. You will collaborate with cross-functional teams to enhance our data infrastructure, ensuring reliability and scalability.

Apr 6, 2026

Apply

Systems Test Engineer - Commercialization Test Automation

Waymo LLC

Full-time|On-site|Mountain View, CA, USA; San Francisco, CA, USA

Join Waymo, a pioneer in autonomous vehicle technology, as a Systems Test Engineer specializing in Commercialization Test Automation. In this role, you will be at the forefront of testing and enhancing our advanced systems, ensuring they meet high standards of performance and reliability.Your contributions will directly impact the future of transportation by supporting the development of self-driving technology. Collaborate with cross-functional teams to design, execute, and automate test cases, identifying any potential issues and implementing solutions.

Mar 12, 2026

Apply

Lead Principal Software Engineer - Perception Pretraining

Waymo LLC

Full-time|$332K/yr - $421K/yr|On-site|Mountain View, CA, USA; San Francisco, CA, USA

Waymo is at the forefront of autonomous driving technology, dedicated to becoming the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has tirelessly worked on developing the Waymo Driver—The World's Most Experienced Driver™—to enhance mobility access and save lives lost to traffic accidents. The Waymo Driver fuels our fully autonomous ride-hailing service and can adapt to various vehicle platforms and applications. With over ten million rider-only trips completed, Waymo has amassed more than 100 million miles driven autonomously on public roads, supplemented by tens of billions of simulated miles across 15+ U.S. states.The Perception team at Waymo is instrumental in crafting the technology that empowers the Waymo Driver. Our advanced models enable the Waymo Driver to interpret its surroundings, make informed decisions, and ensure safe passage for passengers. Engaging in research to tackle real-world challenges, we collaborate closely with Alphabet's research teams. Our access to extensive driving data from diverse sensors allows machine learning practitioners to develop scalable multi-modal models and techniques.You will report directly to our Director of Engineering within the Perception Organization.

Feb 10, 2026

Apply

Software Technical Lead Manager for Test Automation Infrastructure

Waymo LLC

Full-time|On-site|Mountain View, CA, US

Join Waymo as a Software Technical Lead Manager specializing in Test Automation Infrastructure. In this pivotal role, you will spearhead the development and optimization of cutting-edge test automation frameworks, ensuring our software systems are reliable, efficient, and scalable. Collaborate with cross-functional teams to design innovative testing solutions that enhance product quality and accelerate deployment cycles.Your leadership will guide a team of talented engineers, fostering a culture of excellence and continuous improvement. You will be instrumental in defining best practices and driving the technical vision of our automation infrastructure.

Mar 12, 2026

Apply

Senior Software Engineer - Perception Machine Learning Data

Nuro

Full-time|$193.9K/yr - $291.1K/yr|On-site|Mountain View, California (HQ)

About the RoleJoin our dynamic team of experts where machine learning and systems engineering intersect to enhance the performance of autonomous systems. As a Senior Software Engineer specializing in Perception Machine Learning Data, you will play a crucial role in integrating machine learning advancements with autonomy infrastructure, ensuring that our models are trained using the most pertinent, diverse, and high-quality datasets. Your contributions will significantly influence how autonomous systems recognize uncommon scenarios, adapt to various geographical contexts, and operate safely at scale.Key Responsibilities Include:Utilizing Vision Language Models (VLMs) to compile diverse datasets that reflect real-world driving patterns across different regions.Creating high-fidelity synthetic data frameworks across multiple sensor modalities.Enhancing machine learning-powered validation processes for data quality and model preparedness.Your Impact:High-Output Generalist: Collaborate across various domains including autonomy, infrastructure, databases, simulation, and machine learning development, while expanding your expertise in Robotics and ML.Robotics Specialist: Develop cutting-edge solutions for data discovery, automated labeling, and synthetic data generation in close cooperation with the Infrastructure and Autonomy teams.About the WorkTackle the most demanding data challenges in autonomy by applying machine learning and rigorous systems engineering principles:Design hybrid systems that combine deep learning with traditional algorithms for scalable data curation and annotation.Create frameworks to evaluate the real-world authenticity of synthetic data and enhance the quality of synthetic data rendering.Develop tools to automatically identify data gaps that affect the performance of perception models.Collaborate with autonomy engineers to transform raw sensor data into prioritized training objectives, addressing critical gaps that hinder perception and autonomy performance.About YouBachelor’s degree in Computer Science, Robotics, Statistics, Physics, Mathematics, or a related quantitative field.Experience:4+ years of professional software engineering experience, proficient in Python and familiar with C/C++. Demonstrated ability to lead cross-functional technical projects from conception to execution.You have hands-on experience in implementing machine learning solutions and enjoy embedding them into practical systems. Your focus is on delivering impactful, integrated solutions rather than solely theoretical ML projects.Bonus PointsExperience working with synthetic or autonomous driving data.Background in building machine learning systems for robotic applications.

Feb 17, 2026

Apply

Senior Staff Software Engineer - Perception Data

Waymo LLC

Full-time|$281K/yr - $356K/yr|Hybrid|Mountain View, California, USA

Waymo is a leader in autonomous driving technology dedicated to becoming the most trusted driver in the world. Originating from the Google Self-Driving Car Project in 2009, our focus has been on creating the Waymo Driver—an exceptional driver that enhances mobility access while significantly reducing traffic-related fatalities. The Waymo Driver not only supports our fully autonomous ride-hailing service but can also be adapted to various vehicle platforms and applications. With over ten million rider-only trips completed, our technology has safely navigated more than 100 million miles on public roads and processed tens of billions of miles in simulation across over 15 U.S. states.The Perception Data team at Waymo plays a critical role in shaping the strategy and technical direction for all data utilized in training and evaluating the Waymo Driver's perception capabilities. We manage the complete data lifecycle, developing automated systems and 'infrastructure-as-product' solutions that convert vast amounts of driving sensor data into high-quality training datasets. Our team tackles complex challenges, including active learning loops and open-vocabulary modeling, bridging raw data with advanced machine learning techniques.By integrating data ingestion, curation, and evaluation into a unified ecosystem, we facilitate the swift development of foundational models and next-generation perception systems. We work closely with Machine Learning, Infrastructure, and Evaluation teams to address intricate data challenges, ensuring our models can effectively interpret the long-tail of rare events. Ultimately, our contributions lay the groundwork for the Waymo Driver to operate safely in diverse environments.

Feb 23, 2026

Apply

Staff Software Engineer - Simulator Evaluation

Waymo LLC

Full-time|$238K/yr - $302K/yr|On-site|Mountain View, California, United States; San Francisco, California, United States.

Waymo is at the forefront of autonomous driving technology, committed to becoming the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has dedicated itself to developing the Waymo Driver—The World’s Most Experienced Driver™—to enhance mobility access while significantly reducing traffic-related fatalities. Our Waymo Driver powers a fully autonomous ride-hailing service and is adaptable to a diverse array of vehicle platforms and applications. With over ten million rider-only trips and experience from more than 100 million miles driven autonomously on public roads, Waymo is shaping the future of transportation.Our simulation environment is among the most sophisticated ever created, utilizing deterministic logic, physical dynamics, and cutting-edge Generative AI to establish a training ground for the Waymo Driver. The Simulator Evaluation team tackles the challenging question: How can we mathematically validate that a virtual world is perceptibly 'real'?We are on the lookout for a Staff Software Engineer who will take on the role of Technical Architect in this field. You will operate at the intersection of software engineering and AI, ensuring our simulated environments—whether governed by explicit rules or foundational models—accurately reflect reality.In this Staff-level position, you will report directly to a Senior Staff Software Engineering Manager and function as a Technical Lead, connecting intricate technical metrics with overarching product strategies.Your Responsibilities Will Include:Architecting Evaluation Standards: Define the 'Definition of Done' for simulation realism, anticipating product objectives (e.g., operational capabilities in snow or highway driving) and designing the evaluation roadmap to ensure our simulation fidelity evolves in alignment with onboard requirements.Acting as the System Critic: Create comprehensive mathematical frameworks to validate our hybrid virtual world, determining the balance between diverse evaluation needs—from verifying logical rules and dynamics to assessing the distribution quality of generative AI models.Building at Scale: Lead the development of large-scale, extensible evaluation platforms (in C++/Python), ensuring our metric pipelines are robust distributed systems capable of delivering clear, reproducible insights on petabytes of data.Providing Strategic, Cross-Functional Leadership: Serve as the technical liaison across various organizations, closely collaborating with AI research and other simulation teams. The evaluation workflows you design will facilitate rapid innovation and inform research trajectories.

Feb 10, 2026

Apply

Software Development Engineer in Test (SDET) at 360itprofessionals1 | Mountain View

360itprofessionals1

Full-time|On-site|Mountain View

Join our dynamic team at 360itprofessionals1 as a Software Development Engineer in Test (SDET). In this pivotal role, you will be responsible for designing, implementing, and maintaining automated test frameworks to ensure the quality of our software products. You will collaborate closely with development teams to identify testing requirements and create robust testing strategies that enhance product reliability and user experience.

Dec 13, 2016

Apply

Senior Software Engineer - Quantitative Evaluations

Waymo LLC

Full-time|$204K/yr - $259K/yr|Hybrid|Mountain View, CA, USA; San Francisco, CA, USA

Waymo is at the forefront of autonomous driving technology, dedicated to becoming the world’s most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has been committed to developing the Waymo Driver—The World’s Most Experienced Driver™—to enhance mobility access and save lives lost to traffic accidents. The Waymo Driver not only powers our fully autonomous ride-hailing service but can also be adapted to various vehicle platforms and applications. With over ten million rider-only trips under its belt, the Waymo Driver has driven autonomously over 100 million miles on public roads and completed tens of billions of miles in simulation across more than 15 U.S. states.The Planner Evaluation team tackles one of the primary challenges in autonomous driving: assessing and elevating the quality of the software steering the car. We seek data-driven software engineers and data scientists who are passionate about autonomous vehicles and adept at utilizing complex data to inform decision-making. If this excites you, we encourage you to apply!This position follows a hybrid work model and reports to an Engineering Manager.

Feb 10, 2026

Apply

Senior Perception Engineer at Aeva | Mountain View, CA

Aeva

Full-time|On-site|Mountain View, CA

About Us:Aeva is at the forefront of perception technology, revolutionizing a myriad of industries including automated driving, industrial robotics, consumer electronics, consumer health, and security. Our mission is to pioneer the next generation of perception technologies, integrating essential LiDAR components into a compact silicon photonics chip. Our cutting-edge 4D LiDAR sensors offer unique capabilities, such as instant velocity detection alongside 3D positioning, empowering autonomous devices like vehicles and robots to make smarter, safer decisions.Role Overview:We are seeking a passionate and experienced Senior Perception Engineer to enhance our classical perception algorithms stack. As a critical member of our dynamic team, you will delve into the capabilities of Aeva’s 4D FMCW LiDARs, pushing the boundaries of autonomous driving performance further than ever before.

Dec 1, 2025

Apply

Staff Software Engineer - Quantitative Evaluation at Waymo

Waymo LLC

Full-time|$238K/yr - $302K/yr|Hybrid|Mountain View, California, United States; San Francisco, California, United States; New York City, New York, United States.

Waymo is at the forefront of autonomous driving technology, striving to become the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has dedicated itself to creating the Waymo Driver—the world's most experienced driver™—with the aim of enhancing mobility access while preventing the countless lives lost in traffic accidents. The Waymo Driver powers our fully autonomous ride-hailing service and is adaptable across various vehicle platforms and use cases. With over ten million rider-only trips under its belt and extensive experience driving more than 100 million miles on public roads, complemented by extensive simulations across 15+ U.S. states, Waymo is leading the charge in safe and efficient transportation solutions.The Planner Evaluation team addresses a pivotal challenge in autonomous driving: enhancing and quantifying the software that drives our vehicles. We seek seasoned data-driven software engineers and data scientists passionate about autonomous vehicles and utilizing complex data to drive informed decisions. This is the perfect opportunity for someone eager to make a significant impact in the field!In this hybrid role, you will report directly to an Engineering Manager.You will:Develop performance metrics and driving quality signals for the Waymo driver utilizing various techniques, including statistics, mathematics, physics, algorithms, and machine learning.Innovatively use simulations and analyze real-world driving logs to assess driving performance.Design and implement robust methods to strengthen the correlation between changes in onboard software and simulated results.Advocate for code quality and best practices within a large and intricate code base.Analyze data and provide insights for enhancing metric quality and interpretability.Collaborate effectively with engineers, data scientists, statisticians, and leadership to deliver evaluation products and facilitate data-driven decisions.Rapidly validate the effectiveness of additional coverage, ensuring solutions are robust for customer teams to manage their own evaluations.

Feb 10, 2026

Apply

Software Engineer - Statistical Evaluation and Sampling

Waymo LLC

Full-time|$170K/yr - $216K/yr|On-site|Mountain View, CA, USA; San Francisco, CA, USA

Waymo is at the forefront of autonomous driving technology, aspiring to become the world's most reliable driver. Originating as the Google Self-Driving Car Project in 2009, we are dedicated to developing the Waymo Driver—The World’s Most Experienced Driver™—to enhance mobility access while significantly reducing traffic-related fatalities. With over ten million rider-only trips facilitated by our technology, the Waymo Driver has autonomously navigated more than 100 million miles on public roads and tens of billions in simulation across more than 15 states in the U.S.Our Release Evaluation organization is committed to ensuring each iteration of the Waymo Driver is safe for public deployment. We construct automated pipelines to identify rare and exceptional scenarios in autonomous driving, effectively searching for critical insights under limited time and resource constraints. Within Release Evaluation, the Sampling and Efficiency team employs advanced importance sampling techniques combined with machine learning to maximize the statistical efficiency of our discovery pipelines.

Feb 10, 2026

Apply

Software Engineer in Large Model Evaluation

Waymo LLC

Full-time|$170K/yr - $216K/yr|On-site|Mountain View, California, USA; San Francisco, California, USA

Waymo is at the forefront of autonomous driving technology, striving to become the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has been dedicated to developing the Waymo Driver—The World’s Most Experienced Driver™—with the goal of enhancing mobility access while aiming to reduce the tragic loss of lives due to traffic accidents. The Waymo Driver not only fuels our fully autonomous ride-hailing service but is also adaptable across various vehicle platforms and applications. With over ten million trips solely for riders and extensive experience from autonomously navigating over 100 million miles on public roads, alongside tens of billions of miles in simulation across more than 15 U.S. states, we are leading the charge in this transformative technology.The Large Model Evaluation team plays a pivotal role in advancing Waymo's AI vision. As we integrate cutting-edge Large Language Models (LLMs) and Vision-Language Models (VLMs), our aim is to construct sophisticated AI systems capable of addressing the multifaceted challenges of real-world driving. Central to our achievements is the ability to accurately measure our progress. In a landscape where robust evaluation serves as a critical barrier to deploying large models, the intricacies of this task at Waymo are particularly complex and safety-sensitive. We seek quantitatively driven engineers to innovate and propose novel methodologies for evaluating the ML models utilized in the Waymo Driver.

Feb 18, 2026

Apply

Senior Software Engineer - Statistical Evaluation & Sampling

Waymo LLC

Full-time|$204K/yr - $259K/yr|On-site|Mountain View, CA, USA; San Francisco, CA, USA; New York, NY, USA

Join Waymo, a pioneer in autonomous driving technology, dedicated to becoming the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, our mission is to enhance mobility access and eliminate traffic-related fatalities. The Waymo Driver has already facilitated over ten million rides, showcasing its capability after autonomously navigating over 100 million miles on public roads and trillions in simulations across more than 15 U.S. states.As part of the Release Evaluation team, you will play a key role in ensuring the safety of each Waymo Driver version prior to deployment. We develop automated systems to address rare and exceptional scenarios in autonomous driving, effectively identifying critical signals under constraints of time and resources. The Sampling and Efficiency team utilizes advanced importance sampling techniques and machine learning to enhance the statistical efficiency of our evaluation pipelines.

Feb 10, 2026

Apply

Machine Learning Engineer - Perception

Aeva, Inc.

Full-time|On-site|Mountain View, CA

Aeva, Inc. is seeking a Machine Learning Engineer with a focus on Perception to join the team in Mountain View, CA. This role centers on developing algorithms that help autonomous systems interpret and respond to their environment. The work draws on both machine learning and computer vision to improve how autonomous vehicles perceive the world around them. Key responsibilities Design and build perception algorithms for autonomous platforms Use machine learning and computer vision on real-world datasets Support progress in autonomous vehicle technology through technical contributions Work closely with engineers and researchers to expand product capabilities Location This position is based in Mountain View, CA.

Apr 27, 2026

Apply

Senior Software Engineer for Large Model Evaluation

Waymo LLC

Full-time|$204K/yr - $259K/yr|On-site|Mountain View, California, USA; San Francisco, California, USA

Waymo is a pioneering force in autonomous driving technology, dedicated to becoming the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo focuses on developing the Waymo Driver—The World's Most Experienced Driver™—to enhance mobility access and significantly reduce traffic-related fatalities. The Waymo Driver powers our fully autonomous ride-hailing service and is adaptable across various vehicle platforms and applications. With over ten million rider-only trips completed and extensive experience from driving more than 100 million miles on public roads, complemented by simulations totaling tens of billions of miles across over 15 U.S. states, we are at the forefront of our industry.The Large Model Evaluation team plays a crucial role in advancing Waymo's AI ambition. As we leverage advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs), our goal is to create cutting-edge AI systems that navigate the complexities of real-world driving scenarios. Our progress hinges on our ability to measure performance effectively. With robust evaluation being essential for deploying large models, the challenges we face are particularly intricate and safety-critical. We are seeking engineers with a quantitative mindset to explore and establish innovative methods for assessing the machine learning models utilized in the Waymo Driver.

Feb 17, 2026

Create account — see all 794 results