Tavus – Research Engineer, Multimodal AI Model Optimization
At Tavus, we are pioneering the human aspect of AI technology. Our objective is to make human-AI interactions as seamless and natural as in-person conversations, allowing for a human touch in areas that were once considered unscalable.
We accomplish this through groundbreaking research in multimodal AI, focusing on human-to-human communication modeling (encompassing language, audio, and video) and the development of audio-visual avatar behaviors. Our innovative models drive applications ranging from text-to-video AI avatars to real-time conversational video experiences across sectors such as healthcare, recruitment, sales, and education.
By empowering AI to perceive, listen, and engage with an authentic human-like presence, we are laying the groundwork for the next generation of AI workers, assistants, and companions.
As a Series B company, we are supported by renowned investors, including Sequoia, Y Combinator, and Scale VC. Join us as we shape the future of human-AI interaction.
The Role
We are seeking an accomplished Research Scientist/Engineer with expertise in model optimization to be a vital part of our core AI team.
The ideal candidate thrives in a fast-moving startup environment, sets priorities independently, and is comfortable making calculated decisions under uncertainty. We move quickly and need people who can help chart the path forward.
Your Mission
Transform state-of-the-art research models into fast, efficient, and production-ready systems through techniques such as sparsification, distillation, and quantization.
Oversee the optimization lifecycle for critical models: establish metrics, conduct experiments, and evaluate trade-offs among latency, cost, and quality.
Collaborate closely with researchers and engineers to convert innovative concepts into deployable solutions.
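To make one of the techniques above concrete, here is a toy sketch of post-training affine quantization in plain Python. It is an illustration of the idea only, not Tavus's pipeline; the function names and bit width are invented for this example.

```python
def quantize(weights, num_bits=8):
    """Affine (asymmetric) quantization: map floats onto an integer grid."""
    qmin, qmax = 0, 2 ** num_bits - 1
    wmin, wmax = min(weights), max(weights)
    scale = (wmax - wmin) / (qmax - qmin)
    zero_point = round(qmin - wmin / scale)
    # Round each weight to the nearest grid point, clamped to the int range.
    q = [max(qmin, min(qmax, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate float weights from the integer representation."""
    return [(qi - zero_point) * scale for qi in q]

weights = [-1.5, -0.3, 0.0, 0.7, 2.1]
q, scale, zp = quantize(weights)
recovered = dequantize(q, scale, zp)
max_err = max(abs(w - r) for w, r in zip(weights, recovered))
# Round-trip error is bounded by half the quantization step (scale / 2).
assert max_err <= scale / 2 + 1e-9
```

The trade-off the role evaluates is exactly the one visible here: storing 8-bit integers plus a scale and zero point cuts memory and bandwidth roughly 4x versus float32, at the cost of a bounded rounding error in every weight.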
Requirements
Extensive experience in deep learning with PyTorch.
Practical experience in model optimization and compression, including knowledge distillation, pruning/sparsification, quantization, and mixed precision.
Familiarity with parameter-efficient techniques such as low-rank adapters (LoRA).
Strong grasp of inference performance and GPU/accelerator fundamentals.
Proficiency in Python and adherence to research-engineering best practices.
Experience with large models and datasets in cloud environments.
Ability to read ML literature, reproduce results, and adapt ideas to new settings.
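The low-rank adapters mentioned in the requirements are parameter-efficient because a full weight update is replaced by a product of two thin matrices. A toy arithmetic sketch (the layer dimensions and rank below are illustrative, not from the posting):

```python
# Instead of learning a full d_out x d_in update to a weight matrix W,
# a low-rank adapter (LoRA) learns W + B @ A, where B is d_out x r and
# A is r x d_in, with a small rank r.
d_in, d_out, r = 4096, 4096, 8  # illustrative transformer layer sizes

full_update_params = d_out * d_in    # parameters in a dense update
lora_params = d_out * r + r * d_in   # parameters in the low-rank factors

reduction = full_update_params / lora_params
assert reduction == 256.0  # 512x fewer per factor pair at rank 8, net 256x
```

At rank 8 on a 4096x4096 layer, the adapter trains roughly 0.4% of the parameters a dense update would, which is why the technique pairs naturally with the distillation and quantization work described above.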

