Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Entry Level
Qualifications
Candidates should possess a strong background in computer vision, machine learning, and data processing. Experience with vision-language models, data annotation techniques, and familiarity with multimodal datasets are highly desirable. A Bachelor's degree in a related field or equivalent practical experience is required.
About the job
Eventual Computing builds tools that help AI teams work with large, complex datasets. Based in San Francisco, the company supports projects in robotics, autonomous vehicles, and advanced video generation. Its open-source engine, Daft, is already in use at organizations with demanding data needs. The team focuses on making data curation and model training more efficient, so the right datasets are always within reach.
The office is located in the Mission district, where collaboration with leading AI labs and infrastructure companies is part of daily work.
Role overview
The Research Engineer - Multimodal Data will join the Visual Understanding team. This position centers on building solutions to make vast amounts of video and sensor data accessible and easy to query. The work directly supports researchers who need to find and use specific datasets quickly.
What you will do
Develop and refine systems that process petabytes of multimodal data, including video and sensor streams.
Apply vision-language models to improve how data is discovered and retrieved.
Define and influence the roadmap for visual understanding features.
Train models to streamline large-scale data annotation and improve efficiency for research teams.
About Eventual Computing
Eventual Computing is transforming the landscape of Physical AI through advanced data solutions. With significant backing and a talented team, we are committed to facilitating rapid advancements in AI technology.
Zyphra is a cutting-edge artificial intelligence firm located in the heart of San Francisco, California, dedicated to advancing technology across various modalities.About the Position:We are seeking a Data Engineer - Multimodal Systems to play a pivotal role in the enhancement and expansion of Zyphra's datasets and data pipelines. This position offers a unique opportunity to collaborate with diverse teams and contribute to innovative data solutions. You will engage in the collection of extensive datasets and the development and optimization of high-performance parallel data pipelines.Your Responsibilities Will Include:Executing large-scale data collection across multiple modalities, including text, audio, and image.Designing and implementing highly efficient, parallelized data processing pipelines that integrate various modalities.Conducting rigorous experimental ablations to evaluate the effectiveness of new data enhancements.Candidate Requirements:Proven ability in implementation and prototyping.Capability to transform ideas into experimental frameworks swiftly.Strong collaborative skills, thriving in a dynamic research environment.Eagerness to learn and apply new concepts effectively.Exceptional communication and teamwork skills, capable of contributing to both research and large-scale engineering projects.Preferred Qualifications:Experience in the collection, management, and processing of large datasets.Familiarity with parallel programming frameworks in Python, such as Dask.In-depth understanding of state-of-the-art dataset curation practices.A detail-oriented mindset with a passion for data integrity and verification.Strong foundation in experimental methodologies for conducting thorough ablation studies and hypothesis testing.Knowledge and interest in large-scale, highly parallel data processing systems.Proficiency in PyTorch and Python.Experience with large, complex codebases and the ability to quickly become productive within them.Published research in respected machine learning venues.Postgraduate degree in a relevant field is a plus.
Eventual Computing builds tools that help AI teams work with large, complex datasets. Based in San Francisco, the company supports projects in robotics, autonomous vehicles, and advanced video generation. Its open-source engine, Daft, is already in use at organizations with demanding data needs. The team focuses on making data curation and model training more efficient, so the right datasets are always within reach. The office is located in the Mission district, where collaboration with leading AI labs and infrastructure companies is part of daily work. Role overview The Research Engineer - Multimodal Data will join the Visual Understanding team. This position centers on building solutions to make vast amounts of video and sensor data accessible and easy to query. The work directly supports researchers who need to find and use specific datasets quickly. What you will do Develop and refine systems that process petabytes of multimodal data, including video and sensor streams. Apply vision-language models to improve how data is discovered and retrieved. Define and influence the roadmap for visual understanding features. Train models to streamline large-scale data annotation and improve efficiency for research teams.
Join worldlabs as a Research Engineer focused on scaling multimodal data. In this dynamic role, you will leverage cutting-edge technologies and methodologies to enhance data processing capabilities. You will be responsible for developing innovative solutions that integrate various data types and drive impactful research outcomes.
About Our Team:At OpenAI, we are dedicated to ensuring that artificial general intelligence (AGI) serves the greater good of humanity. Our API has emerged as the most widely embraced AI platform in the industry, catering to a diverse clientele ranging from startups and independent developers to Fortune 500 companies. By leveraging our multimodal APIs—which encompass real-time interactions, text-to-speech (TTS), speech generation, and image creation—we empower users to effectively harness the full spectrum of AI capabilities at scale.About the Role:We are on the lookout for an Engineering Manager to spearhead our multimodal API product suite. In this pivotal role, you will guide a talented team focused on delivering cutting-edge APIs for real-time processing, speech transcription, speech generation, and image creation. You will be instrumental in shaping the product roadmap and developing the tools that enable developers to connect with millions of end users through AI-driven audio, video, and imagery.In this role, you will:Lead, mentor, and cultivate a high-performing engineering team dedicated to multimodal API products, including our real-time API, Whisper transcription models, TTS speech generation models, and DALLE image generation APIs.Collaborate with product managers, designers, and various stakeholders to articulate the strategic vision and product roadmap.Work alongside our research teams to enhance our core multimodal models tailored for API customer use cases.Steer technical and architectural decisions with a focus on scalability, robustness, and user experience.Promote a culture of innovation, continuous improvement, and accountability within your team.Qualifications:Demonstrated experience in managing engineering teams that successfully deliver complex, high-quality products at scale.Strong technical expertise with proficiency in modern software engineering practices and system architecture.Exceptional collaboration and communication skills to effectively engage with diverse teams and stakeholders.Familiarity with or a strong passion for multimodal AI, encompassing speech technologies, real-time systems, and image generation.Adept at thriving in a fast-paced, dynamic startup environment.
Join Cloudflare as a Principal Systems Engineer, Data, where you will lead innovative projects that enhance our data processing capabilities. You will work collaboratively with cross-functional teams to design, implement, and optimize systems that efficiently handle large-scale data. This role requires a deep understanding of systems engineering principles, strong analytical skills, and a passion for leveraging data to drive decisions. Your contributions will be pivotal in shaping the future of our data infrastructure.
RF Systems EngineerJoin our client's innovative team, where we develop cutting-edge connected hardware and software systems that offer real-time insight into physical infrastructure via expansive wireless sensing networks.Our platform employs advanced multimodal wireless mesh sensing technologies, facilitating distributed perception in industrial, logistics, and operational contexts. This approach not only simplifies the deployment of sensors but also empowers organizations to efficiently monitor their assets and operations in real time.We are on the lookout for a talented RF Systems Engineer to spearhead the design and optimization of a comprehensive wireless architecture that interlinks thousands of edge devices across vast, distributed deployments.In this pivotal role, you will engage in system-level modeling, simulation, and performance validation across diverse wireless systems. You will collaborate closely with RF, firmware, and software teams to create and implement robust multi-radio architectures tailored for real-world scenarios.
The Bot CompanyAt The Bot Company, we are on a mission to create an innovative robotic assistant for every household.Our dynamic team, composed of talented engineers, designers, and operators, is based in San Francisco. We have a rich background from industry leaders such as Tesla, Cruise, OpenAI, Google, and Pixar, and we have successfully delivered products to hundreds of millions of users, honing our ability to create exceptional products and experiences.We pride ourselves on maintaining a streamlined team structure that fosters swift decision-making and minimizes bureaucracy. Each member is considered an Individual Contributor, granted substantial autonomy, ownership, and accountability. Our culture enables us to work across the technology stack with an emphasis on rapid iteration and execution.What We Seek in CandidatesCandidates for all positions at The Bot Company must exhibit remarkable sharpness and the capacity to thrive in high-pressure environments. We expect candidates to showcase:Exceptional Cognitive Abilities: You possess quick thinking, instant learning capabilities, and the ability to reason across diverse domains.Engineering Curiosity: You demonstrate an innate desire to understand how systems function, even beyond your area of expertise.Performance-Driven Attitude: You excel in fast-paced settings, effectively navigate ambiguity, and thrive under demanding circumstances.Machine Learning: Multimodal Foundation ModelsWe are developing unified foundation models capable of reasoning across text, images, video, and kinematics to inform intelligent robotic behaviors.You will engage with large-scale multimodal networks, overseeing the complete process from data handling to model training and deployment.Your ResponsibilitiesConstruct Native Multimodal Policies: Create architectures where vision, language, and other modalities are represented in a unified manner.Enhance Cross-Modal Reasoning: Explore and implement strategies to ensure that the model not only correlates modalities but also comprehends them (e.g., linking visual physics to kinematic constraints).Manage the Training Loop from Start to Finish: Design, execute, troubleshoot, and refine large-scale training experiments; identify failure points, enhance data mixtures, and tighten evaluations to achieve measurable improvements.Deploy and Refine Real Systems: Integrate models into practical robotic frameworks, enhance robot code for model deployment, and optimize performance for edge inference.
About Hike Medical Hike Medical is building the future of musculoskeletal care by combining advanced technology with practical healthcare solutions. Based in San Francisco’s Rincon Hill, the team develops a platform that spans three core areas: an AI-powered vision system for rapid web-based foot scans that generate custom 3D-printed orthotics, an AI agent platform that manages the entire DME workflow from intake through claims, and SoleForge, a high-scale 3D printing facility for custom medical devices. Hike Medical partners with some of the world’s largest employers and major orthotics and prosthetics organizations. Fortune 50 companies trust the platform to support employee well-being, and a broad network of clinical partners keeps the company connected to real-world needs. Custom insoles are just the starting point. The long-term goal is to reshape the industry with bionic devices: AI-designed, robotically manufactured orthotic and prosthetic products. The company aims to reach this milestone by 2040. Learn more at bionics2040.com. With $22 million raised across Seed and Series A rounds from leading investors, Hike Medical offers a results-oriented culture for those interested in the intersection of AI, manufacturing, and healthcare.
Join VOLT, a trailblazer in crafting advanced AI perception systems that enhance safety and security through real-time risk detection in the physical world.We are on the lookout for a Senior Applied AI & Machine Learning Engineer dedicated to designing, optimizing, and deploying multimodal AI models capable of functioning reliably in diverse real-world scenarios. This is a hands-on role focused on transitioning models from conceptual data to practical production, encompassing both edge devices and cloud infrastructures.In this position, you will engage with vision, video, and language-based models that interpret real-world scenes and events, ensuring their accuracy, latency, robustness, and cost-effectiveness in production systems.Reporting directly to the Head of Engineering, you will play a pivotal role in advancing VOLT AI’s core perception platform.
Full-time|On-site|San Francisco (London/Europe - OK)
Tavus – Multimodal AI Model OptimizationResearch EngineerAt Tavus, we are pioneering the human aspect of AI technology. Our objective is to make human-AI interactions as seamless and natural as in-person conversations, allowing for a human touch in areas that were once considered unscalable.We accomplish this through groundbreaking research in multimodal AI, focusing on human-to-human communication modeling (encompassing language, audio, and video) and the development of audio-visual avatar behaviors. Our innovative models drive applications ranging from text-to-video AI avatars to real-time conversational video experiences across sectors such as healthcare, recruitment, sales, and education.By empowering AI to perceive, listen, and engage with an authentic human-like presence, we are laying the groundwork for the next generation of AI workers, assistants, and companions.As a Series B company, we are supported by renowned investors, including Sequoia, Y Combinator, and Scale VC. Join us as we shape the future of human-AI interaction.The RoleWe are seeking an accomplished Research Scientist/Engineer with expertise in model optimization to be a vital part of our core AI team.The ideal candidate thrives in dynamic startup environments, is adept at setting priorities independently, and is open to making calculated decisions. We are moving swiftly and need individuals who can help navigate our path forward.Your MissionTransform state-of-the-art research models into fast, efficient, and production-ready systems through techniques such as sparsification, distillation, and quantization.Oversee the optimization lifecycle for critical models: establish metrics, conduct experiments, and evaluate trade-offs among latency, cost, and quality.Collaborate closely with researchers and engineers to convert innovative concepts into deployable solutions.RequirementsExtensive experience in deep learning with PyTorch.Practical experience in model optimization and compression, including knowledge distillation, pruning/sparsification, quantization, and mixed precision.Familiarity with efficient architectures such as low-rank adapters.Strong grasp of inference performance and GPU/accelerator fundamentals.Proficient in Python coding and adherence to best practices in research engineering.Experience with large models and datasets in cloud environments.Capability to read ML literature, reproduce results, and modify ideas accordingly.
Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California
At Databricks, we are driven by a passion for empowering data teams to tackle the world’s most challenging problems — from transforming transportation to accelerating medical innovations. We achieve this by creating and maintaining the leading data and AI infrastructure platform, enabling our clients to leverage profound data insights for business enhancement. Founded by engineers with a customer-first mentality, we eagerly embrace every opportunity to tackle complex technical challenges, ranging from the design of next-generation UI/UX for data interactions to scaling our services across millions of virtual machines. Our journey has just begun.As a member of the Runtime team at Databricks, you will be instrumental in developing the next generation of distributed data storage and processing systems. These systems will surpass specialized SQL query engines in relational query performance while offering the programming abstractions necessary to support a variety of workloads, from ETL to data science.Example projects include:Apache Spark™: Contribute to the de facto open-source standard framework for big data.Data Plane Storage: Develop reliable and high-performance services and client libraries for managing vast amounts of data within cloud storage backends like AWS S3 and Azure Blob Store.Delta Lake: Design a storage management system that merges the scalability and cost-effectiveness of data lakes with the performance and reliability of data warehouses, providing features like ACID transactions and time travel.Delta Pipelines: Simplify the orchestration and operation of numerous data pipelines, enabling clients to deploy, test, and upgrade pipelines effortlessly.Performance Engineering: Create the next-generation query optimizer and execution engine that is fast, scalable, and robust.
Join Cloudflare as a Systems Engineer specializing in Data, where you will play a critical role in enhancing our infrastructure and ensuring the reliability of our services. You will collaborate with cross-functional teams to design, implement, and maintain systems that handle vast amounts of data efficiently and securely. Your contributions will be pivotal in optimizing performance and delivering exceptional user experiences.
About TavusTavus is at the forefront of innovation in human computing. Our mission is to develop AI Humans: an advanced interface that bridges the gap between individuals and machines, eliminating the friction found in current technologies. Our state-of-the-art human simulation models empower machines to see, hear, respond, and even exhibit realistic appearances—facilitating genuine, face-to-face interactions. AI Humans integrate the emotional insight of humans with the scalability and dependability of machines, making them reliable agents accessible 24/7, in any language, on our terms.Imagine having access to an affordable therapist, a personal trainer that fits your schedule, or a team of medical assistants dedicated to providing personalized care for every patient. With Tavus, individuals, enterprises, and developers have the tools to create AI Humans that connect, comprehend, and act with empathy on a large scale.We are a Series A company supported by esteemed investors such as Sequoia Capital, Y Combinator, and Scale Venture Partners.Join us in shaping a future where machines and humans genuinely understand one another.The PositionWe are seeking an AI Researcher to join our core AI team and advance the frontiers of multimodal conversational intelligence. If you excel in dynamic environments, enjoy transforming abstract concepts into functional code, and derive motivation from pushing the boundaries of possibility, this role is designed for you.Your Responsibilities Engage in research focusing on Foundational Multimodal Models specifically in the realm of Conversational Avatars (such as Neural Avatars and Talking-Heads).Develop models for video, audio, and language sequences utilizing Autoregressive and Predictive Architectures (e.g., V-JEPA) and/or Diffusion methodologies, with a focus on temporal and sequential data rather than static images.Collaborate closely with the Applied ML team to implement your research into production systems.Remain at the forefront of multimodal learning and assist us in defining what “cutting edge” will mean in the future.Ideal Candidate ProfilePhD (or nearing completion) in a relevant field, or equivalent practical research experience.Experience in multimodal machine learning, particularly focused on conversational interfaces.
Bland Inc. seeks a Machine Learning Researcher specializing in Multimodal Large Language Models (LLMs) to join the team in San Francisco. The focus is on advancing AI systems that integrate language with other types of data. Role overview This position centers on research and development aimed at improving how AI models process and understand information from multiple sources, such as text combined with images or other modalities. What you will do Investigate how language interacts with additional data types within multimodal LLMs Create and evaluate new methods to enhance AI model performance Work closely with colleagues on projects designed to push the boundaries of machine learning Location This role is based in San Francisco.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, tailored specifically for AI applications. Our team is dedicated to developing large-scale infrastructure that efficiently crawls the internet, trains advanced embedding models for indexing, and constructs high-performance vector databases in Rust for optimized searching. We also manage a state-of-the-art $5M H200 GPU cluster that activates thousands of machines simultaneously.As a Software Engineer specializing in Distributed Data Systems, you will be responsible for designing and implementing the data infrastructure that drives our operations—from crawling billions of web pages to training sophisticated embedding models and delivering real-time search functionalities. You will enjoy significant autonomy in creating systems capable of scaling to hundreds of petabytes. This is your opportunity to work on data pipelines at an unprecedented scale.
About the TeamJoin our Online Data team, where we design and maintain the foundational online database and indexing services for OpenAI’s cutting-edge AI applications. We support the phenomenal growth of ChatGPT, the leading AI application globally, as well as Codex, the fastest-growing development toolset.Our mission is to uphold the reliability, accuracy, and scalability of our extensive online data infrastructure, enabling product and research teams to innovate swiftly without getting entangled in complex multi-region, multi-cloud, exabyte-scale data systems.About the RoleWe are on the lookout for an Engineering Manager to spearhead our Online Data Systems team. This role entails guiding a talented group of engineers dedicated to developing and managing hyperscale data storage and retrieval technologies.You will oversee the execution of highly complex engineering tasks in areas such as distributed query execution, multi-region data federation, self-orchestrating services, performance optimization, and more.This is a unique opportunity to shape cutting-edge technology at an unprecedented scale, where you won’t just be another cog in the machine but will have a significant impact on our trajectory.In this role, you will:Build, lead, and develop high-performing infrastructure engineering teams.Drive the advancement of OpenAI’s proprietary online data technologies, including our core database systems, indexing technologies, and vector search capabilities.Establish delivery metrics based on measurable reliability goals (SLOs, etc.) to ensure superior system performance and resilience.Advocate for the efficient use of agent technology to enhance execution speed.Minimize operational burdens and incident occurrences through improved abstractions and self-healing systems.You might thrive in this role if you:Possess a relentless pursuit of operational excellence, with hands-on experience in building...
Join Cloudflare as a Senior Systems Engineer specializing in our Data Platform. In this pivotal role, you'll be responsible for designing and implementing scalable systems that process vast amounts of data, ensuring performance and reliability across our services. Your expertise will drive innovative solutions that enhance our platform capabilities.
Full-time|$211.2K/yr - $290K/yr|On-site|San Francisco Bay Area, CA;San Diego, CA
Our MissionAt Altos Labs, we are dedicated to revitalizing cell health and resilience through innovative cell rejuvenation techniques to reverse disease, injury, and the disabilities that arise throughout life.Discover more about our vision at altoslabs.com.Our ValuesOur core value is simple yet powerful: Everyone Owns Achieving Our Inspiring Mission.Diversity at AltosWe understand that diverse perspectives are crucial for scientific breakthroughs and exploration. At Altos, exceptional scientists and industry leaders collaborate from around the globe to drive our shared mission forward. We prioritize a culture of belonging, ensuring that every employee feels valued for their unique contributions. We are all responsible for maintaining a diverse and inclusive workplace.Your Contributions to AltosJoin Altos Labs in creating a premier AI ecosystem aimed at addressing the most intricate challenges in human biology. You will be instrumental in designing and developing high-performance, scalable solutions that integrate high-dimensional biomedical imaging with molecular and linguistic data.Your role will involve implementing large-scale multimodal data fusion, advancing beyond basic image analysis to develop predictive models that span various biological domains. You will engage directly with data and coding, partnering with our engineering team to ensure these models are scalable, efficiently trainable in distributed cloud environments, and accessible to our global research network.Key ResponsibilitiesModel Development: Create, implement, and train large-scale foundational models (e.g., Vision Transformers, Multimodal LLMs) capable of embedding spatial data and integrating diverse modalities.Innovative Data Fusion: Apply cutting-edge cross-domain mapping and fusion techniques to align heterogeneous biological datasets.Scaling & Training: Develop and oversee high-performance ML pipelines designed to handle petabyte-scale image collections and multi-omics data streams in a cloud infrastructure.Technical Collaboration: Work closely with experimental scientists and software engineers to convert biological complexity into high-performance code and reliable distributed systems.Who You AreWe seek a technical expert who excels at unraveling "unsolvable" challenges through programming and meticulous experimentation. We welcome candidates at the Scientist I, Scientist II, or Senior Scientist levels.
Full-time|$192K/yr - $260K/yr|On-site|San Francisco, California
P-186 At Databricks, we are passionate about empowering data teams to tackle some of the world’s most challenging problems, from security threat detection to cancer drug development. Our mission is to build and operate the leading data and AI infrastructure platform, enabling our customers to concentrate on the high-value challenges that are integral to their own objectives. Founded in 2013 by the original creators of Apache Spark™, Databricks has rapidly evolved from a small office in Berkeley, California, to a global powerhouse with over 1000 employees. Trusted by thousands of organizations, from startups to Fortune 100 companies, we are recognized as one of the fastest-growing SaaS companies worldwide. Our engineering teams create highly sophisticated products that address significant needs in the industry. We continuously push the limits of data and AI technology while maintaining the resilience, security, and scalability essential for our customers' success on our platform. We manage one of the largest-scale software platforms, consisting of millions of virtual machines that generate terabytes of logs and process exabytes of data daily. At this scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must effectively shield our customers from these challenges. Modern data analysis leverages advanced techniques, such as machine learning, that far exceed the capabilities of traditional SQL query engines. As a Software Engineer on the Runtime team at Databricks, you will be instrumental in developing the next generation of distributed data storage and processing systems that outshine specialized SQL query engines in relational query performance, while providing the flexibility and programming abstractions to support a variety of workloads, from ETL to data science. Examples of projects you may work on include: Apache Spark™: Contributing to the de facto open-source framework for big data. Data Plane Storage: Developing reliable, high-performance services and client libraries for storing and accessing vast amounts of data on cloud storage backends like AWS S3 and Azure Blob Store. Delta Lake: A storage management system that merges the scalability and cost-effectiveness of data lakes with the performance and reliability of data warehouses, featuring low latency streaming. Its higher-level abstractions and guarantees, including ACID transactions and time travel, significantly reduce the complexity of real-world data engineering architectures. Delta Pipelines: Aiming to simplify the management of data engineering pipelines.
Join Cloudflare as a Distributed Systems Engineer within our dynamic Data Platform team, focusing on Analytics and Alerts. In this position, you will play a pivotal role in building and optimizing distributed systems that power our data analytics capabilities, providing real-time insights and alerts to enhance our customer experience.
Mar 4, 2026
Sign in to browse more jobs
Create account — see all 6,054 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.