Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Experience
Qualifications
Your ProfileExpertise in Machine Learning: You possess a strong background in generative modeling and have contributed to influential machine learning projects, as evidenced by your work on widely adopted open-source libraries or impactful publications at prestigious venues such as NeurIPS, ICML, or Nature. Proficient ML Developer: Your code is robust, well-tested, and maintainable. You are adept at using version control and code review tools, and you excel at developing efficient prototypes as well as polished production code, having experience with cloud computing and model parallelization. Data Engineering Skills: You have a solid track record in building machine learning data pipelines for training and evaluation of deep learning models, including data analysis and effective dataset construction. Model Optimization Enthusiast: With a deep understanding of the interaction between ML libraries, hardware, and data, you are passionate about optimizing model performance for both training and inference speeds. Curious and Mission-Driven: You are dedicated to making a meaningful impact, adapting your methods to achieve your goals, and thriving in fast-paced environments.
About the job
Join latentlabs, a pioneering company at the forefront of biotechnology, as we seek a talented Machine Learning Researcher specializing in generative modeling. You will become part of a dynamic, interdisciplinary team comprising machine learning experts, protein engineers, and biologists, all committed to revolutionizing biological control and disease treatment. In this role, you will design innovative generative models aimed at creating new proteins that exhibit functionality in wet lab assays.
About latentlabs
At latentlabs, we are dedicated to advancing biotechnology through innovative research and development. Our interdisciplinary team leverages cutting-edge machine learning and biological expertise to create solutions that address critical health challenges. We are passionate about harnessing the power of technology to change lives.
Similar jobs
1 - 20 of 4,487 Jobs
Search for Technical Program Manager Adversarial Model Research
Team OverviewThe Human Data team at OpenAI is at the forefront of identifying and mitigating risks associated with advanced AI systems. Our mission is to enhance model reliability and public trust by designing thorough evaluations, uncovering vulnerabilities, and collaborating closely with researchers.Role OverviewAs a Technical Program Manager, you will spearhead initiatives aimed at assessing the safety and robustness of OpenAI’s models through innovative experimentation and methodical evaluation. Your role will involve orchestrating efforts across research and engineering teams, translating ambiguous risk signals into actionable research programs that will shape the future of AI model development and deployment.We seek candidates who possess technical acumen, thrive in uncertain environments, and are passionate about pioneering the future of safe AI.This position is based in San Francisco, CA, employing a hybrid work model of three days in the office each week, with relocation assistance available for new hires.Key ResponsibilitiesLead programs that investigate unexpected model behaviors and identify potential failure modes.Convert ambiguous risk signals into clear priorities and actionable research agendas.Design and execute innovative evaluations, experiments, and red-teaming initiatives.Collaborate with research, product, and deployment teams to integrate findings into the model training and deployment pipelines.Establish repeatable systems for monitoring model performance and interpreting emerging behavior patterns.Ideal Candidate ProfileProven experience in technical program management with exceptional organizational and communication abilities.Familiarity with large language models, prompt engineering, or model evaluation methodologies.Ability to manage fast-paced, high-uncertainty projects, shaping them from inception.Creative and resourceful in developing novel methods for evaluating model behavior and performance.Skilled in coordinating effectively across both technical and non-technical stakeholders to ensure alignment and execution.About OpenAIOpenAI is a pioneering AI research and deployment company committed to ensuring that general-purpose artificial intelligence benefits all of humanity. We continually push the boundaries of AI capabilities and strive to deploy them safely through our innovative products. Our mission is to harness the extraordinary potential of AI responsibly and equitably for a better future.
Full-time|Remote|San Francisco, CA | New York City, NY
Anthropic is seeking a Technical Program Manager for Research Initiatives to coordinate and deliver advanced projects in artificial intelligence. This position is based in San Francisco, CA or New York City, NY. Role overview This role centers on managing research projects that push the boundaries of AI. The Technical Program Manager will oversee cross-functional teams, keeping projects on track and ensuring research goals are met. What you will do Lead and organize research initiatives focused on artificial intelligence Coordinate teams from different disciplines to achieve project objectives Track project timelines and deliverables, ensuring milestones are reached Requirements Experience managing technical or research-focused projects Ability to work with cross-functional teams Strong organizational and communication skills
Full-time|$243.9K/yr - $286.9K/yr|Hybrid|Remote - USA
Are you ready to exceed your own expectations? At Coinbase, we are driven by our mission to enhance economic freedom worldwide. This is a monumental challenge that requires our utmost dedication as we develop the onchain platform that will shape the future of the global financial system.We are on the lookout for a unique candidate who shares our passion and believes in the transformative potential of cryptocurrency and blockchain technology to revolutionize finance. We seek someone who is motivated to make a significant impact, thrives under pressure alongside top-tier colleagues, and actively pursues feedback to continuously improve. If you are someone who faces challenges head-on and is ready to tackle our most complex problems, we want to hear from you!Our work culture is demanding and not suited for everyone. However, if you aspire to build the future with exceptional individuals who hold high expectations for themselves and others, this is the perfect environment for you.Although many roles at Coinbase operate on a remote-first basis, we are not exclusively remote. In-person participation is required several times a year during team and company-wide offsites to promote collaboration, connection, and alignment. Attendance is expected and fully supported.Security is a vital competency at Coinbase, and our Security Operations team vigilantly monitors all aspects of it. We confront some of the world's most sophisticated attackers daily to protect billions of dollars in digital assets, ensuring a secure and trustworthy experience for our customers and employees. As Coinbase expands globally, our team will grow accordingly, leveraging a mix of tools, automation, and strategic team development to safeguard the next billion crypto users.The Senior Manager, Adversary Management will oversee strategy, operational governance, and all facets of cyber threat intelligence at Coinbase, including supporting intelligence needs for Security Operations and other Information Security requirements. The ideal candidate will possess extensive technical expertise in threat intelligence, cyber fraud, and both blockchain (Web 3.0) and traditional (Web 2.0) threat landscapes. Reporting to the Senior Director of Security Operations, this leader will play a crucial role in protecting the Coinbase ecosystem, its products, and customers by defining and delivering actionable intelligence services that disrupt threat-actor activities.
We are seeking a highly motivated and detail-oriented Research Program Manager to join our dynamic team at mercor. The ideal candidate will play a pivotal role in overseeing various research initiatives, coordinating projects, and ensuring that all objectives are met efficiently and effectively.This position offers an exciting opportunity to work in a fast-paced environment where innovation and collaboration are key. You will be responsible for managing project timelines, budgets, and resources while fostering a culture of continuous improvement within the team.
About UsAt Twelve Labs, we are at the forefront of creating groundbreaking multimodal foundation models that enable machines to perceive and comprehend the world akin to human senses. Our innovative models have set new benchmarks in video-language modeling, empowering developers to create tools with unparalleled capabilities in semantic search, summarization, and analytical insights.Having secured $107 million through Seed and Series A funding from prestigious venture capital and corporate partners including NVIDIA, NEA, Radical Ventures, Index Ventures, Snowflake, and Databricks, our advisory board comprises AI pioneers and founders such as Fei-Fei Li, Silvio Savarese, and Alexandr Wang. Based in San Francisco and with a significant presence in the APAC region through our Seoul office, we are dedicated to advancing global innovation.Role OverviewWe are on the lookout for a Director of Technical Program Management who will spearhead large-scale execution and foster alignment among our research, engineering, product, and go-to-market teams. In this pivotal role, you will manage intricate, cross-functional projects involving over 60 contributors, ensuring the establishment of clear roadmaps, efficient processes, and the prioritization of customer needs in technical initiatives. Additionally, you will cultivate and mentor a high-performing TPM team, creating scalable systems for visibility, prioritization, and effective communication. As a strategic partner within our leadership team, you will provide clarity, promote accountability, and facilitate rapid growth and impactful outcomes for Twelve Labs.
Join Perplexity as a Research Engineering Manager, where you will spearhead a team of exceptional AI researchers and engineers dedicated to crafting the advanced models that power our innovative products. Our talented team has pioneered some of the most sophisticated models in agentic research, query understanding, and other critical domains that demand precision and depth. As we broaden our user base and expand our product offerings, our proprietary models are increasingly essential for delivering a premium experience to the world's most discerning users.You will explore our extensive datasets of conversational and agentic queries, applying state-of-the-art training methodologies to enhance AI model performance. Through proactive technical and organizational leadership, you will empower your team to create cutting-edge models for the applications that are most significant to our business and our users.
Zyphra is a cutting-edge artificial intelligence firm headquartered in the vibrant city of San Francisco, California.Position Overview:As a Research Scientist specializing in Model Architectures, you will play a pivotal role in Zyphra’s AI Architecture Research Team. Your responsibilities will include the design and thorough evaluation of innovative model architectures and training methodologies aimed at enhancing essential modeling capabilities (e.g., loss per flop or loss per parameter) and tackling core limitations inherent in current models. You will collaborate closely with our pre-training team to ensure that your findings are seamlessly integrated into our next-generation models.Qualifications:A strong research acumen and intuition.Proven ability to navigate research projects from initial conception to execution and final write-up.Exceptional implementation and prototyping skills, with the capability to swiftly transform ideas into experimental outcomes.A collaborative spirit and the ability to thrive in a fast-paced research environment.A deep curiosity and enthusiasm for understanding intelligence.Requirements:Experience with long-term memory, RAG/retrieval systems, dynamic/adaptive computation, and alternative credit assignment strategies.Knowledge of reinforcement learning, control theory, and signal processing techniques.A passion for exploring and critically evaluating unconventional ideas, with the ability to maintain a unique perspective.Familiarity with modern training pipelines and the hardware necessities for designing efficient architectures compatible with GPU hardware.Strong understanding of experimental methodologies for conducting rigorous ablations and hypothesis testing.High proficiency in PyTorch and Python programming.Ability to quickly assimilate into large pre-existing codebases and contribute effectively.Prior publication of machine learning research in reputable venues.Postgraduate degree in a scientific discipline (e.g., Computer Science, Electrical Engineering, Mathematics, Physics).Why Join Zyphra?We emphasize a structured research methodology that systematically addresses ambitious challenges in AI.
Full-time|Remote|Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY
Anthropic is looking for a Research Engineer focused on model evaluations. This position involves research and development to assess and strengthen the performance of AI models. Teams are based in San Francisco and New York City, and the role supports remote work with required travel. Key responsibilities Design and implement evaluations for Anthropic's AI models Collaborate with team members to enhance model performance Contribute to research that pushes the boundaries of AI systems Location Remote-friendly (travel required) San Francisco, CA New York City, NY
Zyphra is an innovative artificial intelligence company located in the heart of San Francisco, California.The Opportunity:Join our dynamic team as a Research Engineer - Audio & Speech Models, where you will play a pivotal role in advancing Zyphra’s Audio Team. You will be instrumental in developing cutting-edge open-source text-to-speech and audio models. Your contributions will span the full spectrum of the model training process, from data collection and processing to the design of innovative architectures and training approaches.Your Responsibilities:Conduct large-scale audio training operationsOptimize the performance of our training infrastructureCollect, process, and evaluate audio datasetsImplement architectural and methodological improvements through rigorous testingWhat We Seek:A strong research mindset with the ability to navigate projects from ideation to implementation and documentation.Proficiency in rapid prototyping and implementation, allowing for swift experimentation.Effective collaboration skills in a fast-paced research environment.A quick learner who is eager to embrace and implement new concepts.Excellent communication abilities, enabling you to contribute to both research and engineering tasks at scale.Preferred Qualifications:Expertise in training audio models, such as text-to-speech, ASR, speech-to-speech, or emotion recognition.Experience with training audio autoencoders.Solid understanding of signal processing, particularly in audio.Familiarity with diffusion models, consistency models, or GANs.Experience with large-scale (multi-node) GPU training environments.Strong understanding of experimental methodologies for conducting rigorous tests and ablations.Interest in large-scale, parallel data processing pipelines.Competence in PyTorch and Python programming.Experience contributing to large, established codebases with rapid adaptation.
Join Cartesia as a Model Architecture ResearcherAt Cartesia, our vision is to revolutionize AI by creating interactive intelligence that is seamlessly integrated into your daily life. Unlike current models, our goal is to develop systems capable of processing extensive streams of audio, video, and text—1 billion text tokens, 10 billion audio tokens, and 1 trillion video tokens—directly on devices.As pioneers in innovative model architectures, our founding team, which originated from the Stanford AI Lab, has developed State Space Models (SSMs)—a groundbreaking foundation for training efficient, large-scale models. Our diverse team merges deep expertise in model innovation with a design-focused engineering approach, allowing us to create and deploy state-of-the-art models and applications.Backed by leading investors such as Index Ventures, Lightspeed Venture Partners, and many others, including industry veterans and advisors, we are poised to shape the future of AI.Your ContributionIn this role, you will drive forward-thinking research in neural network architecture, focusing on alternative models like state space models, efficient transformers, and hybrid architectures.Create innovative architectures that enhance model performance, inference speed, and adaptability in various environments, from cloud infrastructures to on-device implementations.Develop advanced capabilities for models, including statefulness, long-range memory, and novel conditioning mechanisms to boost expressiveness and generalization.Analyze architectural decisions and their effects on model characteristics such as scalability, robustness, latency, and energy consumption.Create frameworks and tools to assess architectural advancements, benchmarking their performance in both research and production contexts.Collaborate with interdisciplinary teams to translate architectural insights into scalable systems that deliver real-world impact.Your QualificationsExtensive experience in architecture design with a focus on advanced models such as state space models, transformers, and RNN/CNN variants.In-depth understanding of the interplay between architectural designs and system constraints, particularly in cloud and on-device deployments.Strong proficiency in the design and evaluation of neural network architectures.
Role overview Decagon seeks a Technical Program Manager based in San Francisco to coordinate work across multiple teams and deliver new technical solutions. This position guides projects from initial planning through completion, ensuring schedules stay on track and technical milestones are achieved. What you will do Lead cross-functional teams to meet project goals Track project progress and adjust plans when necessary Share updates and collect feedback from stakeholders Clarify and verify technical requirements throughout each project Help advance Decagon’s strategic programs through strong program management
Remote|Remote|Remote-Friendly (Travel Required) | San Francisco, CA
Join Anthropic as a Senior Research Scientist on our Reward Models team, where you will spearhead groundbreaking research aimed at enhancing our understanding of human preferences at scale. Your innovative contributions will directly influence how our AI models, including Claude, align with human values and optimize for user needs. You will delve into the forefront of reward modeling for large language models, designing novel architectures and training methodologies for Reinforcement Learning from Human Feedback (RLHF). Your research will explore advanced evaluation techniques, including rubric-based grading, and tackle challenges such as reward hacking. Collaboration is key, as you'll work alongside teams in Finetuning, Alignment Science, and our broader research organization to ensure your findings result in tangible advancements in AI capabilities and safety. This role offers you an opportunity to address critical AI alignment challenges, leveraging cutting-edge models and substantial computational resources to further the science of safe and capable AI systems.
About TavusTavus is at the forefront of innovation in human computing. Our mission is to develop AI Humans: an advanced interface that bridges the gap between individuals and machines, eliminating the friction found in current technologies. Our state-of-the-art human simulation models empower machines to see, hear, respond, and even exhibit realistic appearances—facilitating genuine, face-to-face interactions. AI Humans integrate the emotional insight of humans with the scalability and dependability of machines, making them reliable agents accessible 24/7, in any language, on our terms.Imagine having access to an affordable therapist, a personal trainer that fits your schedule, or a team of medical assistants dedicated to providing personalized care for every patient. With Tavus, individuals, enterprises, and developers have the tools to create AI Humans that connect, comprehend, and act with empathy on a large scale.We are a Series A company supported by esteemed investors such as Sequoia Capital, Y Combinator, and Scale Venture Partners.Join us in shaping a future where machines and humans genuinely understand one another.The PositionWe are seeking an AI Researcher to join our core AI team and advance the frontiers of multimodal conversational intelligence. If you excel in dynamic environments, enjoy transforming abstract concepts into functional code, and derive motivation from pushing the boundaries of possibility, this role is designed for you.Your Responsibilities Engage in research focusing on Foundational Multimodal Models specifically in the realm of Conversational Avatars (such as Neural Avatars and Talking-Heads).Develop models for video, audio, and language sequences utilizing Autoregressive and Predictive Architectures (e.g., V-JEPA) and/or Diffusion methodologies, with a focus on temporal and sequential data rather than static images.Collaborate closely with the Applied ML team to implement your research into production systems.Remain at the forefront of multimodal learning and assist us in defining what “cutting edge” will mean in the future.Ideal Candidate ProfilePhD (or nearing completion) in a relevant field, or equivalent practical research experience.Experience in multimodal machine learning, particularly focused on conversational interfaces.
Full-time|$250K/yr - $325K/yr|On-site|San Francisco
About World Labs: At World Labs, we create foundational world models capable of perceiving, generating, reasoning, and interacting with the 3D environment. Our mission is to unlock the full potential of AI through spatial intelligence, transforming perception into action, reasoning into insight, and imagination into creation. We believe that spatial intelligence will revolutionize storytelling, creativity, design, simulation, and immersive experiences across both virtual and physical realms. Our world-class team is driven by curiosity and passion, boasting diverse backgrounds in technology, from AI research and systems engineering to product design. This synergy fosters a tight feedback loop between our cutting-edge research and user-empowering products. Role Overview We are seeking an innovative Research Scientist specializing in generative modeling, especially diffusion models, to join our modeling team. This position is ideal for individuals with extensive expertise in applying diffusion models to images, videos, or 3D assets and scenes. While not mandatory, experience in any of the following areas will be considered a significant advantage: Large-scale model trainingResearch in 3D computer vision In this role, you will work closely with researchers, engineers, and product teams to translate advanced 3D modeling and machine learning techniques into practical applications, ensuring our technology stays at the forefront of visual innovation. This position entails substantial hands-on research and engineering work, taking projects from conception to production deployment. Key Responsibilities Design, implement, and train large-scale diffusion models for generating 3D worlds. Develop and experiment with large-scale diffusion models to introduce novel control signals, align with target aesthetic preferences, or optimize for efficient inference. Collaborate closely with research and product teams to comprehend and translate product requirements into actionable technical roadmaps. Contribute actively to all phases of model development, including data curation, experimentation, evaluation, and deployment. Continuously investigate and integrate the latest research in diffusion and generative AI. Serve as a key technical resource within the team, mentoring peers and promoting best practices in generative modeling and machine learning engineering.
Full-time|$150K/yr - $200K/yr|On-site|San Francisco
Astranis Space Technologies Corp. is at the forefront of satellite innovation, crafting advanced satellites designed for high orbits that extend humanity's reach into the cosmos. Our satellites deliver dedicated and secure communications networks to a diverse clientele, including large enterprises, sovereign governments, and the U.S. military. With five satellites successfully in orbit and numerous launches on the horizon, we are addressing a backlog of over $1 billion in commercial contracts.As a trusted partner in satellite communications, we cater to clients with rigorous demands for reliability, data security, network visibility, and tailored solutions. Backed by over $750 million in funding from leading investors such as Andreessen Horowitz, Blackrock, and Fidelity, our team of 450 engineers and entrepreneurs operates from a 153,000 sq. ft. state-of-the-art headquarters in Northern California.Technical Program Manager, USG ProgramsAs a Technical Program Manager focused on U.S. Government programs, you will be pivotal in steering the execution of Astranis's satellite initiatives from initial concept to final delivery. Reporting directly to the Director of USG Programs, you will be responsible for program outcomes related to schedule, budget, risk management, and customer engagement. This role demands a combination of programmatic leadership and technical expertise, engaging in hands-on problem solving while maintaining strategic oversight. You will conduct trade studies to enhance mission system architectures, facilitate coordination across spacecraft and ground segments, and ensure all technical, contractual, and mission requirements are fulfilled. Additionally, you will help refine and scale our USG program execution frameworks and cross-functional processes as our portfolio grows.Key ResponsibilitiesOversee government-related program execution from inception to completion, encompassing small internal R&D projects to comprehensive multi-spacecraft missions.Establish, manage, and communicate program schedules, budgets, and resource allocations to guarantee successful project delivery.Lead the risk management process, including documentation, tracking of mitigation strategies, and approval of risk closures.Ensure that programs meet all technical and programmatic specifications by monitoring and reporting key performance indicators.Conduct regular internal and external program reviews, emphasizing schedule trends, risks, and progress against significant milestones.Act as the primary liaison for government programs, leading design reviews, reporting, and milestone readiness assessments.Collaborate with engineering, operations, and mission assurance teams to ensure alignment on requirements, interfaces, and design development.
Full-time|$200K/yr - $240K/yr|Hybrid|Hybrid - San Francisco, California
About Us:Motive empowers physical operations with innovative tools designed to enhance safety, productivity, and profitability. For the first time, safety, operations, and finance teams can manage their drivers, vehicles, equipment, and fleet expenditures through a unified system. With cutting-edge AI technology, the Motive platform delivers comprehensive visibility and control, significantly minimizing manual workloads through automation and task simplification.Serving nearly 100,000 clients, from Fortune 500 corporations to small businesses, Motive operates across industries such as transportation and logistics, construction, energy, field services, manufacturing, agriculture, food and beverage, retail, and the public sector.Discover more at gomotive.com.Role Overview:We are looking for a dynamic leader to establish and steer our Technical Program Management team. This strategic, high-impact position will be pivotal in shaping our execution across the engineering domain as we create state-of-the-art custom hardware and software solutions. You will lead a team of Technical Program Managers—experts in embedded software and feature integration—who facilitate the daily execution of our Connected Devices Engineering initiatives.This role requires exceptional cross-functional leadership, acting as a vital link between engineering, product, customer-facing, and business stakeholders. You will gain executive-level visibility and be responsible for ensuring that our most crucial programs are delivered on schedule while continuously enhancing our operational efficiency. In addition to keeping programs on track, you will foster a culture of open communication, proactive problem-solving, and seamless collaboration among full-stack software, product, and Quality Assurance teams.Key Responsibilities:Develop and expand a high-performing Technical Program Management organization by attracting, nurturing, and retaining top talent.Ensure impeccable execution of mission-critical programs by upholding schedule integrity, proactively identifying and resolving blockers, clarifying requirements, and aligning all stakeholders.Influence strategic direction by shaping product roadmaps, driving data-informed feature prioritization, and owning end-to-end schedule commitments with full accountability for delivery.Promote operational excellence by designing, implementing, and continually refining scalable processes and frameworks that enhance efficiency and productivity.
Join latentlabs, a pioneering company at the forefront of biotechnology, as we seek a talented Machine Learning Researcher specializing in generative modeling. You will become part of a dynamic, interdisciplinary team comprising machine learning experts, protein engineers, and biologists, all committed to revolutionizing biological control and disease treatment. In this role, you will design innovative generative models aimed at creating new proteins that exhibit functionality in wet lab assays.
Sigma Computing builds cloud-scale analytics and business intelligence tools that keep the familiar feel of a spreadsheet. The platform helps business professionals, non-technical users, and data teams explore, analyze, visualize, and collaborate on data throughout their organizations. About the Temp-to-Hire Program This program is designed for early-career Technical Program Managers interested in the operational side of engineering, with a focus on Infrastructure and Data Services. The role combines technical knowledge with project management skills. Success in this position relies on a proactive approach and the ability to work with cross-functional teams to improve execution and meet goals. A strong interest in building quality products and supporting customer satisfaction is important. Note: This is a temp-to-hire position. The initial engagement lasts three months as a temporary employee. Full-time conversion is possible, depending on performance during that period.
Full-time|Remote|San Francisco, CA, New York, NY, Portland, OR, or Remote within Canada or United States
As Mercury continues to expand, our revenue platforms increasingly operate at the nexus of product launches, data systems, go-to-market strategies, and external partnerships. In a landscape where every task holds significance, the greatest threat is not a lack of effort, but rather a stagnation in momentum. We are seeking a Senior Technical Program Manager for our Revenue Technology team, who will ensure that intricate, multi-team projects not only progress but do so efficiently: in the right order, with clear dependencies highlighted, and tangible advancements visible on a weekly basis. This role transcends mere status updates; we need a hands-on technical TPM adept enough to directly resolve issues — whether that involves Linear, Salesforce, data workflows, or collaboration across Product, Engineering, Data, and Revenue teams. In this position, you will report to the Head of Platforms & Infrastructure and play a pivotal role in maintaining execution focus, allowing specialized technical roles to function at their highest capacity. *Mercury is a fintech company and not an FDIC-insured bank. Banking services are provided through Choice Financial Group and Column N.A., Members FDIC.
Full-time|On-site|San Francisco (London/Europe - OK)
Tavus – Multimodal AI Model OptimizationResearch EngineerAt Tavus, we are pioneering the human aspect of AI technology. Our objective is to make human-AI interactions as seamless and natural as in-person conversations, allowing for a human touch in areas that were once considered unscalable.We accomplish this through groundbreaking research in multimodal AI, focusing on human-to-human communication modeling (encompassing language, audio, and video) and the development of audio-visual avatar behaviors. Our innovative models drive applications ranging from text-to-video AI avatars to real-time conversational video experiences across sectors such as healthcare, recruitment, sales, and education.By empowering AI to perceive, listen, and engage with an authentic human-like presence, we are laying the groundwork for the next generation of AI workers, assistants, and companions.As a Series B company, we are supported by renowned investors, including Sequoia, Y Combinator, and Scale VC. Join us as we shape the future of human-AI interaction.The RoleWe are seeking an accomplished Research Scientist/Engineer with expertise in model optimization to be a vital part of our core AI team.The ideal candidate thrives in dynamic startup environments, is adept at setting priorities independently, and is open to making calculated decisions. We are moving swiftly and need individuals who can help navigate our path forward.Your MissionTransform state-of-the-art research models into fast, efficient, and production-ready systems through techniques such as sparsification, distillation, and quantization.Oversee the optimization lifecycle for critical models: establish metrics, conduct experiments, and evaluate trade-offs among latency, cost, and quality.Collaborate closely with researchers and engineers to convert innovative concepts into deployable solutions.RequirementsExtensive experience in deep learning with PyTorch.Practical experience in model optimization and compression, including knowledge distillation, pruning/sparsification, quantization, and mixed precision.Familiarity with efficient architectures such as low-rank adapters.Strong grasp of inference performance and GPU/accelerator fundamentals.Proficient in Python coding and adherence to best practices in research engineering.Experience with large models and datasets in cloud environments.Capability to read ML literature, reproduce results, and modify ideas accordingly.
Apr 3, 2026
Sign in to browse more jobs
Create account — see all 4,487 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.