Anthropic
Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY
Remote Full-time
Experience Level
Entry Level
Qualifications
Strong background in machine learning and AI model evaluation techniques. Experience with programming languages such as Python or similar. Ability to work collaboratively in a remote team environment. Excellent analytical and problem-solving skills.
About the job
Anthropic is looking for a Research Engineer focused on model evaluations. This position involves research and development to assess and strengthen the performance of AI models. Teams are based in San Francisco and New York City, and the role supports remote work with required travel.
Key responsibilities
Design and implement evaluations for Anthropic's AI models
Collaborate with team members to enhance model performance
Contribute to research that pushes the boundaries of AI systems
Location
Remote-friendly (travel required)
San Francisco, CA
New York City, NY
About Anthropic
Anthropic is at the forefront of AI research, dedicated to developing safe and beneficial AI technologies. Our mission is to ensure that AI systems are aligned with human intentions and values. We foster a collaborative and innovative environment that encourages curiosity and creativity.
Similar jobs
Generative AI Researcher, Atomistic Simulation Models
Join Achira in shaping the future of deep learning with cutting-edge generative, representational, and simulation models for molecules and materials. Our mission is to create foundational models that render the atomistic universe understandable, predictable, and designable.
Why Choose Achira?
Be part of an elite, cross-disciplinary team comprising ML researchers, physicists, chemists, and engineers who are redefining atomistic simulation through expansive foundation models.
Advance the integration of deep learning with the principles of nature, merging generative AI, probabilistic reasoning, and molecular physics.
Engage in projects at an unparalleled scale, tackling extensive datasets, computational challenges, and ambitious goals.
Take full ownership of your research journey, from ideation and architecture to training, evaluation, and deployment.
Flourish in a dynamic culture that values rigor, speed, creativity, and impact over bureaucracy.
Position Overview
As a Generative AI Researcher at Achira, you will contribute to the development of foundation simulation models: large-scale systems designed to learn the structure, dynamics, and energetics of the atomistic realm.
These models will unite deep representation learning, generative modeling, and sophisticated simulation techniques.
Your responsibilities will include:
Crafting and training state-of-the-art deep generative models, including diffusion, autoregressive, flow-based, and latent-variable architectures focused on molecules, materials, and atomic systems.
Creating expressive representations of molecular and atomistic structures and dynamics utilizing equivariant graph neural networks, geometric transformers, and latent encoders that respect physical symmetries and constraints.
Innovating advanced sampling and simulation techniques that blend probabilistic inference, deep learning, and reinforcement learning to facilitate efficient exploration and simulation of learned energy landscapes.
Developing models that comprehend, generate, and simulate the physical world, merging reasoning, simulation, and predictive capabilities.
Working collaboratively with physicists and chemists to validate models against ab initio, molecular dynamics, and experimental datasets.
Rapidly prototyping, benchmarking, and iterating, converting research concepts into reusable, scalable model components across Achira’s foundation model suite.
Join our innovative team at Achira, where we are pioneering the integration of probabilistic generative models with advanced simulation frameworks to revolutionize drug discovery. In this role, you will harness state-of-the-art techniques to accelerate molecular design and enhance biomolecular conformational sampling.
Why Choose Achira?
Become part of a highly skilled team of researchers, scientists, and engineers dedicated to merging probabilistic AI/ML with molecular simulation for transformative small molecule drug discovery.
Contribute to the development of cutting-edge architectures for 3D generation and proposal mechanisms that leverage physical insights.
Engage with large-scale models and datasets, utilizing high-throughput evaluation within an ML-optimized biomolecular simulation ecosystem.
Take ownership of your projects, from the initial conception of models to the design of sampling tools and applications.
Thrive in a dynamic culture that values speed, scientific rigor, and a sense of ownership.
Role Overview
As a Generative AI Researcher, you will be at the forefront of Achira's efforts to develop foundation simulation models and conditional generators tailored for molecular systems. Your expertise in designing probabilistic generative models, including diffusion models and normalizing flows, will be crucial in optimizing biomolecular simulation potentials. This position offers an opportunity to enhance the efficiency of small molecule generation and biomolecular landscape exploration.
A background in statistical mechanics, particularly nonequilibrium perspectives, will be beneficial, although the primary focus will be on probabilistic AI/ML methodologies.
Overview: Join us at Spellbrush as we innovate in the world of gaming! We are developing an immersive 3D first-person adventure game where an AI companion drives the gameplay experience. Our goal is to create a game that integrates large language models (LLMs) in a way that enhances storytelling and player engagement, moving beyond simple chat interactions.
About Spellbrush
At Spellbrush, we are dedicated to crafting exceptional anime games. As the leading generative AI studio, we are the creative force behind niji・journey. Our mission is clear: to harness AI in bringing vibrant characters to life and to redefine narrative-driven gaming experiences.
Our Project
We have developed an innovative in-house LLM storytelling system that seamlessly integrates AI with narrative and gameplay, offering players a depth of interaction that transcends traditional gaming.
Role Overview
As an integral member of our small but highly skilled team, you will have the opportunity to shape the future of gaming. Collaborate with leading minds in the industry, including the creator of Warudo and a veteran from Google DeepMind behind Project Astra. Your role will afford you substantial creative and research freedom to pioneer LLM-driven storytelling.
Full-time|$250K/yr - $325K/yr|On-site|San Francisco
About World Labs: At World Labs, we create foundational world models capable of perceiving, generating, reasoning, and interacting with the 3D environment. Our mission is to unlock the full potential of AI through spatial intelligence, transforming perception into action, reasoning into insight, and imagination into creation. We believe that spatial intelligence will revolutionize storytelling, creativity, design, simulation, and immersive experiences across both virtual and physical realms. Our world-class team is driven by curiosity and passion, boasting diverse backgrounds in technology, from AI research and systems engineering to product design. This synergy fosters a tight feedback loop between our cutting-edge research and user-empowering products.
Role Overview
We are seeking an innovative Research Scientist specializing in generative modeling, especially diffusion models, to join our modeling team. This position is ideal for individuals with extensive expertise in applying diffusion models to images, videos, or 3D assets and scenes. While not mandatory, experience in any of the following areas will be considered a significant advantage:
Large-scale model training
Research in 3D computer vision
In this role, you will work closely with researchers, engineers, and product teams to translate advanced 3D modeling and machine learning techniques into practical applications, ensuring our technology stays at the forefront of visual innovation. This position entails substantial hands-on research and engineering work, taking projects from conception to production deployment.
Key Responsibilities
Design, implement, and train large-scale diffusion models for generating 3D worlds. Develop and experiment with large-scale diffusion models to introduce novel control signals, align with target aesthetic preferences, or optimize for efficient inference.
Collaborate closely with research and product teams to comprehend and translate product requirements into actionable technical roadmaps. Contribute actively to all phases of model development, including data curation, experimentation, evaluation, and deployment. Continuously investigate and integrate the latest research in diffusion and generative AI. Serve as a key technical resource within the team, mentoring peers and promoting best practices in generative modeling and machine learning engineering.
About Tavus
Tavus is at the forefront of innovation in human computing. Our mission is to develop AI Humans: an advanced interface that bridges the gap between individuals and machines, eliminating the friction found in current technologies. Our state-of-the-art human simulation models empower machines to see, hear, respond, and even exhibit realistic appearances, facilitating genuine, face-to-face interactions. AI Humans integrate the emotional insight of humans with the scalability and dependability of machines, making them reliable agents accessible 24/7, in any language, on our terms.
Imagine having access to an affordable therapist, a personal trainer that fits your schedule, or a team of medical assistants dedicated to providing personalized care for every patient. With Tavus, individuals, enterprises, and developers have the tools to create AI Humans that connect, comprehend, and act with empathy on a large scale.
We are a Series A company supported by esteemed investors such as Sequoia Capital, Y Combinator, and Scale Venture Partners.
Join us in shaping a future where machines and humans genuinely understand one another.
The Position
We are seeking an AI Researcher to join our core AI team and advance the frontiers of multimodal conversational intelligence.
If you excel in dynamic environments, enjoy transforming abstract concepts into functional code, and derive motivation from pushing the boundaries of possibility, this role is designed for you.
Your Responsibilities
Engage in research focusing on Foundational Multimodal Models specifically in the realm of Conversational Avatars (such as Neural Avatars and Talking-Heads).
Develop models for video, audio, and language sequences utilizing Autoregressive and Predictive Architectures (e.g., V-JEPA) and/or Diffusion methodologies, with a focus on temporal and sequential data rather than static images.
Collaborate closely with the Applied ML team to implement your research into production systems.
Remain at the forefront of multimodal learning and assist us in defining what “cutting edge” will mean in the future.
Ideal Candidate Profile
PhD (or nearing completion) in a relevant field, or equivalent practical research experience.
Experience in multimodal machine learning, particularly focused on conversational interfaces.
About Sygaldry Technologies
Sygaldry Technologies is at the forefront of innovation, developing quantum-accelerated AI servers designed to significantly enhance the speed of AI training and inference. By merging quantum computing with AI, we are navigating the challenges of increasing compute costs and energy constraints, paving the way towards superintelligence. Our AI servers leverage a diverse range of qubit types in a fault-tolerant architecture, achieving the necessary balance of cost, scalability, and speed for advanced AI applications. We are committed to pioneering new frontiers in physics, engineering, and AI, tackling the most complex challenges with a culture grounded in optimism and rigor. We seek individuals passionate about defining the convergence of quantum and AI and making a meaningful global impact.
About the Role
Generative AI is revolutionizing computational possibilities but reveals the limitations of classical hardware. While diffusion models yield remarkable outcomes, their iterative sampling and high-dimensional score estimation often lead to computational inefficiencies.
We are convinced that quantum computing holds the key to overcoming these challenges. As an ML Research Scientist, you will operate at the intersection of generative modeling and quantum acceleration, formulating theoretical foundations and practical applications that merge these domains.
Your focus will be on identifying areas where quantum methods can deliver substantial advantages in generative workflows, providing not just incremental enhancements but transformative improvements grounded in mathematical principles.
Your Responsibilities
Generative Model Architecture & Efficiency
Innovate state-of-the-art diffusion and score-based generative models.
Investigate computational bottlenecks in sampling, denoising, and likelihood estimation.
Design and evaluate novel solver techniques for diffusion ODEs/SDEs.
Quantum-Classical Integration
Discover mathematical structures in generative models that are suitable for quantum acceleration.
Prototype hybrid workflows that utilize quantum subroutines to enhance classical processes.
Conduct rigorous benchmarks comparing theoretical advantages against practical benefits in realistic scenarios.
Research to Production
Transform research findings into scalable implementations.
Collaborate with quantum hardware teams to guide architectural specifications.
Facilitate the integration of research insights into production environments.
Join latentlabs, a pioneering company at the forefront of biotechnology, as we seek a talented Machine Learning Researcher specializing in generative modeling. You will become part of a dynamic, interdisciplinary team comprising machine learning experts, protein engineers, and biologists, all committed to revolutionizing biological control and disease treatment. In this role, you will design innovative generative models aimed at creating new proteins that exhibit functionality in wet lab assays.
Full-time|On-site|San Francisco (London/Europe - OK)
Tavus – Multimodal AI Model Optimization
Research Engineer
At Tavus, we are pioneering the human aspect of AI technology. Our objective is to make human-AI interactions as seamless and natural as in-person conversations, allowing for a human touch in areas that were once considered unscalable.
We accomplish this through groundbreaking research in multimodal AI, focusing on human-to-human communication modeling (encompassing language, audio, and video) and the development of audio-visual avatar behaviors. Our innovative models drive applications ranging from text-to-video AI avatars to real-time conversational video experiences across sectors such as healthcare, recruitment, sales, and education.
By empowering AI to perceive, listen, and engage with an authentic human-like presence, we are laying the groundwork for the next generation of AI workers, assistants, and companions.
As a Series B company, we are supported by renowned investors, including Sequoia, Y Combinator, and Scale VC. Join us as we shape the future of human-AI interaction.
The Role
We are seeking an accomplished Research Scientist/Engineer with expertise in model optimization to be a vital part of our core AI team.
The ideal candidate thrives in dynamic startup environments, is adept at setting priorities independently, and is open to making calculated decisions.
We are moving swiftly and need individuals who can help navigate our path forward.
Your Mission
Transform state-of-the-art research models into fast, efficient, and production-ready systems through techniques such as sparsification, distillation, and quantization.
Oversee the optimization lifecycle for critical models: establish metrics, conduct experiments, and evaluate trade-offs among latency, cost, and quality.
Collaborate closely with researchers and engineers to convert innovative concepts into deployable solutions.
Requirements
Extensive experience in deep learning with PyTorch.
Practical experience in model optimization and compression, including knowledge distillation, pruning/sparsification, quantization, and mixed precision.
Familiarity with efficient architectures such as low-rank adapters.
Strong grasp of inference performance and GPU/accelerator fundamentals.
Proficient in Python coding and adherence to best practices in research engineering.
Experience with large models and datasets in cloud environments.
Capability to read ML literature, reproduce results, and modify ideas accordingly.
Join Our Innovative Team
Tavus is a forward-thinking research laboratory at the forefront of human-computer interaction. We are dedicated to creating AI Humans, a revolutionary interface that bridges the gap between individuals and machines, eliminating the friction commonly encountered in today's technology. Our advanced human simulation models empower machines to see, hear, respond, and even exhibit lifelike appearances, facilitating genuine, face-to-face interactions. By merging human emotional intelligence with machine efficiency, our AI Humans serve as reliable, empathetic agents available around the clock, in any language, tailored to user needs.
Picture a therapist within reach for everyone, a personal trainer that fits seamlessly into your schedule, or a comprehensive team of medical assistants providing focused attention to every patient. With Tavus, individuals and organizations can develop AI Humans to foster connection, understanding, and responsiveness on a grand scale.
As a Series A startup, we are backed by prestigious investors such as Sequoia Capital, Y Combinator, and Scale Venture Partners. Join us in crafting a future where humans and machines communicate effortlessly.
Your Role
We seek an AI Researcher to join our dynamic AI team and explore the frontiers of large language modeling within conversational AI.
If you excel in fast-paced startup settings, enjoy experimenting with innovative ideas, and are eager to see your contributions realized in production, you will thrive here.
Your Mission
Conduct in-depth research on large language modeling and its adaptation for Conversational Avatars (e.g., Neural Avatars, Talking-Heads).
Create methodologies to effectively model both verbal and non-verbal communication, dynamically adjusting avatar behaviors in real-time.
Experiment with fine-tuning, adaptation, and conditioning techniques to enhance the expressiveness, control, and task specificity of LLMs.
Collaborate with the Applied ML team to transition research from prototype to full-scale deployment.
Stay informed on the latest advancements and contribute to defining the next breakthroughs.
At Genmo, we are pioneering advancements in video generation technology through our state-of-the-art research lab. Our mission is to develop open models that contribute to the evolution of Artificial General Intelligence (AGI). Join us as we redefine the capabilities of AI and explore the vast potential of video generation.
Role Overview:
We are on the lookout for an outstanding Research Scientist specializing in diffusion models to be a part of our innovative team. Your primary focus will be on creating advanced diffusion models aimed at transforming text into captivating video content. This role places you at the cutting edge of AI research, where you will devise new architectures and algorithms to generate visually appealing and coherent videos from textual descriptions.
Join Cartesia as a Model Architecture Researcher
At Cartesia, our vision is to revolutionize AI by creating interactive intelligence that is seamlessly integrated into your daily life. Unlike current models, our goal is to develop systems capable of processing extensive streams of audio, video, and text (1 billion text tokens, 10 billion audio tokens, and 1 trillion video tokens) directly on devices.
As pioneers in innovative model architectures, our founding team, which originated from the Stanford AI Lab, has developed State Space Models (SSMs), a groundbreaking foundation for training efficient, large-scale models. Our diverse team merges deep expertise in model innovation with a design-focused engineering approach, allowing us to create and deploy state-of-the-art models and applications.
Backed by leading investors such as Index Ventures, Lightspeed Venture Partners, and many others, including industry veterans and advisors, we are poised to shape the future of AI.
Your Contribution
In this role, you will drive forward-thinking research in neural network architecture, focusing on alternative models like state space models, efficient transformers, and hybrid architectures.
Create innovative architectures that enhance model performance, inference speed, and adaptability in various environments, from cloud infrastructures to on-device implementations.
Develop advanced capabilities for models, including statefulness, long-range memory, and novel conditioning mechanisms to boost expressiveness and generalization.
Analyze architectural decisions and their effects on model characteristics such as scalability, robustness, latency, and energy consumption.
Create frameworks and tools to assess architectural advancements, benchmarking their performance in both research and production contexts.
Collaborate with interdisciplinary teams to translate architectural insights into scalable systems that deliver real-world impact.
Your Qualifications
Extensive experience in architecture design with a focus on advanced models such as state space models, transformers, and RNN/CNN variants.
In-depth understanding of the interplay between architectural designs and system constraints, particularly in cloud and on-device deployments.
Strong proficiency in the design and evaluation of neural network architectures.
Zyphra is an innovative artificial intelligence company located in the heart of San Francisco, California.
The Opportunity:
Join our dynamic team as a Research Engineer - Audio & Speech Models, where you will play a pivotal role in advancing Zyphra’s Audio Team. You will be instrumental in developing cutting-edge open-source text-to-speech and audio models. Your contributions will span the full spectrum of the model training process, from data collection and processing to the design of innovative architectures and training approaches.
Your Responsibilities:
Conduct large-scale audio training operations
Optimize the performance of our training infrastructure
Collect, process, and evaluate audio datasets
Implement architectural and methodological improvements through rigorous testing
What We Seek:
A strong research mindset with the ability to navigate projects from ideation to implementation and documentation.
Proficiency in rapid prototyping and implementation, allowing for swift experimentation.
Effective collaboration skills in a fast-paced research environment.
A quick learner who is eager to embrace and implement new concepts.
Excellent communication abilities, enabling you to contribute to both research and engineering tasks at scale.
Preferred Qualifications:
Expertise in training audio models, such as text-to-speech, ASR, speech-to-speech, or emotion recognition.
Experience with training audio autoencoders.
Solid understanding of signal processing, particularly in audio.
Familiarity with diffusion models, consistency models, or GANs.
Experience with large-scale (multi-node) GPU training environments.
Strong understanding of experimental methodologies for conducting rigorous tests and ablations.
Interest in large-scale, parallel data processing pipelines.
Competence in PyTorch and Python programming.
Experience contributing to large, established codebases with rapid adaptation.
Zyphra is a cutting-edge artificial intelligence firm headquartered in the vibrant city of San Francisco, California.
Position Overview:
As a Research Scientist specializing in Model Architectures, you will play a pivotal role in Zyphra’s AI Architecture Research Team. Your responsibilities will include the design and thorough evaluation of innovative model architectures and training methodologies aimed at enhancing essential modeling capabilities (e.g., loss per flop or loss per parameter) and tackling core limitations inherent in current models. You will collaborate closely with our pre-training team to ensure that your findings are seamlessly integrated into our next-generation models.
Qualifications:
A strong research acumen and intuition.
Proven ability to navigate research projects from initial conception to execution and final write-up.
Exceptional implementation and prototyping skills, with the capability to swiftly transform ideas into experimental outcomes.
A collaborative spirit and the ability to thrive in a fast-paced research environment.
A deep curiosity and enthusiasm for understanding intelligence.
Requirements:
Experience with long-term memory, RAG/retrieval systems, dynamic/adaptive computation, and alternative credit assignment strategies.
Knowledge of reinforcement learning, control theory, and signal processing techniques.
A passion for exploring and critically evaluating unconventional ideas, with the ability to maintain a unique perspective.
Familiarity with modern training pipelines and the hardware necessities for designing efficient architectures compatible with GPU hardware.
Strong understanding of experimental methodologies for conducting rigorous ablations and hypothesis testing.
High proficiency in PyTorch and Python programming.
Ability to quickly assimilate into large pre-existing codebases and contribute effectively.
Prior publication of machine learning research in reputable venues.
Postgraduate degree in a scientific discipline (e.g., Computer Science, Electrical Engineering, Mathematics, Physics).
Why Join Zyphra?
We emphasize a structured research methodology that systematically addresses ambitious challenges in AI.
Overview: At Spellbrush, we are revolutionizing the gaming experience by crafting a 3D first-person adventure game where an AI companion plays a pivotal role. Imagine MiSide enhanced with large language models, seamlessly integrated into gameplay rather than merely serving as a role-play chatbot.
About Us
At Spellbrush, we are dedicated to creating exceptional anime games, and we proudly stand as a global leader in generative AI. Our flagship project is niji・journey.
Our mission is straightforward: to use AI to animate characters and redefine narrative-driven gaming.
What We’re Creating
We have engineered an innovative in-house LLM storytelling system that merges AI, narrative, and gameplay, transcending the limitations of conventional chat-only encounters.
This results in an AI companion that collaborates with players in solving puzzles, retains memories across different worlds, and alters the progression of each chapter.
About The Role
Join our elite team to redefine video game experiences. Collaborate with leading minds in the industry, including the creator of Warudo and Cytoid, as well as a Google DeepMind veteran behind Project Astra, and top-tier AI researchers.
As an integral early member of this team, you will enjoy significant artistic and research autonomy in shaping what could be the next era of LLM-driven storytelling.
Remote|Remote|Remote-Friendly (Travel Required) | San Francisco, CA
Join Anthropic as a Senior Research Scientist on our Reward Models team, where you will spearhead groundbreaking research aimed at enhancing our understanding of human preferences at scale. Your innovative contributions will directly influence how our AI models, including Claude, align with human values and optimize for user needs. You will delve into the forefront of reward modeling for large language models, designing novel architectures and training methodologies for Reinforcement Learning from Human Feedback (RLHF). Your research will explore advanced evaluation techniques, including rubric-based grading, and tackle challenges such as reward hacking. Collaboration is key, as you'll work alongside teams in Finetuning, Alignment Science, and our broader research organization to ensure your findings result in tangible advancements in AI capabilities and safety. This role offers you an opportunity to address critical AI alignment challenges, leveraging cutting-edge models and substantial computational resources to further the science of safe and capable AI systems.
AI Financial Modeling Extern - F2 AI
Location: San Francisco, CA / In-Person or Remote
Commitment: 5+ hours per week | 4 - 12+ weeks
Compensation: $50/hr
About F2 AI
At F2 AI, we are revolutionizing private market investments. Our cutting-edge AI technology streamlines the process of analyzing complex, unstructured deal materials, transforming them into actionable, investment-grade insights in mere minutes. By empowering private credit funds, commercial banks, and private equity firms, we enable faster and more confident capital deployment. Supported by top-tier investors such as NFX and Y Combinator, we are committed to expanding our exceptional product and engineering teams, shaping the future of vertical AI for finance.
Role Overview
We are on the lookout for 1–2 exceptional externs with a strong foundation in Investment Banking or Private Equity to contribute to the development of AI-driven financial modeling on the F2 platform.
In this role, you will collaborate closely with our Engineering, Product, and Design (EPD) teams in the San Francisco office to translate institutional-level financial modeling standards into automated, intelligent workflows.
This hands-on experience will allow you to shape the future of AI in financial modeling.
Key Responsibilities
Educate F2 agents on best practices for financial modeling.
Create and standardize financial modeling templates optimized for AI execution using a first principles approach.
Establish formatting, structure, and best practices that align with institutional modeling standards.
Conduct rigorous quality assurance on AI-generated outputs to guarantee precision that meets investor expectations.
Test edge cases and assist in identifying potential failures in automated modeling workflows.
Ideal Candidate Profile
Possess prior experience in Investment Banking, Private Credit, or Private Equity with extensive exposure to financial modeling.
Demonstrated ability to build and audit complex 3-statement, LBO, or credit models from the ground up.
Strong understanding of model hygiene, structure, and institutional formatting standards.
Critical thinker who enjoys analyzing model logic and stress-testing systems.
Passionate about leveraging AI to enhance financial workflows.
Full-time|$273K/yr - $393K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY
At Scale AI, we are at the forefront of artificial intelligence, driving innovation through our advanced data, infrastructure, and tooling that empower the most sophisticated models worldwide. Our teams thrive at the intersection of pioneering research, extensive engineering, and practical deployment, collaborating with leading labs, enterprises, and government entities to explore the vast potential of Generative AI. As AI technology evolves from static models to dynamic, intelligent systems, Scale AI is dedicated to establishing the essential research foundations, evaluation methodologies, and reinforcement learning infrastructure that will shape this transformative era. Join our high-impact research organization, where you will contribute to advancing large language models, post-training evaluation, and agent-based reinforcement learning environments, influencing the future of AI development and implementation. As the Research Scientist Manager, you will spearhead a distinguished team of research scientists and engineers, define the strategic research roadmap, and oversee projects from initial prototyping to final deployment. You will excel in a fast-paced environment, harmonizing deep technical leadership with effective people management, visionary goal setting, and successful delivery.
Team Overview
The Human Data team at OpenAI is at the forefront of identifying and mitigating risks associated with advanced AI systems. Our mission is to enhance model reliability and public trust by designing thorough evaluations, uncovering vulnerabilities, and collaborating closely with researchers.
Role Overview
As a Technical Program Manager, you will spearhead initiatives aimed at assessing the safety and robustness of OpenAI’s models through innovative experimentation and methodical evaluation. Your role will involve orchestrating efforts across research and engineering teams, translating ambiguous risk signals into actionable research programs that will shape the future of AI model development and deployment.
We seek candidates who possess technical acumen, thrive in uncertain environments, and are passionate about pioneering the future of safe AI.
This position is based in San Francisco, CA, employing a hybrid work model of three days in the office each week, with relocation assistance available for new hires.
Key Responsibilities
Lead programs that investigate unexpected model behaviors and identify potential failure modes.
Convert ambiguous risk signals into clear priorities and actionable research agendas.
Design and execute innovative evaluations, experiments, and red-teaming initiatives.
Collaborate with research, product, and deployment teams to integrate findings into the model training and deployment pipelines.
Establish repeatable systems for monitoring model performance and interpreting emerging behavior patterns.
Ideal Candidate Profile
Proven experience in technical program management with exceptional organizational and communication abilities.
Familiarity with large language models, prompt engineering, or model evaluation methodologies.
Ability to manage fast-paced, high-uncertainty projects, shaping them from inception.
Creative and resourceful in developing novel methods for evaluating model behavior and performance.
Skilled in coordinating effectively across both technical and non-technical stakeholders to ensure alignment and execution.
About OpenAI
OpenAI is a pioneering AI research and deployment company committed to ensuring that general-purpose artificial intelligence benefits all of humanity. We continually push the boundaries of AI capabilities and strive to deploy them safely through our innovative products. Our mission is to harness the extraordinary potential of AI responsibly and equitably for a better future.
About Us
Tavus is an innovative research lab at the forefront of human computing technology. Our mission is to create AI Humans: advanced interfaces that bridge the gap between individuals and machines, eliminating the friction found in current systems. Our real-time human simulation models empower machines to see, hear, respond, and appear realistic, facilitating genuine, face-to-face conversations. With AI Humans, we blend the emotional intelligence inherent in humans with the extensive reach and reliability of machines, enabling them to serve as capable and trusted agents available 24/7, capable of communicating in any language.
Envision a therapist accessible to everyone, a personal trainer that tailors sessions to your schedule, or a fleet of medical assistants dedicated to providing personalized attention to every patient. Tavus enables individuals, enterprises, and developers to create AI Humans that connect, empathize, and act with understanding on a large scale.
Backed by prestigious investors like Sequoia Capital, Y Combinator, and Scale Venture Partners, we are a Series A company ready to shape the future of human-machine interaction.
Join us in transforming a future where humans and machines genuinely comprehend one another.
The Role
We are seeking a passionate AI Researcher to join our core AI team and advance the science of audio-visual avatar generation. If you thrive in dynamic startup environments, enjoy experimenting with generative models, and are excited to see your research translated into production, you will find a welcoming home here.
Your Mission
Conduct research and develop cutting-edge audio-visual generation models for conversational agents (e.g., Neural Avatars, Talking Heads).
Focus on models that intricately align with conversation flows, ensuring seamless integration of verbal and non-verbal cues.
Experiment with diffusion models (DDPMs, LDMs, etc.), long-video generation, and audio synthesis.
Collaborate closely with the Applied ML team to transition your research into practical applications.
Stay updated on the latest breakthroughs in multimodal generation and contribute to the evolution of this field.
You Will Excel If You Have:
A PhD (or nearing completion) in a relevant discipline, or equivalent hands-on research experience.
Proficiency in applying image/video generation techniques and a solid understanding of machine learning principles.
Oct 16, 2025