Post Training Applied Ai Researcher jobs in San Francisco – Browse 4,482 openings on RoboApply Jobs

Post Training Applied Ai Researcher jobs in San Francisco

Open roles matching “Post Training Applied Ai Researcher” with location signals for San Francisco. 4,482 active listings on RoboApply Jobs.

4,482 jobs found

1 - 20 of 4,482 Jobs
Apply
companyDistyl AI logo
Full-time|$130K/yr - $250K/yr|On-site|San Francisco

About Distyl AIDistyl AI is at the forefront of developing production-grade AI systems that enhance core operational workflows for Fortune 500 companies. Our innovative solutions, powered by a strategic alliance with OpenAI and bolstered by in-house software accelerators, deliver AI systems with rapid time-to-value, often within just a quarter.Our cutting-edge products have successfully transformed the operations of Fortune 500 clients across a multitude of sectors, including insurance, consumer packaged goods, and non-profit organizations. As a member of our team, you will be instrumental in assisting companies to identify, construct, and unlock the potential of their GenAI investments, frequently for the first time. We pride ourselves on being customer-centric, addressing client challenges directly, and holding ourselves accountable for generating financial impact while enhancing the experiences of end-users.Led by distinguished leaders from prestigious organizations such as Palantir and Apple, Distyl is also supported by renowned investors including Lightspeed, Khosla, Coatue, Dell Technologies Capital, Nat Friedman (former CEO of GitHub), and Brad Gerstner (Founder and CEO of Altimeter), alongside board members from over a dozen Fortune 500 companies.

Oct 16, 2025
Apply
companyThinking Machines Lab logo
Post-Training Researcher

Thinking Machines Lab

Full-time|$350K/yr - $475K/yr|On-site|San Francisco

At Thinking Machines Lab, our mission is to empower humanity by advancing collaborative general intelligence. We strive to build a future where everyone has access to the knowledge and tools essential for making AI work effectively for their unique objectives.Our team comprises scientists, engineers, and innovators who have contributed to some of the most widely adopted AI products, including ChatGPT and Character.ai, as well as notable open-weight models like Mistral and popular open-source projects such as PyTorch, OpenAI Gym, Fairseq, and Segment Anything.About the RoleThe Post-Training Researcher position is pivotal to our roadmap. It serves as a crucial connection between raw model intelligence and a system that is genuinely beneficial, safe, and collaborative for human users.This role uniquely combines fundamental research with practical engineering, as we do not differentiate between these functions internally. Candidates will be expected to produce high-performance code and analyze technical reports. This position is ideal for individuals who relish both deep theoretical inquiry and hands-on experimentation, aiming to influence the foundational aspects of AI learning.Note: This position is classified as an 'evergreen role', meaning we continuously accept applications in this research domain. Given the high volume of applications, an immediate match for your skills and experience may not always be available. However, we encourage you to apply; we regularly review submissions and reach out as new opportunities arise. You are welcome to apply again after gaining more experience, but we ask that you refrain from applying more than once every six months. Additionally, specific postings for singular roles may be available for distinct projects or team needs, in which case you are welcome to apply directly in conjunction with this evergreen role.What You’ll DoDevelop and Optimize Recipes: Refine post-training recipes, encompassing various datasets, training stages, and hyperparameters, while assessing their impact on multiple performance metrics.Iterate on Evaluations: Engage in a continuous process of defining evaluation metrics, optimizing them, and recognizing their limitations. You will be accountable for enhancing performance metrics and ensuring they are meaningful.Debug and Analyze: During the fine-tuning of training configurations, you may encounter results that appear inconsistent. You will be responsible for troubleshooting and cultivating a deeper understanding to apply to subsequent challenges.Scale and Investigate: Assess and expand the capabilities of our models while exploring potential improvements.

Nov 23, 2025
Apply
companyOpenAI logo
Full-time|On-site|San Francisco

Role overview OpenAI is looking for a Researcher focused on Agentic Post-Training, based in San Francisco. This role centers on analyzing and improving how AI systems behave after their initial training. The goal is to broaden the capabilities of AI and refine how models respond in complex situations. What you will do Study and assess agentic behaviors in trained AI models Create new approaches to strengthen these behaviors after training Collaborate with a talented team on projects that shape the future of artificial intelligence research Collaboration and impact This position involves hands-on research with other specialists at OpenAI. The work directly supports the advancement of AI capabilities and helps define new benchmarks for agentic performance in artificial intelligence.

Apr 23, 2026
Apply
companyThinking Machines Lab logo
Post-Training Researcher

Thinking Machines Lab

Full-time|$350K/yr - $475K/yr|On-site|San Francisco

At Thinking Machines Lab, our mission is to empower humanity by advancing collaborative general intelligence. We envision a future where everyone can harness the knowledge and tools necessary for AI to serve their unique needs and aspirations. Our team comprises scientists, engineers, and builders who have developed some of the most widely utilized AI products, such as ChatGPT and Character.ai, as well as open-weight models like Mistral and popular open-source projects including PyTorch, OpenAI Gym, Fairseq, and Segment Anything.About the RoleThe role of a Post-Training Researcher is pivotal to our strategic vision. This position serves as the essential link between raw model intelligence and a practical, safe, and collaborative system for human users.Our research in post-training data sits at the intersection of human insights and machine learning. By integrating human and synthetic data techniques alongside innovative methodologies, we capture the subtleties of human behavior to inform and guide our models. We investigate and model the mechanisms that derive value for individuals, enabling us to articulate, predict, and enhance human preferences, behaviors, and satisfaction. Our objective is to translate research concepts into actionable data through meticulously planned data labeling and collection initiatives, while also understanding the science behind high-quality data that effectively trains our models. Additionally, we develop and assess quantitative metrics to evaluate the success and impact of our data and training strategies.Beyond execution, we explore new paradigms for human-AI interaction and scalable oversight, experimenting with optimal ways for humans to supervise, guide, and collaborate with models. This interdisciplinary role merges research, data operations, and technical implementation, pushing the boundaries of aligned, human-centered AI systems.This position combines foundational research and practical engineering, as we do not differentiate between these roles internally. You will be expected to write high-performance code and comprehend technical reports. This role is perfect for individuals who thrive on deep theoretical exploration and hands-on experimentation, eager to shape the foundational aspects of AI learning.Note: This is an evergreen role that we maintain continuously to express interest in this research area. We receive a high volume of applications, and while there may not always be an immediate fit for your skills and experience, we encourage you to apply. We regularly review applications and reach out to candidates as new opportunities arise. You are welcome to reapply after gaining more experience, but please limit applications to once every six months. You may also notice postings for specific roles for targeted positions.

Nov 23, 2025
Apply
companyBaseten logo
Full-time|Remote|San Francisco

Join Baseten as a Post-Training Applied Researcher, where you will be at the forefront of innovative research applications. Your expertise will help bridge the gap between training and real-world applications, making a tangible impact in the industry.

Mar 17, 2026
Apply
companyOpenAI logo
Full-time|On-site|San Francisco

OpenAI is hiring a Software Engineer for Post-Training Research in San Francisco. This position centers on improving the performance and capabilities of advanced machine learning models after their initial training phase. Role overview Work closely with a skilled team to explore new ways of strengthening AI systems. The focus is on researching and developing methods that push the boundaries of what these models can achieve once training is complete. Collaboration Expect to contribute to ongoing research efforts and share insights with colleagues who are passionate about advancing AI. Teamwork and knowledge exchange are key parts of this role. Location This position is based in San Francisco.

Apr 29, 2026
Apply
companyBaseten logo
Full-time|On-site|San Francisco

Join Baseten as a Post-Training Research Engineer and contribute to groundbreaking advancements in machine learning and AI. In this role, you will leverage your engineering skills to analyze and enhance models post-training, ensuring optimal performance and efficiency.

Mar 23, 2026
Apply
companyScale AI logo
Full-time|$252K/yr - $315K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY

At Scale AI, we collaborate with leading AI laboratories to supply high-quality data and foster advancements in Generative AI research. We seek innovative Research Scientists and Research Engineers with a strong focus on post-training techniques for Large Language Models (LLMs), including Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and reward modeling. This position emphasizes optimizing data curation and evaluation processes to boost LLM performance across text and multimodal formats. In this pivotal role, you will pioneer new methods to enhance the alignment and generalization of extensive generative models. You will work closely with fellow researchers and engineers to establish best practices in data-driven AI development. Additionally, you will collaborate with top foundation model labs, providing critical technical and strategic insights for the evolution of next-generation generative AI models.

Mar 26, 2026
Apply
companyLetta logo
Full-time|On-site|San Francisco Office

Advancing Self-Improving SuperintelligenceAt Letta, we are on a mission to revolutionize artificial intelligence by creating self-improving agents that learn and adapt like humans. Unlike current AI systems that are often rigid and brittle, our innovative approach aims to build adaptable AI that continually evolves through experience.Founded by the visionaries behind MemGPT at UC Berkeley's Sky Computing Lab, the birthplace of Spark and Ray, we are backed by notable figures in AI infrastructure, including Jeff Dean and Clem Delangue. Our agents are already enhancing production systems for industry leaders such as 11x and Bilt Rewards, continually learning and improving in real-time.Join our elite team of researchers and engineers dedicated to tackling AI's most significant challenges: creating machines that can reason, remember, and learn as humans do.This position requires in-person attendance (no hybrid options) at our downtown San Francisco office, five days a week.

Feb 4, 2025
Apply
companyGeneralist logo
Full-time|On-site|San Francisco Bay Area (San Mateo) or Boston (Somerville)

About the RoleIn the realm of machine learning, pretraining lays the foundation for a general model, while post-training refines that model, enhancing its utility, controllability, safety, and performance in real-world applications. As a Post-Training Research Scientist, you will transform large pretrained robot models into production-ready systems through methodologies such as fine-tuning, reinforcement learning, steering, human feedback, task specialization, evaluation, and on-robot validation at scale. This position offers a unique opportunity for individuals from diverse backgrounds to evolve into full-stack ML roboticists, adept at swiftly identifying challenges across machine learning and control domains. This is where innovative research converges with practical implementation.Your Responsibilities Include:Crafting fine-tuning and adaptation strategies tailored for specific robotic tasks and embodiments.Developing methodologies to enhance reliability, robustness, and controllability of robotic systems.Establishing evaluation frameworks to assess real-world robot performance beyond just offline metrics.Collaborating with ML infrastructure teams to optimize inference-time performance, including latency, stability, and memory usage.Utilizing advanced techniques such as imitation learning, reinforcement learning, distillation, synthetic data, and curriculum learning.Bridging the gap between model outputs and tangible outcomes in the physical world.You Might Excel in This Role If You:Possess experience in fine-tuning large models for downstream applications, including RLHF, imitation learning, reinforcement learning, distillation, and domain adaptation.Have a background in embodied AI, robotics, or real-world machine learning systems.Demonstrate a strong commitment to evaluation, benchmarking, and failure analysis.Are comfortable troubleshooting and debugging across the entire ML stack, from analyzing loss curves to understanding robot behavior.Enjoy rapid iteration and thrive on real-world feedback loops.Aspire to connect foundational models with practical deployment scenarios.About GeneralistAt Generalist, we are dedicated to realizing the vision of general-purpose robots. We envision a future where industries and homes benefit from collaborative interactions between humans and machines, enabling us to achieve more than ever before. Our focus is on building embodied foundation models, starting with dexterity, and advancing the frontiers of data, models, and hardware to empower robots to intelligently engage with their environments.

Feb 12, 2026
Apply
company
Full-time|On-site|San Francisco

About UsAt Applied Compute, we are pioneering the development of Specific Intelligence for enterprises, creating agents that continuously learn from a company’s processes, data, expertise, and objectives. Our mission is to bridge the gap between isolated AI capabilities and their effective application within real business environments. Traditional AI systems often fall short as they lack the ability to adapt based on feedback. Our innovative continual learning layer captures context, memory, and decision-making processes across the enterprise, enabling specialized agents to engage in meaningful work.What Excites Us: We operate at the exciting intersection of product development and cutting-edge research. Our product team designs the platform that empowers a new generation of digital coworkers, while our research team drives advancements in post-training and reinforcement learning to enhance user experiences. As an applied research engineer, you will work directly with clients to implement models in production, combining robust product development with deep research insights to facilitate AI integration in enterprises.Meet Our Team: Our diverse team consists of engineers, researchers, and operators, many of whom are former founders. We have previously built reinforcement learning infrastructure at OpenAI, established data foundations at Scale AI, and contributed to significant systems at companies like Together, Two Sigma, and Watershed. We collaborate with Fortune 50 clients, including DoorDash, Mercor, and Cognition, and are proud to be backed by reputable investors such as Benchmark, Sequoia, and Lux.Who Thrives Here: We seek individuals who are passionate about applying innovative research and complex systems to solve real-world challenges. You should feel comfortable navigating new environments rapidly—be it a fresh codebase, a client’s data architecture, or an unfamiliar problem domain. A genuine enjoyment for customer interaction, empathy, and a deep understanding of their operational workflows are essential. Candidates with entrepreneurial backgrounds, extensive side projects, or a proven track record of end-to-end ownership typically excel in our environment.

Oct 29, 2025
Apply
companyDistyl AI logo
Full-time|On-site|San Francisco

About Distyl AIDistyl AI specializes in creating high-performance AI systems that enhance the fundamental operational processes of Fortune 500 companies. Through a strategic alliance with OpenAI, proprietary software accelerators, and extensive expertise in enterprise AI, we deliver effective AI solutions with swift time-to-value, often within a quarter.Our innovations have empowered Fortune 500 clients in various sectors, including insurance, consumer packaged goods, and non-profit organizations. Joining our team means you will assist organizations in recognizing, developing, and extracting value from their Generative AI investments, frequently for the first time. We prioritize customer needs, working backward from the client's challenges and ensuring we generate financial benefits while enhancing the experiences of end-users.Distyl is guided by seasoned leaders from top-tier companies like Palantir and Apple and enjoys backing from prominent investors including Lightspeed, Khosla, Coatue, Dell Technologies Capital, Nat Friedman (Former CEO of GitHub), Brad Gerstner (Founder and CEO of Altimeter), along with board members from numerous Fortune 500 firms.What We Are Looking ForAt Distyl, we are at the forefront of leveraging AI within enterprises. We seek imaginative researchers who aspire to go beyond incremental enhancements on benchmarks and are eager to redefine the application of software in innovative ways.Our researchers hail from diverse academic disciplines but possess a robust research background, operate in an AI-centric manner, and would find conventional research environments unfulfilling.Key ResponsibilitiesThe AI Systems team is dedicated to architecting complex, comprehensive solutions that integrate perception, reasoning, planning, and execution. Researchers amalgamate various components (LLMs, retrievers, evaluators, memory systems, and execution agents) into resilient, scalable systems that deliver consistent performance across dynamic enterprise workflows.Researchers in AI Systems examine the principles governing intricate system interactions. They analyze coordination, information flow, and emergent behavior across multiple agents and models. Their research reveals the foundational mechanics of robustness, composability, and alignment, ultimately establishing the design paradigm for constructing intelligent systems.

Oct 16, 2025
Apply
companyCartesia logo
Full-time|On-site|*HQ - San Francisco, CA

Join Cartesia: Pioneering AI InnovationAt Cartesia, we are on a mission to redefine the landscape of artificial intelligence. Our goal is to create the next generation of AI that is interactive, ubiquitous, and capable of continuous reasoning across vast streams of audio, video, and text data. With an impressive foundation built on our pioneering work in State Space Models (SSMs) at the Stanford AI Lab, our team is uniquely positioned to advance model architectures that will make on-device reasoning a reality.Backed by prominent investors like Index Ventures and Lightspeed Venture Partners, along with a network of 90+ advisors, including top experts in AI, we are committed to pushing the boundaries of model innovation and systems engineering.About the RoleWe believe that the next significant advancement in model intelligence will stem from enhanced post-training methods and alignment strategies. As a Post-Training Researcher, you will be at the forefront of developing systems and methodologies that ensure our multimodal models are not just adaptive, but also aligned with human intentions.In this role, you will collaborate across machine learning research, alignment, and infrastructure, crafting innovative techniques for preference optimization, model evaluation, and feedback-driven learning. You will investigate how feedback signals can enhance reasoning capabilities across various modalities while establishing the necessary infrastructure to scale and improve these processes.Your contributions will be pivotal in shaping the learning and improvement trajectory of Cartesia’s foundational models, ultimately enhancing their connection with users.Your ImpactLead research initiatives aimed at enhancing the capabilities and alignment of multimodal models.Create cutting-edge post-training methods and evaluation frameworks to assess model advancements.Collaborate closely with research, product, and platform teams to establish best practices for specialized model development.Design, debug, and scale experimental systems to ensure reliability and reproducibility throughout training cycles.Convert research insights into production-ready systems that enhance model reasoning, consistency, and alignment with human values.

Oct 21, 2025
Apply
companyBaseten logo
Full-time|On-site|San Francisco

Join Baseten as a Post-Training Research Scientist, where you will play a vital role in advancing our machine learning capabilities. In this position, you will have the opportunity to conduct innovative research, analyze data, and contribute to the development of cutting-edge technologies. Your work will directly impact our projects and enhance the performance of our models.

Mar 17, 2026
Apply
companyGranica logo
Full-time|On-site|Bay Area Office

About GranicaGranica is an innovative AI research and infrastructure firm dedicated to creating reliable, steerable representations of enterprise data.We establish trust through Crunch, a policy-driven health layer optimizing large tabular datasets for efficiency, reliability, and reversibility. Utilizing this foundation, we are developing Large Tabular Models—systems designed to learn cross-column and relational structures, delivering trustworthy answers and automation with integrated provenance and governance.Our MissionCurrent AI capabilities are hindered not only by model design but also by the inefficiencies of the data that supports it. At scale, each redundant byte, poorly organized dataset, and inefficient data pathway contributes to significant costs, latency, and energy waste.Granica’s mission is to eliminate these inefficiencies. We leverage groundbreaking research in information theory, probabilistic modeling, and distributed systems to craft self-optimizing data infrastructure: systems that continually enhance how information is represented and utilized by AI.Led by Prof. Andrea Montanari from Stanford, Granica’s Research group merges advances in information theory with learning efficiency in large-scale distributed systems. We collectively believe that the next significant leap in AI will originate from innovations in efficient systems, rather than merely larger models.Granica is at the forefront of developing a new category of structured AI models: foundational models designed to learn and reason from the relational, tabular, and structured data that drives the global economy. While many focus on unstructured text or media, we are venturing into the next frontier: systems capable of comprehending and reasoning over structured information.Your ContributionsCreate and prototype algorithms that form the core of structured AI, enhancing representation learning and efficient information modeling for enterprise and tabular data at petabyte scale.Develop adaptive learners merging statistical learning theory with systems optimization at scale, contributing to a new generation of foundational models for structured information.Design architectures that unify symbolic, relational, and neural components, enabling AI systems to reason directly over structured enterprise data.Construct cost models and optimization frameworks that enhance the efficiency of structured learning, both computationally and economically.

Nov 13, 2025
Apply
companyAfterQuery logo
Full-time|$250K/yr - $450K/yr|On-site|San Francisco

About AfterQuery AfterQuery builds training data and evaluation frameworks used by leading AI labs around the world. The team partners with advanced research groups to create high-quality datasets and run detailed evaluations that go beyond standard benchmarks. As a small, post-Series A company based in San Francisco, every team member plays a key role in shaping how future AI models learn and improve. Role Overview The Post-Training Research Scientist focuses on proving the impact of AfterQuery's datasets. This work involves designing and running training experiments to isolate how specific data influences model performance. Projects span Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) post-training, with an emphasis on measuring effects on capability, generalization, and alignment. Working closely with partner labs, the scientist turns data into clear, verifiable results: showing exactly how a dataset leads to measurable improvements under defined conditions. The work is experimental and directly shapes the value of AfterQuery's products. What You Will Do Run controlled SFT and RL experiments to measure how datasets affect model outcomes. Quantify gains in areas like reasoning, tool use, long-horizon tasks, and specialized workflows. Share findings with partner labs to support sales and demonstrate value. Work with internal subject matter experts to improve data quality based on experimental results. What We Look For Strong background in LLM training and evaluation methods. Curiosity about how data structure, selection, and quality shape model behavior. Skill in designing experiments, executing quickly, and drawing practical insights from complex results. Comfort working across fields such as finance, software engineering, and policy. Focus on real-world implementation, not just theory. Research experience at the undergraduate or master's level is preferred; a PhD is not required. Compensation $250,000 - $450,000 total compensation plus equity

Apr 14, 2026
Apply
companyNetic logo
Full-time|On-site|San Francisco

At Netic, we are revolutionizing the essential services sector with our advanced AI-driven revenue engine, which supports the backbone of the American economy.Backed by $43M in funding from illustrious investors such as Founders Fund, Greylock, Hanabi, and Dylan Field, who spearheaded our Series B, we have empowered our clients to secure hundreds of thousands of jobs across various service industries throughout North America. Our platform has enabled companies to operate with an AI-first approach.Join our innovative team of relentless builders hailing from renowned organizations like Scale, Databricks, HRT, Meta, MIT, Stanford, and Harvard. Together, we are applying frontier AI to solve complex challenges in the physical economy, where data is intricate and the results are both immediate and impactful.As an Applied AI Research Engineer, you will immerse yourself in pioneering research, gain a thorough understanding of the business functions we automate, and lead targeted machine learning projects that yield remarkable outcomes.

May 30, 2025
Apply
companyOpenAI logo
Full-time|On-site|San Francisco

Join Our Innovative TeamAt OpenAI, we are pioneering the field of artificial intelligence, empowering innovation and shaping the future through transformative research. Our mission is to democratize AI, ensuring its benefits are accessible to all. We are on the lookout for forward-thinking Research Engineers to join our Applied Group, where you will convert groundbreaking research into practical applications that can revolutionize industries, enhance human creativity, and tackle complex challenges.Your Impactful RoleAs a Research Engineer within OpenAI's Applied Group, you will collaborate with some of the brightest minds in AI. Your work will involve deploying cutting-edge models in production settings, transforming theoretical breakthroughs into impactful solutions. If you are passionate about making AI technology accessible and effective, this is your opportunity to leave a significant impact.In this role, you will:Innovate and Deploy: Create and implement advanced machine learning models addressing real-world issues. Translate OpenAI's research from theory to practice, developing AI-driven applications that make a meaningful difference.Collaborate with Experts: Engage closely with researchers, software engineers, and product managers to comprehend intricate business challenges and deliver AI-based solutions. Become part of a vibrant team where creativity and ideas flourish.Optimize and Scale: Develop scalable data pipelines, fine-tune models for peak performance and precision, and ensure readiness for production. Contribute to projects that leverage state-of-the-art technology and innovative methodologies.Learn and Lead: Stay at the forefront of advancements in machine learning and AI. Participate in code reviews, share insights, and exemplify best practices to maintain high standards in engineering.Make a Difference: Oversee and maintain deployed models, ensuring they consistently deliver value. Your contributions will directly shape how AI benefits individuals, businesses, and society as a whole.You may excel in this position if you possess:A Master's or PhD in Computer Science, Machine Learning, Data Science, or a related discipline.Proven experience in deep learning and transformer models.Expertise with frameworks such as PyTorch or TensorFlow.A robust understanding of data structures, algorithms, and software engineering principles.Experience with cloud platforms and deploying machine learning models in production.

May 22, 2024
Apply
company
Full-time|On-site|San Francisco

About Liquid AIFounded as a spin-off from MIT CSAIL, Liquid AI specializes in the development of versatile artificial intelligence systems optimized for performance across various deployment environments, ranging from data center accelerators to on-device hardware. Our focus on low latency, minimal memory consumption, privacy, and reliability allows us to partner effectively with enterprises in sectors such as consumer electronics, automotive, life sciences, and financial services. As we experience rapid growth, we are eager to welcome talented individuals who can contribute to our mission.The OpportunityThis unique position places you at the forefront of advanced foundation models and their practical applications. You will oversee post-training projects from start to finish for some of the world’s leading enterprises, while also playing a vital role in the ongoing development of Liquid’s core models.In this role, you will not have to choose between impactful customer work and foundational development; instead, you will enjoy deep involvement in both. You will have significant influence over how models are adapted, assessed, and deployed, directly contributing to the enhancement of Liquid’s post-training capabilities.If you are passionate about data integrity, evaluation processes, and ensuring that models perform effectively in real-world scenarios, this is your chance to redefine the standards of applied AI at a foundation-model company.What We're Looking ForWe seek an individual who:Takes ownership: You will lead post-training initiatives from customer requirements to delivery and evaluation.Thinks end-to-end: You will connect the dots across data generation, training, alignment, and evaluation as a cohesive system.Is pragmatic: You prioritize model quality and customer satisfaction over theoretical publications.Communicates clearly: You can interpret customer needs and effectively communicate with internal technical teams, providing constructive feedback when necessary.The WorkServe as the technical lead for post-training engagements with enterprise clients.Translate client requirements into actionable post-training specifications and workflows.Design and implement data generation, filtering, and quality assessment methodologies.Conduct supervised fine-tuning, preference alignment, and reinforcement learning processes.Create task-specific evaluations, analyze outcomes, and integrate insights back into core post-training workflows.

Jan 23, 2026
Apply
companyDistyl AI logo
Full-time|$130K/yr - $250K/yr|On-site|San Francisco

Join Our Team at Distyl AIDistyl AI is at the forefront of developing advanced, production-grade AI systems that enhance operational workflows for Fortune 500 companies. Through our strategic alliance with OpenAI, proprietary software accelerators, and extensive enterprise AI expertise, we deliver effective AI solutions with rapid results, often in a matter of weeks.Our innovative products have transformed operations for clients across various sectors, including insurance, consumer packaged goods, and non-profit organizations. By joining our dynamic team, you will play a crucial role in helping clients discover, build, and leverage value from their Generative AI investments, frequently for the first time. We prioritize a customer-first approach, focusing on their challenges and holding ourselves accountable for generating financial impact while enhancing user experiences.Distyl is led by accomplished leaders from prestigious companies like Palantir and Apple, with backing from top-tier investors including Lightspeed, Khosla, Coatue, and Dell Technologies Capital, among others.Your RoleAt Distyl, we are redefining the application of AI in enterprise environments. We seek imaginative researchers who aspire to go beyond mere incremental advancements and, instead, aim to innovatively reshape how software is utilized.Our research team comprises individuals from diverse academic backgrounds with robust research accomplishments. We operate in an AI-native manner and thrive in an environment that encourages unconventional thinking.Key ResponsibilitiesSimilar to generating hypotheses in AI for scientific exploration, the System Discovery team focuses on identifying new classes of AI systems. Researchers will investigate various architectures, modalities, and combinations to discover how AI can fundamentally transform work processes.Make connections across different domains to unveil new system archetypes. Engage in experimental problem definitions and develop prototypes that facilitate novel forms of human–AI collaboration.Who You AreBroad Systems Development Experience: You possess a diverse range of experience, having built systems in various paradigms, including retrieval pipelines, reasoning agents, evaluation harnesses, multimodal integrations, or workflow graph systems, and can distill these experiences into innovative solutions.

Oct 16, 2025

Sign in to browse more jobs

Create account — see all 4,482 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.