About the job
Our Mission
At Reflection AI, our goal is to develop open superintelligence and make it universally accessible.
We are pioneering open-weight models tailored for individuals, agents, enterprises, and even entire nations. Our diverse team comprises talented AI researchers and industry veterans from organizations such as DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic, and many more.
Role Overview
Construct and enhance distributed training systems that drive the pre-training of cutting-edge models.
Collaborate with research teams to design and execute extensive training runs for foundational models.
Create infrastructure that enables efficient training across thousands of GPUs using contemporary distributed training frameworks.
Enhance training throughput, stability, and efficiency for extensive model training tasks.
Work closely with pre-training researchers to convert experimental concepts into scalable, production-ready training systems.
Boost performance of distributed training jobs by optimizing communication, memory management, and GPU utilization.
Develop and maintain training pipelines that accommodate large-scale datasets, checkpointing, and iterative experiments.
Identify and resolve performance bottlenecks across distributed training systems, including model parallelism, GPU communication, and training runtimes.
Contribute to systems that enable rapid experimentation and iteration on novel training methods.