Senior Staff Machine Learning Engineer Enterprise Genai jobs in San Francisco – Browse 7,571 openings on RoboApply Jobs

Senior Staff Machine Learning Engineer Enterprise Genai jobs in San Francisco

Open roles matching “Senior Staff Machine Learning Engineer Enterprise Genai” with location signals for San Francisco. 7,571 active listings on RoboApply Jobs.

7,571 jobs found

1 - 20 of 7,571 Jobs
Apply
companyScale AI logo
Full-time|$218K/yr - $273K/yr|On-site|San Francisco, CA; New York, NY

At Scale AI, we are at the forefront of the AI revolution, providing the essential data infrastructure that empowers organizations to create and implement robust AI applications. Our partnerships with top enterprises and government entities accelerate their AI goals through innovative data annotation platforms, generative AI solutions, and comprehensive enterprise AI capabilities.Discover the General Agents TeamThe General Agents team, an integral part of Scale's Enterprise division, is dedicated to developing advanced general agents tailored for diverse customer applications. We operate at the cutting edge of agent technology, transforming sophisticated reasoning and agentic capabilities into dependable, production-ready systems that deliver substantial economic benefits. Our agents are designed for scalability, focusing on recurring enterprise challenges, with a strong emphasis on generalization, extensibility, and widespread deployment.Your Impact in This RoleAs a Senior/Staff Machine Learning Engineer on the General Agents team, you will be pivotal in architecting, building, and deploying production-grade AI agents that address significant enterprise challenges. Your role will encompass the entire agent lifecycle—from system design and model evaluation to deployment and iterative refinement—effectively merging cutting-edge agent techniques with the practicalities of real-world customer settings.You will:Create and implement comprehensive agent systems that integrate LLM reasoning, memory, tool usage, and control logic to tackle recurring enterprise challenges.Develop scalable and reliable agent architectures that can adapt to a variety of customer data and tools.Establish evaluation frameworks, datasets, environments, and metrics to assess agent performance, reliability, and business outcomes in live settings.Collaborate with product managers, clients, data annotators, and engineering teams to translate enterprise needs into robust agent designs.Transition cutting-edge agent techniques (e.g., planning, multi-step reasoning, tool utilization, multi-agent collaboration) into maintainable and observable systems.Oversee the deployment, monitoring, and iterative enhancement of agent systems, including failure analysis and continuous improvement based on actual usage.Guide the technical direction and architectural practices for general agent development, with increased scope and leadership at the Staff level.

Mar 26, 2026
Apply
companyDatabricks logo
Full-time|$190.9K/yr - $232.8K/yr|On-site|San Francisco, California

P-1285 About This Role Join Databricks as a Staff Software Engineer specializing in GenAI inference, where you will spearhead the architecture, development, and optimization of the inference engine that powers the Databricks Foundation Model API. Your role will be crucial in bridging cutting-edge research with real-world production requirements, ensuring exceptional throughput, minimal latency, and scalable solutions. You will work across the entire GenAI inference stack, including kernels, runtimes, orchestration, memory management, and integration with various frameworks and orchestration systems. What You Will Do Take full ownership of the architecture, design, and implementation of the inference engine, collaborating on a model-serving stack optimized for large-scale LLM inference. Work closely with researchers to integrate new model architectures or features, such as sparsity, activation compression, and mixture-of-experts into the engine. Lead comprehensive optimization efforts focused on latency, throughput, memory efficiency, and hardware utilization across GPUs and other accelerators. Establish and uphold standards for building and maintaining instrumentation, profiling, and tracing tools to identify performance bottlenecks and drive optimizations. Design scalable solutions for routing, batching, scheduling, memory management, and dynamic loading tailored to inference workloads. Guarantee reliability, reproducibility, and fault tolerance in inference pipelines, including capabilities for A/B testing, rollbacks, and model versioning. Collaborate cross-functionally to integrate with federated and distributed inference infrastructure, ensuring effective orchestration across nodes, load balancing, and minimizing communication overhead. Foster collaboration with cross-functional teams, including platform engineers, cloud infrastructure, and security/compliance professionals. Represent the team externally through benchmarks, whitepapers, and contributions to open-source projects. What We Look For A BS/MS/PhD in Computer Science or a related discipline. A solid software engineering background with 6+ years of experience in performance-critical systems. A proven ability to own complex system components and influence architectural decisions from conception to execution. A deep understanding of ML inference internals, including attention mechanisms, MLPs, recurrent modules, quantization, and sparse operations. Hands-on experience with CUDA, GPU programming, and essential libraries (cuBLAS, cuDNN, NCCL, etc.). A strong foundation in distributed systems design, including RPC frameworks, queuing, RPC batching, sharding, and memory partitioning. Demonstrated proficiency in diagnosing and resolving performance bottlenecks across multiple layers (kernel, memory, networking, scheduler).

Jan 30, 2026
Apply
companyAirbnb, Inc. logo
Full-time|$244K/yr - $305K/yr|Remote|Remote - USA

Airbnb began in 2007 with two hosts and three guests in San Francisco. Since then, the platform has grown to over 5 million hosts and more than 2 billion guests worldwide. Airbnb connects people with unique places to stay and experiences, building authentic community connections across nearly every country. The team: Growth Platform Engineering The Growth Platform team focuses on driving sustainable, long-term growth for Airbnb. The team’s mission centers on building an agentic system and supporting capabilities to help all Airbnb offerings grow, both now and in the future. Efforts include delivering personalized and relevant content and product experiences to users, both on and off the Airbnb platform. The team is working toward a future where AI identifies opportunities, creates campaigns, personalizes experiences, and optimizes outcomes with minimal human input. This journey moves through a maturity curve: AI-assisted, agentic, and ultimately autonomous systems, always with human oversight to ensure brand safety, quality, and compliance. Growth Platform Engineering is tightly integrated with the Airbnb product, enhancing the customer journey and enabling new ways for users to engage. The platform supports a range of digital marketing channels, landing pages, email, push notifications, SMS, and digital advertising, as well as the machine learning and data infrastructure that powers these efforts. What you will do Develop AI-driven solutions to shape the future of Airbnb’s agentic growth platform, using the latest AI methodologies. Lead and mentor engineers through brainstorming, design, and implementation of AI products and features, from initial concept to deployment. Work at the intersection of technical depth, architectural innovation, and mentorship as a Senior Staff Engineer. Collaborate with cross-functional teams to build scalable systems that operate globally. Help evolve the foundational elements of Airbnb’s AI-powered growth systems.

Apr 14, 2026
Apply
companyDatabricks logo
Full-time|$190K/yr - $285K/yr|On-site|San Francisco, California

The Applied AI team at Databricks is at the forefront of pioneering GenAI-powered products. In recent years, we have successfully launched the Databricks Assistant, AI/BI Genie, and Agent Bricks, collaborating with product teams to significantly enhance LLM quality for these offerings. These innovations are utilized by hundreds of thousands of Databricks users daily. We are dedicated to solving complex challenges such as code suggestion, error detection and correction, text-to-SQL generation, automatic pipeline generation, and knowledge QA.As we continue to evolve our GenAI products, we are looking for multiple GenAI Engineers at various experience levels to lead the next phase of our development. Our goals for 2025 include improving LLM quality, broadening GenAI capabilities across Databricks products, and reinforcing our platform architecture to facilitate seamless AI interactions on a large scale.

Feb 2, 2026
Apply
companyDatabricks logo
Full-time|$164.2K/yr - $205.2K/yr|On-site|San Francisco, California

The Applied AI team at Databricks is dedicated to pioneering advancements in GenAI-driven products. In recent years, we have successfully launched notable innovations such as the Databricks Assistant, AI/BI Genie, and Agent Bricks. These products are utilized by hundreds of thousands of Databricks users daily. We are addressing complex challenges such as code suggestions, error detection and correction, text-to-SQL generation, automatic pipeline creation, and knowledge QA. As our GenAI products continue to advance, we are on the lookout for multiple GenAI Engineers, ranging from junior to senior levels, to spearhead the next phase of development. In 2025, our focus will be on enhancing the quality of LLMs, broadening GenAI functionalities across Databricks products, and fortifying our platform architecture to facilitate seamless AI interactions at scale.

Feb 2, 2026
Apply
companyScale AI logo
Full-time|$218.4K/yr - $273K/yr|On-site|San Francisco, CA; New York, NY

Artificial Intelligence is increasingly becoming a pivotal element across all sectors of society. At Scale AI, we are committed to accelerating the evolution of AI applications. For nearly a decade, we have been the premier AI data foundry, propelling groundbreaking advancements in areas such as generative AI, defense applications, and autonomous vehicles. Following our recent investment from Meta, we are intensifying our efforts to develop advanced post-training algorithms that are essential for sophisticated agents in enterprises worldwide.The Enterprise ML Research Lab is at the forefront of this AI revolution, leveraging a suite of proprietary research, tools, and resources to support our enterprise clients. As a Staff Machine Learning Research Engineer focusing on Agent Post-training, you will be instrumental in creating our next-generation Agent Reinforcement Learning training platform. Your work will enable the training of top-tier Agents that deliver state-of-the-art results in real-world enterprise applications.You will incorporate cutting-edge research into our training framework, empowering ML Research Engineers on the Enterprise AI team to deploy use cases ranging from next-generation AI cybersecurity firewalls to training foundational healthtech search models. If you are passionate about shaping the future of the GenAI movement, we welcome your application!

Mar 26, 2026
Apply
companyRocket Money logo
Full-time|$210K/yr - $260K/yr|Hybrid|San Francisco, CA, Washington, D.C., New York City, N.Y., Denver, CO

We are looking for a talented individual who is local to any of our offices (Silver Spring, NYC, SF, Miami, Denver) and is eager to work at least 1-2 times per week from one of these locations.ABOUT ROCKET MONEY At Rocket Money, our mission is to empower individuals to take control of their financial lives. We provide our members with unparalleled insights into their finances and a suite of services that save them both time and money, enabling them to achieve their financial goals.ABOUT THE TEAM As Machine Learning Engineers at Rocket Money, we play a vital role in enhancing customer engagement with our diverse range of financial products. Our responsibilities include transaction enrichment, personalization, and creating cross-functional tools that bolster various AI initiatives. Collaborating closely with product teams, we develop features that aid customers in understanding, tracking, and improving their personal finances. We value team players who excel in cross-team collaboration, can align strategy with ML and AI-driven user experiences, deliver scalable and high-quality user experiences, and are mindful of the impact our products have on end users. At the Staff level, you will be expected to cultivate broad expertise in our products and the ML solutions that enhance them, while driving technical advancements within the team.ABOUT THE ROLE As a Staff Machine Learning Engineer, you will spearhead our ML and AI product development efforts, utilizing your expertise to design, implement, and maintain sophisticated ML systems that elevate our product experiences. Your responsibilities will include:Leading the architecture and development of advanced AI and ML features across Rocket Money's product suite, proactively identifying and addressing technical challenges.Designing and maintaining robust evaluation frameworks to ensure continual improvement of ML/AI systems and facilitating similar initiatives among others.Creating innovative product experiences that leverage our unique dataset and scalability, guiding others in delivering impactful results through effective technical leadership and collaboration with product teams.Overseeing the end-to-end development and implementation of ML and AI product features in partnership with cross-functional product teams, emphasizing thorough technical critique and clear communication of business impacts.Providing technical mentorship to foster an environment of high-impact contributions from all team members.

Jan 9, 2026
Apply
companyDoorDash Inc. logo
Full-time|$137.1K/yr - $246.8K/yr|Hybrid|San Francisco, CA; Sunnyvale, CA

Join us in creating the most dependable on-demand logistics engine for last-mile retail delivery! We are on the lookout for a seasoned machine learning engineer to aid in the development of cutting-edge growth and personalization models that will elevate DoorDash's expanding retail and grocery services.About the RoleWe are seeking a dedicated Applied Machine Learning expert to become part of our innovative team. As a Staff Machine Learning Engineer, you will conceptualize, design, implement, and validate algorithmic enhancements that enrich the growth and personalization experiences central to our rapidly evolving grocery and retail delivery business. Leveraging our advanced data and machine learning infrastructure, you will implement novel ML solutions to enhance the consumer search experience, making it more relevant, seamless, and enjoyable across grocery, convenience, and various retail sectors. A strong command of production-level machine learning and proven experience in addressing end-user challenges while collaborating effectively with multidisciplinary teams is essential.This position will report to the engineering manager on our Personalization team and is expected to be hybrid, combining both in-office and remote work (#LI-Hybrid).

Mar 11, 2026
Apply
companyFaire logo
Full-time|$268K/yr - $368.5K/yr|On-site|San Francisco, CA

About FaireFaire is a transformative online wholesale marketplace, driven by the conviction that local businesses are the future. Independent retailers around the globe generate more revenue than massive corporations like Walmart and Amazon combined, yet individually, they remain small. At Faire, we harness technology, data, and machine learning to connect this vibrant community of entrepreneurs. Think of your favorite local boutique — we empower them to discover and sell the best products from around the world. With our innovative tools and insights, we aim to level the playing field, enabling small businesses to thrive against larger competitors.By championing the growth of independent businesses, Faire positively impacts local economies on a global scale. We’re in search of intelligent, resourceful, and passionate individuals to join us in fueling the shop local movement. If you value community, we invite you to be part of ours.About this RoleAs the Senior Staff Machine Learning Platform Engineer, you will spearhead the technical vision and evolution of Faire's ML platform. You will establish standards, influence organization-wide architecture, and lead intricate, cross-functional initiatives that enhance data science velocity at scale. This position is crucial for adapting ML workflows to leverage modern AI productivity tools. You will not only develop models but also design the systems that enable those models to empower tens of thousands of small retailers in competing and growing their local businesses.

Mar 4, 2026
Apply
companyDecagon logo
Full-time|Remote|San Francisco

Join Decagon as a Staff Software Engineer specializing in Machine Learning Infrastructure. In this role, you will play a crucial part in enhancing and optimizing our machine learning systems. You will collaborate with a talented team of engineers to build scalable and efficient infrastructure that supports our AI-driven initiatives.As a key contributor, you will leverage your expertise in software engineering and machine learning to solve complex challenges and drive innovation. Your work will impact various projects and help shape the future of our technology.

Feb 24, 2026
Apply
companyFaire logo
Full-time|$224K/yr - $308K/yr|On-site|San Francisco, CA

About FaireAt Faire, we are revolutionizing the wholesale marketplace with an unwavering commitment to local communities. Our platform empowers independent retailers globally, enabling them to thrive against larger competitors like Walmart and Amazon. By leveraging cutting-edge technology, data insights, and machine learning, we connect these vibrant entrepreneurs with the best products from around the world. We believe that with the right tools, small businesses can elevate their potential and compete on a grand scale.By nurturing independent businesses, Faire is making a significant positive impact on local economies worldwide. We are in search of intelligent, resourceful, and passionate individuals to join our mission of championing local commerce. If you resonate with our community-driven values, we'd love to welcome you to our team.About this roleAs a Staff Machine Learning Platform Engineer, you will play a pivotal role in shaping, enhancing, and managing a scalable machine learning platform designed to expedite model training, deployment, and governance. You will serve as the vital technical link between our data science and production engineering teams. Joining a small but integral team, you will amplify Faire’s capabilities to support tens of thousands of local businesses in an increasingly competitive retail landscape.

Mar 4, 2026
Apply
companyHive logo
Full-time|On-site|San Francisco

Join Hive as a Senior Machine Learning Engineer and help shape the future of AI! We are seeking passionate individuals who excel at developing and deploying cutting-edge deep learning models. In this role, you will work with large-scale datasets to create innovative machine learning solutions, collaborating closely with a talented team of engineers to push the boundaries of artificial intelligence. Ideal candidates will have a proven track record of building and scaling machine learning projects from conception to production, along with a strong commitment to continuous learning and personal ownership in their work.

Dec 10, 2021
Apply
companyAmbience Healthcare logo
Full-time|$250K/yr - $250K/yr|Hybrid|San Francisco

About Us:At Ambience Healthcare, we aspire to redefine healthcare technology. We are creating an AI intelligence platform that brings humanity back to healthcare while delivering significant ROI for health systems nationwide.Our cutting-edge technology enables healthcare providers to concentrate on exceptional patient care by alleviating the administrative tasks that detract from their critical responsibilities. Ambience provides real-time, coding-aware documentation and clinical workflow support across various healthcare settings, including ambulatory, emergency, and inpatient environments, partnering with top health systems across North America.We are relentless in our pursuit of excellence, exhibiting extreme ownership as we develop optimal solutions for our health system partners. We value transparency, positivity, and profound insight — holding each other to high standards because the challenges we tackle are of utmost importance.Ambience has been recognized as the leading company for improving clinician experience in the KLAS Research Emerging Solutions Top 20 Report, named one of the Next Big Things in Tech by Fast Company, and selected as one of the best AI companies in healthcare by Inc. Additionally, we were honored as a LinkedIn Top Startup in 2024 and 2025. Our esteemed investors include Oak HC/FT, Andreessen Horowitz (a16z), OpenAI Startup Fund, and Kleiner Perkins — and we’re just getting started.The Role:As a Staff Machine Learning Engineer on the Frontier AI team at Ambience, you will tackle the most challenging model quality issues across our clinical AI products, including foundational coding models, adaptive scribing, voice agents, long-context chart understanding, and clinical reasoning. This role focuses on research direction, designing learning loops, and driving comprehensive improvements in model quality over time.Ambience delivers advanced clinical AI solutions in real-world healthcare environments. The models that fuel our products operate under unique constraints, including proprietary ontologies, complex electronic health record (EHR) data, stringent compliance requirements, and clinician workflows where both latency and accuracy are critical. You will leverage your deep research instincts and engineering rigor to push the boundaries of what is possible.Our engineering roles are hybrid, requiring in-office attendance at our San Francisco location three days a week.

Mar 17, 2026
Apply
companyScale AI logo
Full-time|$275K/yr - $350K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY

About Scale AI At Scale AI, we are dedicated to propelling the advancement of AI applications. Over the past eight years, we have established ourselves as the premier AI data foundry, supporting groundbreaking innovations in fields such as generative AI, defense technologies, and autonomous vehicles. Following our recent Series F funding round, we are intensifying our efforts to harness frontier data, paving the way toward achieving Artificial General Intelligence (AGI). Our work with enterprise clients and governments has enhanced our model evaluation capabilities, allowing us to expand our offerings for both public and private evaluations. About the ACE Team The Agent Capabilities & Environments (ACE) team, a vital part of Scale’s Research organization, unites customer-focused Researchers and Applied AI Engineers. Our primary mission is to conduct research on agent environments and reinforcement learning reward signals, benchmark autonomous agent performance in real-world contexts, and develop robust data programs aimed at enhancing the capabilities of Large Language Models (LLMs). We are committed to creating foundational tools and frameworks for evaluating models as agents, focusing on autonomous agents that interact dynamically with a wide range of external environments, including code repositories and GUI interfaces. About This Role This position sits at the cutting edge of AI research and its practical applications, concentrating on the data types necessary for the development of state-of-the-art agents, including browser and software engineering agents. The ideal candidate will investigate the data landscape required to propel intelligent and adaptable AI agents, steering the data strategy at Scale to foster innovation. This role demands not only expertise in LLM agents and planning algorithms but also creative problem-solving skills to tackle novel challenges pertaining to data, interaction, and evaluation. You will contribute to influential research publications on agents, collaborate with customer researchers, and partner with the engineering team to transform these advancements into scalable real-world solutions.

Mar 26, 2026
Apply
companyEnigmaio logo
Full-time|Hybrid|New York, NY, San Francisco, CA or Los Angeles, CA

Join our innovative Match Team as a Senior Machine Learning Engineer at Enigmaio, where you will play a pivotal role in developing advanced algorithms and models that enhance our matching capabilities. You will collaborate with cross-functional teams to design, implement, and optimize machine learning solutions that drive business outcomes.

Apr 10, 2026
Apply
companyTaskrabbit logo
Full-time|$148K/yr - $200K/yr|Hybrid|San Francisco, California, United States

About Taskrabbit:Taskrabbit is an innovative marketplace platform that seamlessly connects individuals with Taskers to manage everyday home tasks, including furniture assembly, handyman services, moving assistance, and much more.At Taskrabbit, we aim to transform lives one task at a time. We celebrate innovation, inclusion, and hard work, fostering a collaborative, pragmatic, and fast-paced culture. We seek talented, entrepreneurially minded, data-driven individuals who possess a passion for empowering others to pursue their passions. In partnership with IKEA, we are creating more opportunities for individuals to earn a consistent, meaningful income on their terms by establishing enduring relationships with clients in communities globally.Taskrabbit operates as a hybrid company, with team members located across the US and EU, and has been recognized as a Built In — Best Places to Work for 2022, 2023, and 2024, receiving accolades across various national and regional categories. Join us at Taskrabbit, where your contributions will be significant, your ideas appreciated, and your potential maximized!This position operates on a hybrid schedule, requiring two days of in-office collaboration per week. It can be based in our San Francisco office or our new New York City office (opening March 2026).About the RoleMachine Learning is a foundational element at Taskrabbit, and we are in search of an experienced Senior Machine Learning Engineer to join our team and help mold the future of ML/AI at Taskrabbit. This distinct, full-stack role is designed for someone who is enthusiastic about the entire machine learning lifecycle—from initial research and model development to constructing the robust infrastructure necessary for deploying and scaling your innovations.As a Senior Machine Learning Engineer, you will engage with exciting challenges that directly influence how users discover and interact with home services on the Taskrabbit platform. You will play a vital role in enhancing our capabilities in areas such as search ranking, content discovery, and recommendation systems. Collaborating closely with data scientists and fellow engineers, you will design and implement cutting-edge algorithms, ensuring the scalability, reliability, and optimization of our models in production alongside software engineers.

Feb 17, 2026
Apply
companyVSCO logo
Full-time|$240K/yr - $260K/yr|On-site|San Francisco, CA

About VSCO At VSCO, we empower photographers with an innovative platform that provides essential tools, a vibrant community, and the visibility needed for creative and professional growth. We cultivate an authentic creative environment that welcomes photographers of all skill levels, offering a space that inspires opportunity, collaboration, and connection. Our mission is to support photographers in their journeys, enabling them to thrive and connect with fellow creatives and businesses through our comprehensive suite of tools, available on both mobile and desktop. We seek individuals who are passionate and proactive in advancing our mission. Our team members have the opportunity to make a significant impact, and we believe that collaborative efforts yield stronger results. Our core values are essential to our team culture and guide our hiring process. Learn more about what you can expect when joining VSCO on our Careers Page. About The Role As a Senior Machine Learning Engineer, you will harness the power of AI and machine learning to create innovative, reliable user-facing product features. You will leverage your extensive technical background and hands-on experience in deploying machine learning models to deliver impactful solutions based on real-world feedback. Your focus on measurable outcomes and customer satisfaction drives your work, blending innovation with practical implementation. You will be highly skilled in Python and adept across the data and machine learning stack, enabling you to develop and launch models efficiently while ensuring scalability and maintainability. Whether working with traditional algorithms or cutting-edge deep learning and generative AI, you will expertly navigate the complexity of each problem, managing every phase from defining the challenge to deployment and iterative improvement. Your dedication to software engineering excellence will inform your thoughtful approach to system design for machine learning, encompassing data quality, pipeline design, feature workflows, model serving, and ongoing monitoring and enhancement. By integrating machine learning deeply within our cohesive product experiences, you will collaborate effectively with cross-functional teams, aligning on objectives, defining success metrics, and driving meaningful outcomes. You will stay informed about the rapidly evolving AI landscape, maintaining a discerning perspective that allows your team to focus on significant advancements while avoiding distractions. The Day to Day Design and implement ML-powered features for search, discovery, personalization, and more.

Mar 23, 2026
Apply
companyScale AI, Inc. logo
Full-time|$289.8K/yr - $362.3K/yr|On-site|San Francisco, CA; New York, NY

The Enterprise Machine Learning team at Scale AI is at the forefront of the AI revolution, collaborating closely with clients to pinpoint significant business challenges and develop advanced AI systems. By leveraging Scale’s proprietary research, data, and infrastructure, we unlock domain expertise through exceptional data quality and expert feedback. As the Director of Enterprise Machine Learning, you will spearhead a talented team of research scientists and engineers, charting the research roadmap and overseeing the journey from initial prototyping to full deployment. You will excel in a dynamic environment, adeptly balancing technical leadership with people management, visionary planning, and effective delivery. This position is perfect for a leader who thrives amidst uncertainty, possesses a deep understanding of the cutting-edge capabilities and limitations of generative AI, and is driven by the desire to transform research into robust, production-ready systems.

Mar 31, 2026
Apply
company
Full-time|On-site|San Francisco

The OpportunityJoin us at ComfyOrg as a Senior/Staff Applied Machine Learning Engineer! We are on the hunt for a passionate innovator who is enthusiastic about optimizing model inference. You will play a pivotal role in developing the heart of ComfyUI, our cutting-edge visual AI platform. Your expertise will help us push the limits of AI model performance, making them run faster and more efficiently than ever before.Are You a Match?You are fascinated by model inference, memory management, and torch optimizations.You possess experience in writing production-level PyTorch code that challenges performance standards.You have a passion for understanding the inner workings of AI models.You thrive on developing highly optimized code that consistently delivers results.You believe that the current landscape of ML deployment holds significant room for improvement.Your Responsibilities:Develop and enhance the core inference engine that drives ComfyUI.Optimize large models for speed and memory efficiency.Collaborate with our core team to architect new features.Tackle complex technical challenges within the visual AI domain.Contribute to the future direction of our technology.Experience with diffusion or LLM models, as well as creating custom nodes for ComfyUI, is highly beneficial.

May 29, 2025
Apply
companyOrchard logo
Full-time|On-site|San Francisco

Join Orchard as a Machine Learning Engineer and play a pivotal role in transforming data into actionable insights. In this dynamic position, you will leverage your expertise in machine learning algorithms and data analysis to develop innovative solutions that enhance our products and services.We are looking for a proactive team player who thrives in a fast-paced environment and possesses strong problem-solving skills. You will collaborate with cross-functional teams, engage with large datasets, and contribute to the design and implementation of machine learning models.

Mar 14, 2026

Sign in to browse more jobs

Create account — see all 7,571 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.