Engineering Manager At Saris Ai San Francisco jobs in San Francisco – Browse 12,785 openings on RoboApply Jobs

Engineering Manager At Saris Ai San Francisco jobs in San Francisco

Open roles matching “Engineering Manager At Saris Ai San Francisco” with location signals for San Francisco. 12,785 active listings on RoboApply Jobs.

12,785 jobs found

1 - 20 of 12,785 Jobs
Apply
company
Full-time|On-site|San Francisco

Saris AI develops advanced AI automation for the banking sector, with teams in San Francisco, Montreal, and Toronto. The company addresses large-scale automation challenges, focusing on long-context reasoning, integrating with legacy systems, and meeting strict compliance needs. Saris AI’s AI agents are already active in production, supporting real customer workflows as the business expands. Role overview The San Francisco engineering team is seeking an Engineering Manager. This leader will guide engineers through shifting priorities and frequent ambiguity. The position involves building and leading teams, managing projects from concept to launch, and ensuring the delivery of reliable software that powers AI-driven products. What you will do Build and lead engineering teams focused on automation and AI Oversee software projects from initial idea through production launch Adapt to changing requirements and priorities as the company grows Address the technical challenges unique to deploying software in AI systems Requirements Proven experience building and leading engineering teams History of delivering software projects from start to finish Comfort working in environments where priorities and requirements shift quickly Understanding of the complexities involved in AI-driven software deployment Location This role is based in San Francisco.

Apr 24, 2026
Apply
company
Full-time|On-site|San Francisco

Join Saris AI as an AI Systems Engineer, where you will play a pivotal role in designing and implementing innovative solutions that leverage artificial intelligence technologies. You will collaborate closely with cross-functional teams to develop AI systems that enhance our products and services, driving impactful results for our clients.

Mar 30, 2026
Apply
company
Full-time|On-site|San Francisco

About the RoleJoin our innovative team at saris-ai, a pioneering applied AI startup based in San Francisco and Montreal. We are on a mission to redefine the banking industry by addressing a $100 billion annual challenge that is rapidly evolving. Our cutting-edge multi-turn AI agentic systems are at the forefront of this transformation.Our objective is to develop automated solutions that require sophisticated long-context reasoning, seamless tool integration across legacy systems, and stringent compliance mechanisms, particularly in scenarios where conventional answers are elusive.With successful real-world deployments and a rapidly expanding client base, we are seeking passionate and technically adept builders eager to make a significant impact from the start.We are in search of an AI Sales Engineer who will play a pivotal role in enhancing productivity and efficiency within financial institutions through innovative AI solutions. In this role, you will lead discovery calls with various departments to identify workflow challenges and design impactful AI-driven solutions. You will serve as the technical authority during sales cycles, assisting clients in recognizing problems, framing solutions, and articulating ROI.This position combines elements of customer engagement and workflow analysis.Your Key Responsibilities Include:Conduct discovery and technical scoping calls with potential clients to uncover operational inefficiencies in financial institutions.Develop workflow maps and reports to identify bottlenecks and pinpoint opportunities for agentic AI automation.Collaborate with the Sales team to co-manage the pre-sales process from initial discovery to proposal delivery.Engage with executive stakeholders, presenting clear and compelling insights on how Agentic AI can deliver measurable ROI and transform operations at scale.Work closely with cross-functional teams including sales, customer success, and product to ensure customer satisfaction and success.

Dec 18, 2025
Apply
company
Full-time|On-site|San Francisco

Join Our Team as a Machine Learning EngineerSaris-AI is a pioneering applied AI startup, based in San Francisco and Montreal, focused on revolutionizing the banking sector. Our mission is to address a colossal $100 billion/year challenge that is rapidly expanding, innovating the limits of what can be achieved with advanced multi-turn AI systems.We aim to automate complex workflows that necessitate long-context reasoning, orchestration of tools across legacy systems, and rigorous compliance processes—solving problems that currently lack definitive solutions.Our team has successfully deployed AI agents that manage real customer workflows effectively in production. As we expand our customer base and accelerate our growth, we are in search of highly skilled technical builders who aspire to make a significant impact in the early stages of our journey.As a foundational Machine Learning Engineer, you will own our entire ML stack and bring custom agents to life.

Dec 12, 2025
Apply
company
Full-time|On-site|San Francisco

Join Saris AI as a Staff Backend Software Engineer, where you will play a pivotal role in developing innovative solutions that harness the power of artificial intelligence. In this dynamic position, you will collaborate with cross-functional teams, design scalable systems, and contribute to the overall architecture of our backend solutions.

Apr 9, 2026
Apply
company
Full-time|On-site|San Francisco

About the RoleJoin Saris-AI, an innovative applied AI startup based in San Francisco and Montreal, where we are revolutionizing the banking industry. Our mission is to address a $100 billion annual challenge, growing rapidly each quarter, as we explore the potential of advanced multi-turn AI systems.We focus on automation challenges that require extensive contextual reasoning, tool integration with existing systems, and adherence to compliance requirements—problems that lack straightforward solutions.Our successful deployment of real AI agents managing customer workflows has led to a burgeoning client base, and we are expanding quickly. We seek highly skilled technical professionals who aspire to make a significant impact from the outset.Core Engineering TeamWe are in search of a passionate Software Developer who excels in dynamic and ambiguous settings. You should have experience guiding products from the initial version to large-scale implementation, and a penchant for crafting sustainable, elegant systems.Your Key ResponsibilitiesConceive, develop, test, deploy, maintain, and enhance AI-driven software solutions tailored for the fintech sector.Manage project timelines effectively, prioritizing tasks to deliver exceptional features.Contribute to product and business strategies as a key player, influencing both technical frameworks and overarching goals.Encourage and uphold best practices, including code reviews and testing.Candidate ProfileProven experience in backend/fullstack engineering, particularly within SaaS projects with significant impact.Extensive knowledge of Python (Django, Flask, FastAPI), TypeScript (React), Docker, and cloud platforms (AWS/GCP).Solid track record in designing, constructing, and optimizing distributed systems.Strong foundation in both relational and non-relational databases (PostgreSQL, Redis).Expertise in API design (REST, GraphQL) and deployment practices.Preferred QualificationsExperience building intricate workflow orchestration systems.Familiarity with cloud infrastructure.Background in creating integrations or data pipelines.A passion for mentoring and developing team capabilities.

Dec 12, 2025
Apply
company
Full-time|On-site|San Francisco

Position OverviewJoin us at saris-ai as a Senior Product Manager, where you will play a pivotal role in crafting, documenting, and implementing intricate banking back-office workflows. This position encompasses comprehensive product development focused on lending and deposit processes.You will bridge the gap between lending regulations, operational methodologies, and business objectives by translating these into detailed product specifications, data models, validation criteria, design prototypes, and automation setups that engineering and operations teams can seamlessly execute.Additionally, you will spearhead user experience research initiatives and leverage AI-driven design tools to efficiently produce workflow prototypes and interaction specifications that serve as a foundation for engineering teams.

Feb 26, 2026
Apply
company
Full-time|On-site|San Francisco

Saris AI develops applied AI solutions for the banking sector, with teams in San Francisco, Montreal, and Toronto. The company builds automation tools that handle complex, long-context reasoning and agent-driven decision-making. Reliability and compliance shape every product, and Saris AI's agents already manage real customer workflows in production. As revenue grows, the engineering team is expanding to enhance current offerings and explore new directions. The Senior Machine Learning Engineer role is based in San Francisco and sits within the core engineering group. The team works in a collaborative, early-stage setting, balancing infrastructure needs with the delivery of features that serve customers directly. What you will do Build and maintain machine learning infrastructure, such as evaluation frameworks, prompt management systems, and tools for model observability. Develop new AI features for customers while supporting and improving the underlying infrastructure. Shape strategies for evaluation, LLM routing, prompt engineering, and model selection. Set practical standards to boost quality without slowing down development. Guide technical direction by clarifying trade-offs and architectural choices. Requirements Minimum 4 years of experience in machine learning or AI engineering, including production deployment of ML systems. Direct experience with large language models, prompt engineering, evaluation techniques, and model routing. Background in building tools and systems that deliver value to users. Comfort making pragmatic trade-offs and recognizing when a solution is sufficient. Ability to navigate ambiguity, define problems, and deliver results independently. Strong focus on end users and understanding the impact of ML decisions on customer experience. Supports team growth through code reviews, collaboration, and clear technical communication. Bonus Experience in regulated industries, especially banking.

Apr 24, 2026
Apply
company
Full-time|On-site|San Francisco

About Saris AISaris AI specializes in developing AI-driven workflow agents that enhance and automate back-office operations specifically for banks and credit unions. Our solutions operate in highly regulated, high-stakes environments where precision, trustworthiness, and explainability are prioritized alongside rapid execution. We focus on designing for intricate systems, real-world operators, and sustainable impacts rather than just eye-catching demos.The RoleWe are seeking a Senior Product Designer who excels in navigating complex challenges. In this pivotal role, you will spearhead the design efforts across our core product areas, transforming convoluted workflows, outdated systems, and regulatory frameworks into intuitive, reliable user experiences. This position offers substantial ownership and influence over the product's strategic direction.You will collaborate closely with co-founders, product managers, and engineers to redefine the way banks and credit unions engage with AI workflow agents, particularly for users who are more domain experts than technical specialists.What You’ll DoLead end-to-end design for key product surfaces, including discovery, workflows, interaction models, and visual systems.Design user interfaces for enterprise software alongside complex, multi-step workflows.Translate banking and operational requirements into seamless user experiences.Partner with PMs to clarify problems, outline solutions, and navigate trade-offs.Collaborate with engineering teams to guarantee superior execution.Design with a focus on trust, transparency, and user control in AI-driven systems.Develop and maintain design systems, patterns, and documentation to support product scaling.Engage in user research, usability testing, and iterative refinement with real customers.What We’re Looking For6+ years of product design experience, including substantial time in enterprise or B2B software environments.Demonstrated success in designing complex systems such as dashboards, workflows, administrative tools, and regulated products.Proficiency in understanding and synthesizing user challenges.Strong capabilities in interaction design and information architecture.Experience collaborating closely with engineering teams in dynamic product environments.Ability to balance the needs of users with technical constraints and business objectives.

Mar 11, 2026
Apply
company
Full-time|On-site|San Francisco

Join a pioneering team of former Google engineers who have developed ground-breaking defensive technologies, such as Safe Browsing and reCAPTCHA. We are on a mission to confront an urgent challenge: combating the rising tide of adversarial AI attacks that threaten organizations globally.Operating in stealth mode, we are targeting a lucrative $5B+ market that is primed for innovation. Conventional detection methodologies are proving inadequate against the speed and sophistication of AI-driven assaults. Current adversaries are leveraging AI to engineer tailored, high-evasion attacks, leaving traditional systems vulnerable.Your Role:You will design a network of AI agents that are rapid, cost-effective, and precise, collaborating to identify and neutralize emerging threats. Your work will dive deep into real-time threat data, continuously evolving your agents in a fast-paced environment. These agents will function under an orchestration layer that fosters quick adaptation and learning.The Excitement of the ChallengeRapidly Evolving Models: The landscape changes daily; solutions that worked yesterday may be outdated today.Intelligent Adversaries: We are engaged in a real-time arms race against cunning, AI-enhanced attackers crafting sophisticated payloads.No Existing Playbook: We are forging new detection paradigms as swiftly as threats evolve. This high-stakes work places you in the heart of the action from day one.If you thrive on solving challenging problems with rapid feedback, this is your opportunity.Why We Are Positioned to SucceedExpansive Market: The market is vast at $5B and expanding quickly, while established players struggle to adapt.Proven Track Record: Our team has previously developed the foundational technology for Safe Browsing (serving over 5B users) and reCAPTCHA (protecting more than 5M websites) during our time at Google.Experienced Team: This is our third endeavor in creating a category-defining security enterprise, and we know how to scale our technology and our organization effectively.Deeply Integrated AI and Security: We embed AI from the outset rather than layering it on top.Top Talent: We hire only the highest achievers; many on our team were in the top 1% of engineers at Google. If you excelled in your previous role, you will fit right in.Agility: We prioritize speed and efficiency in everything we do.

Sep 10, 2025
Apply
company
Full-time|On-site|San Francisco

Join Our Innovative Team at Simple AIAt Simple AI, we are transforming enterprise communications through cutting-edge voice AI agents. Our highly realistic agents empower leading companies such as DoorDash, xAI, and Omaha Steaks to efficiently manage a variety of phone operations, including customer support, order processing, and lead qualification.As we experience exponential growth and a surge in customer demand, we are seeking a few founding software engineers to help us navigate this exciting journey and develop systems for the future. Our passionate team operates from our vibrant San Francisco office five days a week, driven by a shared vision of leveraging AI to make a significant impact in the world.We are proud to be backed by esteemed investors and operators, including Y Combinator, Massive Tech Ventures, and industry leaders such as Michael Seibel (Twitch), Jared Friedman (Scribd), and more.

Oct 8, 2025
Apply
company
Full-time|On-site|San Francisco

About Saris AIAt Saris AI, we are pioneering the future of work within the banking sector from our innovative bases in San Francisco and Montreal. Our applied AI startup is tackling an unprecedented $100 billion annual challenge, which is rapidly expanding, by pushing the limits of multi-turn AI agent systems.Our mission is to address complex automation challenges that necessitate long-context reasoning, tool orchestration across legacy systems, and stringent compliance protocols—where solutions are not yet defined.With a proven track record, we've successfully deployed real agents that manage actual customer workflows. As we expand our customer base and scale our operations, we seek highly technical individuals eager to make a significant early impact.The RoleWe are in search of a dynamic AI Product Manager specializing in Implementation and Onboarding to spearhead AI transformation initiatives in the back offices of banks and credit unions. You will be responsible for guiding customers through their transformation journeys, from initial scoping to full AI deployment, ensuring seamless execution and optimal return on investment. This role merges client-facing collaboration with product-centric documentation and delivery.This position encompasses both customer engagement and product scoping.Your Mission IncludesOverseeing onboarding and implementation across customer departments, ensuring that timelines, scopes, and expectations are synchronized.Conducting discovery calls to identify use cases, edge cases, and integration requirements.Drafting product scoping documents and comprehensive blueprints for agentic AI workflows.Engaging with executive stakeholders, offering clear and compelling insights on how Agentic AI contributes to measurable ROI and transforms operations on a large scale.Collaborating with cross-functional teams including sales, customer success, and product to guarantee customer satisfaction.Who You Are5+ years of experience in implementation, product management, consulting, or enterprise onboarding—preferably with AI products in the banking or credit union sector.Domain experience within the U.S. banking or credit union landscape.Proficient in workflow mapping, use case identification, and translating client needs into product specifications.Excellent communication skills to convey technical concepts effectively to diverse stakeholders.

Dec 18, 2025
Apply
company
Full-time|On-site|San Francisco Office

fractional-ai is hiring a Software Engineer to join the team onsite in San Francisco. This role centers on building and improving technology that advances artificial intelligence. Role overview As a Software Engineer, you will work closely with others to develop and support projects that drive the company’s AI initiatives forward. The work involves collaborating with colleagues and contributing technical solutions in a team setting. What you will do Develop software for AI-driven projects Collaborate with team members in the San Francisco office Contribute ideas and technical expertise to ongoing initiatives Requirements Interest in technology and artificial intelligence Ability to work onsite in San Francisco Strong motivation to learn and contribute to team projects

Apr 28, 2026
Apply
company
Full-time|On-site|San Francisco

Role OverviewLocation: San Francisco, CA Work Model: In-officeAbout Effective AIAt Effective AI, we are pioneering the future of work, focusing on sophisticated AI solutions for complex knowledge tasks rather than simple, repetitive functions. Our goal is to develop advanced AI Teammates that excel in intricate workflows and collaborate seamlessly with human professionals. Our initial challenge is to revolutionize the trillion-dollar U.S. Property & Casualty insurance sector, an area rich in data and complexity, ideal for our innovative approach.We have successfully secured $10 million in seed funding from prominent investors including Lightspeed Ventures and Valor Equity Partners.Our passionate team operates out of San Francisco, where we value in-person collaboration to address these pressing challenges.Your ResponsibilitiesAs a Founding Software Engineer, you will be an integral member of our founding team, significantly influencing the design and development of our core product from inception. You will confront critical challenges in agentic AI, crafting Teammates capable of managing essential insurance functions such as underwriting and claims processing.Specifically, you will:Facilitate long-term task completion by creating the foundational architecture for AI agents to plan and execute multi-step processes reliably over prolonged interactions.Develop advanced reasoning functionalities, enabling agents to make informed decisions based on ambiguous or incomplete information, akin to human experts.Create intelligent, tool-utilizing agents capable of selecting and employing a variety of external tools—APIs, databases, web searches, and Excel-based pricing algorithms—to gather information and take decisive actions.Design adaptive and learning systems, equipping our AI Teammates to learn from feedback and adjust to evolving conditions such as regulatory changes or market dynamics.Your ProfileWe seek an innovative and driven builder excited to tackle substantial challenges.You possess a solid foundation in computer science and remarkable problem-solving capabilities, evidenced by impactful projects or previous experience.Your genuine enthusiasm for AI fuels your passion for tackling complex issues.

Aug 19, 2025
Apply
company
Full-time|On-site|SF Office - 171 2nd, 4th floor

About the RoleWe are on the lookout for a skilled Forward Deployed AI Engineer to join our dynamic team at Stack AI. This position is crucial for implementing enterprise-level AI solutions with an emphasis on Retrieval-Augmented Generation (RAG) pipelines and large language model (LLM) workflows. Your contributions will significantly enhance our offerings to Fortune 500 companies and enterprises across diverse sectors.Role Overview:In this role, you will seamlessly integrate large language models into enterprise systems, collaborating with strategic accounts to tailor solutions that meet their technical needs. Utilizing the Stack AI platform, you’ll engage with clients to co-create innovative solutions that address their evolving challenges.Key Responsibilities:Enhance and maintain solutions for strategic accounts using the Stack AI platform.Analyze and document requirements and relationships within target enterprise offices.Identify opportunities and provide insights to shape our go-to-market strategy.Directly contribute to the Stack AI codebase, transforming customer feedback into enhancements for the Python backend and React/Next.js TypeScript frontend.Project future opportunities and secure high-value contracts.Draft proposals, present to stakeholders, and lead engaging product demonstrations.Promote Stack AI at enterprise conferences and events.

Jun 27, 2025
Apply
company
Full-time|On-site|San Francisco

About David AIDavid AI is pioneering the audio data research landscape, applying a rigorous R&D approach to dataset development akin to the methodologies used in AI labs for model creation. Our goal is to integrate AI seamlessly into real-world applications, with audio serving as the perfect entry point due to its inherent versatility and human connection. As the audio AI field progresses, the demand for high-quality training data becomes critical, and that's where David AI excels.Founded in 2024 by a team of experienced engineers and operators from Scale AI, David AI has quickly gained traction, serving prominent clients among FAANG companies and AI research labs. Recently, we secured a $50M Series B funding round from esteemed investors including Meritech, NVIDIA, Jack Altman (Alt Capital), Amplify Partners, and First Round Capital.Our team embodies intelligence, humility, ambition, and a close-knit ethos. We are on the lookout for exceptional talent in research, engineering, product, and operations to join us in advancing the frontiers of audio AI.About Our Engineering TeamAt David AI, our engineering team is responsible for constructing the pipelines, platforms, and models that convert raw audio into valuable data for top AI labs and enterprises. We pride ourselves on our collaborative environment, comprising product engineers, infrastructure specialists, and machine learning experts dedicated to leading the charge in audio data research.We operate at a fast pace, taking ownership of our projects from conception through to production. Our team develops real-time processing pipelines capable of managing terabytes of audio data daily while deploying innovative generative audio models.About This RoleAs a Product Engineer at David AI, you will design and implement state-of-the-art tools that enable our users to leverage audio data effectively for training their AI models. You will collaborate closely with researchers to continuously refine our data collection methodologies.Your ResponsibilitiesDeliver full-stack features that will be utilized by thousands of users on a daily basis.Develop scalable systems that create essential data processing pipelines, extracting actionable insights from terabytes of audio data each day.Construct, deploy, and assess LLM and DSP-based solutions to enhance our clients' comprehension of intricate features within our datasets.Rapidly iterate on research hypotheses by collaborating with researchers and the operations team to deploy enhancements efficiently.

Feb 24, 2025
Apply
company
Full-time|On-site|San Francisco

About Simple AIAt Simple AI, we are pioneering cutting-edge voice AI solutions tailored for enterprises. Our lifelike voice agents empower renowned companies such as DoorDash, xAI, and Omaha Steaks in managing diverse phone operations ranging from customer support to order intake and lead qualification. As we experience rapid revenue growth, we seek a couple of founding software engineers to help us meet current customer demands while developing innovative systems for the future. Our team is passionate about harnessing AI to transform the world, and we operate from our San Francisco office five days a week.Simple AI is proudly supported by top-tier investors and industry leaders, including Y Combinator, Massive Tech Ventures, and prominent figures like Michael Seibel (co-founder of Twitch), Jared Friedman (Scribd), Alexandr Wang (Scale AI), Matt Van Horn (June Oven and Lyft), Suhail Doshi (Mixpanel), JJ Fliegelman (WayUp), and Scott Wu (Cognition).Meet the FoundersThe founders, Catheryn Li and Zach Kamran, previously led software teams at Y Combinator, where they launched numerous products for YC founders, including the co-founder matching site. Cat brings experience from Instagram and Facebook and holds degrees in mathematics and computer science from MIT. Zach studied computer science and statistics at the University of Chicago and led software initiatives at PEAK6.What We ProvideCompetitive salary with significant equity opportunitiesEmployee-friendly equity termsOffice located in San FranciscoComprehensive health, dental, and vision insuranceComplimentary lunch and dinner at the officeFree Uber rides home after 8:30 PMRegular team events and offsite activitiesYour Qualifications4+ years of relevant software engineering experience.Exceptional problem-solving skills and a passion for tackling technical challenges.A customer-centric mindset coupled with strong communication skills and a keen interest in product design and prioritization.Self-motivated with the ability to work independently.A genuine enthusiasm for continual learning and keeping abreast of the latest advancements in AI.

Oct 8, 2025
Apply
companyOpenAI logo
Full-time|Hybrid|San Francisco

About the TeamJoin the AI Success Engineering team, where we transform innovative AI technologies into impactful enterprise solutions. Collaborating closely with our clients, we guide them from initial experimentation to substantive real-world transformations, focusing on adoption, technical preparedness, and long-term value realization. Our team is pivotal in accelerating the deployment of AI products in production, ensuring they deliver measurable business results. We work in synergy with Sales, Solutions Architecture, Technical Success, and Product teams to safely and successfully bring leading-edge AI solutions to market.About the RoleWe are in search of a seasoned technical leader to oversee and expand a high-performing team of AI Success Engineers. This team is responsible for ensuring post-sale technical success for ChatGPT Enterprise, facilitating customer onboarding, activation, and adoption through structured programs, enablement initiatives, and change-management strategies.AI Success Engineers are highly technical and hands-on, assisting customers with advanced capabilities including connectors, Codex, custom GPTs, and other cutting-edge features upon their release. They collaborate closely with client stakeholders to guarantee effective deployment, widespread adoption, and tangible business outcomes.In this leadership role, you will shape team strategy, ensure robust execution and technical excellence, and create frameworks that promote scalability, consistency, and operational superiority—while maintaining close connections with customers and the field.This position is based in San Francisco and follows a hybrid work model, requiring three days per week in the office. Some regional travel will be required.Key ResponsibilitiesDevelop and implement the strategy and operational framework for the AI Success Engineering team, ensuring alignment with OpenAI's goals and customer requirements.Recruit, lead, mentor, and nurture a high-performing team of AI Success Engineers with solid technical capabilities and a focus on customer impact.Oversee the successful post-sale adoption of OpenAI products across various sectors, including enterprises, digital-native companies, and high-growth organizations.Ensure technical readiness and effective adoption of advanced OpenAI capabilities, collaborating closely with clients on practical applications.Advocate for customer perspectives to inform product development and commercial strategies.Establish operational rhythms, such as leadership updates and knowledge-sharing platforms, to enhance team effectiveness.

Feb 18, 2026
Apply
companyPerplexity logo
Full-time|On-site|San Francisco

About the RoleWe are seeking a talented Inference Engineering Manager to spearhead our AI Inference team at Perplexity. This is a remarkable opportunity to design and expand the infrastructure that drives Perplexity's innovative products and APIs, catering to millions of users with cutting-edge AI capabilities.You will take charge of the technical direction and implementation of our inference systems while cultivating and leading a high-caliber team of inference engineers. Our technology stack encompasses Python, PyTorch, Rust, C++, and Kubernetes. You will play a crucial role in architecting and scaling the large-scale deployment of machine learning models for Perplexity's Comet, Sonar, Search, and Deep Research products.Why Perplexity?Develop state-of-the-art systems that are among the fastest in the industry using leading-edge technology.Engage in high-impact work within a smaller team, enjoying considerable ownership and autonomy.Seize the chance to create infrastructure from the ground up instead of maintaining outdated systems.Work across the entire spectrum: minimizing costs, scaling traffic, and advancing the capabilities of inference.Make a significant impact on the technical roadmap and team culture at a rapidly expanding company.ResponsibilitiesLead and nurture a high-performing team of AI inference engineers.Develop APIs for AI inference utilized by both internal and external clients.Design and scale our inference infrastructure for enhanced reliability and efficiency.Benchmark and resolve bottlenecks across our inference stack.Drive large sparse/MoE model inference at rack scale, including sharding strategies for extensive models.Innovate by developing inference systems that support sparse attention and disaggregated pre-fill/decoding serving.Enhance the reliability and observability of our systems and lead incident response efforts.Make technical decisions regarding batching, throughput, latency, and GPU utilization.Collaborate with ML research teams on model optimization and deployment.Recruit, mentor, and develop engineering talent.Establish team processes, engineering standards, and operational excellence.Qualifications5+ years of engineering experience, with at least 2 years in a technical leadership or management capacity.Proficiency in programming languages and tools such as Python, PyTorch, Rust, and C++.Experience with Kubernetes and cloud infrastructure.Strong understanding of machine learning model deployment and optimization.Exceptional problem-solving and communication skills.

Jan 18, 2026
Apply
company
Full-time|On-site|San Francisco

Responsibilities:Develop Our Product. Take ownership of software features that enhance our AI platform, empowering private market investors to analyze deals swiftly and accurately. This role requires you to actively engage in coding and delivering essential platform components from the ground up.Ensure Quality and Excellence. Establish and maintain coding standards, architectural guidelines, and review procedures; mentor fellow engineers. Co-cultivate a culture centered on practical, reliable, and test-driven development that swiftly meets business requirements.Scale for Growth. Assist in monitoring and scaling the architecture of our platform and its infrastructure.Collaborate Across Functions. Engage closely with product, security, and go-to-market teams to translate our strategic roadmap into actionable features and deliverables.

Sep 19, 2025

Sign in to browse more jobs

Create account — see all 12,785 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.