Performance Engineer Immediate Openings For Local Candidates jobs in San Francisco – Browse 5,289 openings on RoboApply Jobs

Open roles matching “Performance Engineer Immediate Openings For Local Candidates” with location signals for San Francisco. 5,289 active listings on RoboApply Jobs.

usm2
Contract|On-site|San Francisco

We are seeking a talented Performance Engineer to join our dynamic team at usm2. This is an exciting opportunity for local professionals who are passionate about optimizing system performance and enhancing user experience. As a Performance Engineer, you will play a crucial role in analyzing performance metrics, identifying bottlenecks, and implementing solutions to ensure our applications run smoothly and efficiently.

May 18, 2017

Candid Health
Full-time|$135K/yr - $175K/yr|On-site|San Francisco (CA), Denver (CO), New York (NY)

Join Candid Health as an Engineering Recruiter, where you will take on a pivotal role as a strategic partner and advisor to our Hiring Managers. Your collaboration will be essential in identifying their talent requirements, crafting effective recruitment strategies, and attracting exceptional candidates for technical roles. You will also engage in strategic recruiting initiatives aimed at refining our hiring processes and contribute to the establishment of a top-tier recruiting team.

Your Responsibilities
- Manage the complete recruitment cycle for both individual contributor and leadership engineering roles, encompassing candidate sourcing, screening, interviewing, and extending offers while ensuring a smooth and positive experience for candidates.
- Work closely with Hiring Managers to thoroughly understand their talent needs and devise recruitment strategies that align with overall business goals.
- Identify and implement opportunities to enhance and streamline recruitment processes, driving initiatives that boost efficiency, effectiveness, and candidate quality.
- Cultivate and sustain strong relationships with candidates, Hiring Managers, and cross-functional stakeholders.

Your Profile
- A minimum of 4 years of full-cycle recruiting experience in a fast-paced environment.
- Proven track record in developing pipelines and formulating top-of-funnel strategies.
- Exceptional communication and interpersonal skills, enabling you to develop strong relationships with candidates, colleagues, and stakeholders across all levels.
- A humble, team-oriented approach that prioritizes collective success and values collaboration and diverse perspectives to meet shared objectives.
- Proactive and resilient, with a commitment to achieving excellence in all aspects of your work.
- Experience with common recruiting tools and systems, such as Ashby, to enhance productivity.
- A data-driven mindset, utilizing metrics to inform your hiring strategies and assist Hiring Managers in the same.

Location
Candid Health operates as an in-person team; this position can be based in any of our three locations: Denver, New York, or San Francisco.

Compensation Transparency
The estimated annual salary range for this role is $135,000 - $175,000 USD.

Jan 5, 2026

OpenAI
Full-time|On-site|San Francisco

Role overview
The Performance Modeling Engineer II position at OpenAI centers on building and applying performance models to enhance the efficiency of advanced AI systems. Based in San Francisco, this role contributes to the reliability and speed of OpenAI’s technologies.

What you will do
- Develop and implement performance models for AI systems
- Collaborate with data scientists and engineers to refine performance metrics
- Support the efficiency and rigorous standards of OpenAI’s technologies

Apr 20, 2026

Crusoe
Full-time|On-site|San Francisco, CA - US

Join Crusoe as a Senior Systems Performance Engineer, where you will play a crucial role in optimizing and enhancing our systems for superior performance. You will be responsible for diagnosing performance bottlenecks, implementing solutions, and ensuring that our infrastructure can scale efficiently. Work in a dynamic environment that encourages innovation and professional growth.

Mar 18, 2026

Candid Health
Full-time|$209K/yr - $283K/yr|On-site|San Francisco

About Candid Health
Candid Health is dedicated to transforming the healthcare landscape by addressing one of its most intricate and expensive challenges: the billing and revenue cycle management (RCM) process. The healthcare sector has been hampered by sluggish, inefficient workflows that squander precious resources, ultimately detracting from providers' ability to focus on patient care. Our innovative revenue cycle automation platform is set to revolutionize this domain with a smart, data-driven methodology that streamlines billing, enhances claims processing, and eradicates administrative inefficiencies.

The Opportunity
As a Staff Engineer at Candid Health, you will have the opportunity to tackle our most challenging infrastructure issues, developing the essential frameworks needed for rapid scalability and to meet surging customer demand.

In this position, you will take significant ownership of resolving technical bottlenecks related to scalability, performance, reliability, and observability across distributed systems. You will be empowered to make strategic decisions regarding build versus buy, guiding technical discussions and architecting solutions that enhance the reliability and performance of our core platform.

Moreover, you will participate in leadership discussions, allowing you to advocate for your team, influence project prioritization on the roadmap, and stay informed about developments across the engineering organization.

We seek an individual who has successfully led high-stakes, business-critical technical projects to swiftly scale an organization's infrastructure, significantly impacting business success. The ideal candidate will have experience in a startup environment where the product achieved maturity, yet the technology required rapid scaling to meet customer needs, and you were integral in facilitating that growth.

Jul 30, 2025

Lemurian Labs
Full-time|On-site|SF Bay Area

About Us
At Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI.

We are in the process of developing a state-of-the-art, portable compiler that empowers developers to 'build once, deploy anywhere.' This technology ensures seamless cross-platform integration, allowing for model training in the cloud and deployment at the edge, all while maximizing resource efficiency and scalability.

If you are passionate about scaling AI sustainably and are eager to make AI development more powerful and accessible, we invite you to join our team at Lemurian Labs. Together, we can build a future that is innovative and responsible.

The Role
We are seeking a Senior ML Performance Engineer to take charge of designing and leading our Performance Testing Platform from inception. In this pivotal role, you will be recognized as the technical expert in measuring, validating, and enhancing the performance of large language models (including Llama 3.2 70B, DeepSeek, and others) prior to and following compiler optimization on cutting-edge GPU architectures.

This is a critical position that will significantly impact our product quality and customer success. You will work at the intersection of Machine Learning systems, GPU architecture, and performance engineering, constructing the infrastructure that substantiates the value of our compiler.

Oct 31, 2025

Orchard
Full-time|Remote|San Francisco

Join Orchard as a Senior Robotics Software Engineer specializing in Perception & Localization. In this pivotal role, you will leverage your expertise in robotics software development to enhance our cutting-edge solutions. Collaborate with a dynamic team to innovate and optimize perception algorithms and localization techniques, driving advancements in our robotic systems.

Mar 15, 2026

OpenAI
Full-time|Remote|San Francisco

OpenAI is seeking a Performance Modeling Engineer based in San Francisco. This role centers on building and improving models that enhance the performance and efficiency of AI systems. The work directly supports the technical backbone of OpenAI’s products.

Key responsibilities
- Develop and refine models aimed at optimizing the performance of AI systems.
- Collaborate with engineers and data scientists to tackle technical challenges as they arise.
- Contribute to projects that improve the efficiency of large-scale AI infrastructure.

Role overview
This position offers the chance to work on foundational technology that underpins OpenAI’s products. The focus is on practical improvements and close teamwork with technical colleagues to advance the capabilities and efficiency of AI at scale.

Apr 20, 2026

Orchard
Full-time|On-site|San Francisco

Join Orchard as a Robotics Software Engineer specializing in Perception and Localization. In this dynamic role, you will be at the forefront of developing cutting-edge robotics software solutions that enhance our autonomous systems. Collaborate with a passionate team of engineers and researchers to push the boundaries of technology and innovation.

Mar 13, 2026

OpenAI
Full-time|On-site|San Francisco

Join Our Team
At OpenAI, our mission is to harness artificial general intelligence in a way that benefits everyone globally. A significant portion of our user base engages with our products in various languages, necessitating a seamless experience across diverse languages, regions, and cultures.

The Internationalization team is dedicated to creating the foundational infrastructure that allows OpenAI products to be launched globally by default. We focus on developing systems that facilitate localization, enable international product launches, and ensure high-quality user experiences worldwide.

Your Role
As a Senior Software Engineer on the Internationalization team, you will play a vital role in constructing the systems that enable localization and international product launches for OpenAI. You’ll work on platforms that manage product content, oversee translation workflows, and support localization infrastructure across all OpenAI products.

This position lies at the intersection of AI systems, developer platforms, and product infrastructure.

Key Responsibilities
- Develop and enhance OpenAI’s localization, content, and experimentation platform utilized across product teams, including open-source components:
  - Create AI-driven translation pipelines integrated with human-in-the-loop review processes.
  - Design robust systems that consistently deliver localized content for web and mobile applications.
  - Build tools that empower linguists and localization teams to review and refine translations.
  - Develop developer tools that streamline localization and internationalization workflows.
- Create and maintain internationalization libraries for use across OpenAI products:
  - Design systems capable of accurately processing numbers, currencies, dates, and pluralization for various locales.
  - Enhance support for multilingual interfaces, including right-to-left languages.
- Collaborate with product teams to ensure new features are internationally ready.

Who You Are
- You possess strong software engineering experience in building backend or full-stack systems.
- You are familiar with Java, React, MySQL, and cloud infrastructure patterns such as object storage, containerized services, and Kubernetes.

Mar 18, 2026

OpenAI
Full-time|On-site|San Francisco

Role Overview
OpenAI is hiring a ChatGPT Performance Engineer in San Francisco. This role focuses on improving the performance and efficiency of ChatGPT’s advanced AI models. The position works closely with cross-functional teams to identify and implement solutions that make ChatGPT faster and more reliable for users around the world.

What You Will Do
- Optimize the speed, reliability, and scalability of ChatGPT’s platforms.
- Collaborate with engineers and other teams to solve technical challenges.
- Develop and refine systems to support a seamless user experience globally.

Impact
This work directly shapes the future of AI at OpenAI, helping deliver a dependable and efficient ChatGPT experience to millions of users.

Apr 15, 2026

Waabi Inc.
Full-time|On-site|San Francisco, CA

Join Waabi as a Senior / Staff Software Engineer specializing in Localization, where you'll play a pivotal role in enhancing our products for diverse global markets. You will be responsible for designing and implementing innovative software solutions that facilitate language and cultural adaptations across our platforms.

Apr 4, 2026

Genmo
Full-time|On-site|San Francisco HQ

At Genmo, we are at the forefront of advancing artificial intelligence through innovative research in video generation. Our mission is to construct open, cutting-edge models that will ultimately contribute to the realization of Artificial General Intelligence (AGI). As part of our dynamic team, you will play a pivotal role in redefining the future of AI and expanding the horizons of video creation.

We are looking for a skilled GPU Performance Engineer who can extract maximum performance from our H100 infrastructure and fine-tune our model serving stack to achieve unparalleled efficiency. If you are passionate about optimizing performance, particularly at the microsecond level, and thrive on pushing hardware to its limits, this is the perfect opportunity for you.

Key Responsibilities
- Utilize advanced profiling tools such as Nsight Systems and nvprof to analyze and enhance GPU workloads.
- Develop high-performance CUDA and Triton kernels to optimize essential model functions.
- Reduce cold start latency from seconds to mere milliseconds in our serving infrastructure.
- Optimize memory access patterns, implement kernel fusion, and maximize GPU utilization.
- Collaborate closely with machine learning engineers to optimize model implementations.
- Diagnose and resolve performance issues throughout the application and hardware stack.
- Implement custom memory pooling and allocation strategies to enhance performance.
- Promote performance optimization techniques and foster a culture of excellence across teams.

Jul 17, 2025

LangChain
Full-time|On-site|San Francisco, CA

About Us:
At LangChain, we are on a mission to make intelligent agents commonplace. Our platform serves as the backbone for agent engineering in real-world applications, enabling developers to transform their prototypes into production-ready AI agents that organizations can depend on. Initially recognized for our widely adopted open-source tools, we have evolved to also provide a comprehensive platform for building, evaluating, deploying, and managing agents at scale.

Today, our solutions, including LangChain, LangGraph, LangSmith, and Agent Builder, are utilized by teams delivering genuine AI products across both startups and large enterprises. Millions of developers rely on LangChain to empower AI teams at prominent companies such as Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.

With $125 million raised in Series B funding from leading investors including IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are in an exciting phase of product development, experiencing rapid growth where every team member can significantly influence our projects and collaborative work culture. At LangChain, your contributions can reshape how technology manifests in the real world.

About the Role:
We are seeking a dedicated core maintainer to join the LangChain team. This individual will play a crucial role in enhancing and maintaining the LangChain JavaScript package. Preference will be given to candidates who can work in person in San Francisco.
- Enhancing the core abstractions and runtime of the langchain and langgraph packages
- Improving and expanding documentation
- Responding to user inquiries and resolving issues effectively
- Utilizing langchain to develop example applications

Success Factors:
- Minimum of 3 years of experience in software engineering or applied machine learning
- Strong background in Software Engineering
- Exceptional written and verbal communication skills, capable of conveying technical concepts to both technical and non-technical audiences
- Ability to thrive in a fast-paced environment and view unstructured settings as opportunities to define impactful work
- Ownership mindset; proficient at managing tasks independently and effectively

Feb 10, 2025

OpenAI
Full-time|Hybrid|San Francisco

About Our Team
The Training Runtime team is at the forefront of developing a cutting-edge distributed machine learning training runtime, enabling everything from pioneering research to large-scale model deployments. Our mission is to empower researchers while facilitating growth into frontier-scale operations. We are crafting a cohesive, modular runtime that adapts to researchers’ evolving needs as they progress along the scaling curve.

Our focus is anchored in three key areas: optimizing high-performance, asynchronous data movement that is aware of tensor and optimizer states; building robust, fault-tolerant training frameworks that incorporate comprehensive state management, resilient checkpointing, deterministic orchestration, and advanced observability; and managing distributed processes for enduring, job-specific, and user-defined workflows.

We aim to seamlessly integrate proven large-scale capabilities into a developer-friendly runtime, enabling teams to iterate rapidly and operate reliably across various scales. Our success is gauged by both the enhancement of training throughput (the speed of model training) and researcher throughput (the pace at which ideas transform into experiments and products).

About the Role
As a Training Performance Engineer, you will be instrumental in driving efficiency enhancements throughout our distributed training architecture. Your responsibilities will include analyzing extensive training runs, pinpointing utilization gaps, and engineering optimizations that maximize throughput and system uptime. This position merges a profound understanding of systems with practical performance engineering—analyzing GPU kernel performance, collective communication throughput, and investigating I/O bottlenecks, while also implementing model sharding techniques for large-scale training.

Your efforts will ensure our clusters operate at peak performance, enabling OpenAI to develop larger and more sophisticated models within existing compute budgets.

This position is located in San Francisco, CA, utilizing a hybrid work model with three days in the office each week, and we offer relocation assistance for new hires.

Key Responsibilities:
- Analyze end-to-end training runs to detect performance bottlenecks across computation, communication, and storage.
- Enhance GPU utilization and throughput for large-scale distributed model training.
- Collaborate with runtime and systems engineers to boost kernel efficiency, scheduling, and collective communication performance.
- Implement model graph transformations to enhance overall throughput.
- Develop tools for monitoring and visualizing metrics such as MFU, throughput, and uptime across clusters.

Oct 16, 2025

LanceDB
Full-time|On-site|HQ

About LanceDB
LanceDB is an innovative, developer-centric, open-source database designed for multimodal AI applications. We provide robust solutions ranging from hyper-scalable vector search capabilities to advanced retrieval for Retrieval-Augmented Generation (RAG). LanceDB is your ideal partner for creating AI applications, enabling seamless interaction with large-scale AI datasets and powering some of the most cutting-edge applications across various industries.

About the Role
We are seeking a Senior Open Source Engineer to enhance the presence of LanceDB within the extensive data infrastructure ecosystem. You will engage in projects that sit at the convergence of high-performance computing, big data, and open-source systems. Your contributions will drive integrations, optimize distributed operations, and support initiatives within the Apache and AI communities.

Key Responsibilities
- Lead open-source community initiatives to integrate the Lance format with systems such as Spark, Hive Metastore, Presto, Trino, and Ray.
- Design and sustain efficient distributed operations for Lance datasets.
- Develop optimized indices to facilitate predicate pushdown and enhance query performance in Spark, Ray, or Trino.
- Engage in the development of table formats, data encodings, and various components of the Lance format using Rust.
- Manage and enhance internal data processing infrastructure.
- Advocate for the Lance format in open-source forums and at major Big Data conferences.

Requirements
- Over 10 years of experience in developing high-performance databases, big data systems, or large-scale data services.
- In-depth knowledge of the internal workings of open-source Big Data or AI training systems such as Hadoop, Spark, Flink, Ray, Iceberg, Delta Lake, Hudi, ClickHouse, Trino, Presto, PyTorch, or JAX.
- Extensive experience with high-performance computing using Java or Scala.
- Familiarity with Rust is preferred, or a strong willingness to learn.
- Demonstrated ability to work efficiently, independently, and collaboratively within a high-caliber team environment.

Preferred Qualifications
- Active contributor, committer, or PMC member in Apache or other significant open-source projects.
- Experience with Java, Rust, C++, Apache Arrow, DataFusion, Parquet, Iceberg, or Delta Lake is a plus.

Oct 25, 2025

LangChain
Full-time|$160K/yr - $225K/yr|On-site|San Francisco, CA

About Us:
At LangChain, we are on a mission to revolutionize the world of intelligent agents. Our innovative platform empowers developers to transition from initial prototypes to production-ready AI agents that can be trusted by their teams. Originating from widely embraced open-source tools, we have since expanded to offer a comprehensive suite for building, assessing, deploying, and managing AI agents at scale.

Today, our tools (LangChain, LangGraph, LangSmith, and Agent Builder) are utilized by teams delivering genuine AI solutions across both startups and major enterprises. Millions of developers rely on LangChain to enhance AI capabilities for companies such as Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.

With a successful $125M Series B funding from esteemed investors including IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are in an exciting phase of product development and growth. Every team member has a significant opportunity to influence the technology we create and how we collaborate. At LangChain, your contributions will directly shape the future of AI.

About the Role:
On-site 5 days/week in San Francisco, CA or Boston, MA (San Francisco preferred)

We are seeking a dedicated core maintainer to join our LangChain team. This individual will be instrumental in enhancing and maintaining the LangChain Python package.
- Enhancing the core abstractions and runtime of the langchain and langgraph packages
- Improving our documentation
- Providing answers to user inquiries and resolving issues
- Utilizing langchain to develop example applications

Success Factors for This Role:
- 5+ years of experience in software engineering or applied machine learning
- Excellent written and verbal communication skills, capable of conveying technical concepts clearly to both technical and non-technical stakeholders
- Ability to thrive in a fast-paced environment, viewing unstructured situations as opportunities to identify impactful work and define the company’s future success
- Ownership mentality, demonstrating a proactive approach to work

Feb 10, 2025

Nash
Full-time|On-site|San Francisco

Senior Infrastructure & Performance Engineer
As a Senior Infrastructure & Performance Engineer, you will take charge of enhancing the performance, reliability, and scalability of Nash's foundational infrastructure. Collaborating closely with the Engineering Leadership and both platform and product engineering teams, you will design and manage low-latency, mission-critical systems that facilitate real-time logistics for some of the world's largest retailers.

This is a key senior role focused on elastic capacity, high availability, cloud-native architectures, Postgres performance, and enterprise-grade CI/CD for multi-region deployments. You will define the technical roadmap, establish best practices, and implement systems that support the essential workflows of major retailers.

Key Responsibilities
- Oversee infrastructure performance and reliability for Nash's production environments, ensuring low latency, high throughput, and consistent performance under load.
- Design, develop, and enhance AWS infrastructure, utilizing managed services with a focus on ECS/Fargate.
- Lead initiatives in Postgres performance engineering, including query optimization, indexing strategies, connection management, replication, cluster design, and failover.
- Architect and maintain multi-region, highly available systems with robust resiliency and guaranteed disaster recovery.
- Design and refine enterprise-grade CI/CD pipelines that enable safe, repeatable, and rapid deployments across environments and regions.
- Establish observability standards (metrics, logs, tracing, SLOs) to proactively identify and resolve performance bottlenecks.
- Collaborate with application engineers to inform system design choices that influence scalability, latency, and reliability.
- Lead incident response efforts and postmortems, emphasizing root cause analysis, systemic improvements, and long-term resilience.
- Set best practices for infrastructure and performance while mentoring engineers throughout the organization.

Qualifications
- 6+ years of experience in building and managing high-scale production infrastructure for mission-critical systems.
- Proficiency with AWS, particularly with ECS/Fargate, and experience with cloud-native architecture.
- Strong background in Postgres performance tuning and optimization.
- Deep understanding of CI/CD practices and experience in multi-region deployments.
- Exceptional analytical and problem-solving skills, with a proactive approach to performance management.

Jan 6, 2026

ClickUp
Full-time|On-site|United States of America

At ClickUp, we're not just developing software; we're shaping the future of work! In an era dominated by work sprawl, we identified a more efficient way. This led us to create the first truly integrated AI workspace, consolidating tasks, documents, chat, calendar, and enterprise search, all enhanced by context-driven AI. Our mission is to empower millions of teams to escape silos, reclaim their time, and reach unprecedented levels of productivity. At ClickUp, you'll have the chance to learn, innovate, and leverage AI in transformative ways that will not only influence our product but also the broader landscape of work itself. Join a daring, pioneering team that's challenging the limits of what's possible!

We are on the lookout for a technical leader in SaaS client performance who is passionate about enhancing the customer experience through top-tier performance solutions. As a Senior Performance Engineer, you will spearhead comprehensive strategies to optimize application speed, memory utilization, and reliability across our entire platform. You will be empowered to analyze, diagnose, and address performance bottlenecks wherever they arise, be it front-end, back-end, or infrastructure, ensuring ClickUp remains the fastest and most reliable productivity platform available.

The ideal candidate is a hands-on authority in browser and NodeJS performance, with a thorough understanding of how code influences rendering, memory management, and overall user experience. You excel in solving intricate challenges, collaborating across teams, and establishing new benchmarks for performance excellence. If you're driven to make a significant impact for millions of users, this is your chance to lead at scale.

Your Responsibilities:
- Conduct root cause analysis on client performance issues and perform post-mortems.
- Profile application code to identify inefficient algorithms, memory leaks, and other issues; propose and implement effective solutions.
- Establish performance monitoring, alerting, and dashboards to proactively detect and resolve client performance challenges.
- Examine client traffic patterns, load testing outcomes, and other metrics to set benchmarks and drive enhancements.
- Champion performance best practices and set performance standards across the engineering organization.
- Identify infrastructure upgrades (caching, CDNs, database optimization) to elevate the client experience.
- Collaborate with development teams to incorporate performance as a core requirement in the development of new features.

Dec 22, 2025

Databricks
Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California

At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world. We achieve this by creating and managing a leading data and AI infrastructure platform that enables our clients to leverage deep data insights for business enhancement. Our commitment to pushing the limits of data and AI technology is matched by our focus on resilience, security, and scalability, which are essential for our customers' success on our platform. Databricks operates one of the largest-scale software platforms, comprising millions of virtual machines that generate terabytes of logs and process exabytes of data daily. Given our scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must adeptly protect our customers from these issues. As a Senior Performance Engineer, you will collaborate with various teams throughout the organization to assess product and feature performance, pinpoint performance bottlenecks, and partner with engineers to address performance and scalability challenges. This includes setting performance goals for different software releases, guiding teams in developing performance benchmarks, conducting competitive benchmark analyses for various Databricks products, and performing in-depth analyses to identify and resolve performance issues.

Jan 30, 2026
