Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Entry Level
Qualifications
Strong background in software engineering or a related field. Experience with performance tuning and optimization techniques. Proficient in programming languages such as Python, Java, or C++. Familiarity with machine learning frameworks and tools. Excellent problem-solving skills and ability to work collaboratively in a team environment.
About the job
OpenAI is hiring a ChatGPT Performance Engineer in San Francisco. This role focuses on improving the performance and efficiency of ChatGPT’s advanced AI models. The position works closely with cross-functional teams to identify and implement solutions that make ChatGPT faster and more reliable for users around the world.
What You Will Do
Optimize the speed, reliability, and scalability of ChatGPT’s platforms.
Collaborate with engineers and other teams to solve technical challenges.
Develop and refine systems to support a seamless user experience globally.
Impact
This work directly shapes the future of AI at OpenAI, helping deliver a dependable and efficient ChatGPT experience to millions of users.
About OpenAI
OpenAI is an innovative research organization dedicated to advancing artificial intelligence in a safe and beneficial manner. Our team comprises exceptional talent from various domains, committed to pushing the boundaries of what is possible with AI technology. We foster a culture of creativity, collaboration, and continuous learning, making OpenAI an exciting place to build a career.
Role Overview OpenAI is hiring a ChatGPT Performance Engineer in San Francisco. This role focuses on improving the performance and efficiency of ChatGPT’s advanced AI models. The position works closely with cross-functional teams to identify and implement solutions that make ChatGPT faster and more reliable for users around the world. What You Will Do Optimize the speed, reliability, and scalability of ChatGPT’s platforms. Collaborate with engineers and other teams to solve technical challenges. Develop and refine systems to support a seamless user experience globally. Impact This work directly shapes the future of AI at OpenAI, helping deliver a dependable and efficient ChatGPT experience to millions of users.
Join Our Innovative TeamAt ChatGPT Engineering, we are dedicated to crafting the user interfaces that millions rely on daily. Our mission is to enhance AI capabilities through seamless, polished, and trustworthy experiences for our users. As a Frontend Engineer, you will play a pivotal role, collaborating with design, product, research, and backend teams to deliver intuitive and reliable features that scale globally.Our diverse teams focus on areas such as Growth, Ecosystems, Personalization, Search & Knowledge Experiences, and Platform UI. Together, we build foundational UI and product experiences that empower users to discover, trust, and gain value from ChatGPT.Your RoleWe are seeking talented Frontend Engineers to create cutting-edge product experiences for ChatGPT. You will focus on impactful user flows, UI architecture, and performance, delivering features that transform advanced AI into a simple, responsive, and delightful experience for users.Key ResponsibilitiesDesign and implement high-quality product experiences across web platforms.Collaborate with design and product teams to turn innovative ideas into stunning, accessible UIs.Lead frontend architecture decisions, including component systems, state management, routing, and client-side performance optimizations.Enhance reliability and quality through effective testing, monitoring, and tooling.Work closely with backend and data teams to integrate APIs and deliver comprehensive end-to-end features.Contribute to the long-term technical vision and participate in design and architecture reviews.Who You ArePossess strong frontend development skills using modern web frameworks (e.g., React, TypeScript) and have experience building scalable production UIs.Exhibit a passion for performance, accessibility, and craftsmanship, paying attention to detail while maintaining speed.Enjoy collaborating across the tech stack and working closely with design and product teams.Thrive in fast-paced environments and embrace ambiguity, all while upholding high quality standards.LocationWe welcome applicants from San Francisco, New York, and Seattle.About OpenAIOpenAI is at the forefront of AI research and deployment, committed to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the limits of AI capabilities, striving for innovations that improve lives globally.
Join Our Innovative TeamThe ChatGPT team at OpenAI is at the forefront of our mission to innovate across core domains, including Growth, Personalization, and Search Infrastructure. We are expanding our talented teams to create cutting-edge experiences, tools, and systems that enhance ChatGPT's functionality across all user platforms.Your Role as an Android EngineerWe are seeking a skilled Android Software Engineer to take the lead in developing exciting new features for the ChatGPT Android application.Key Responsibilities:Craft and implement innovative features and functionalities for the ChatGPT Android app.Set and uphold rigorous engineering standards focusing on performance, reliability, and code quality.Collaborate with cross-functional teams to design user-centric experiences.Influence technical decisions and establish a robust long-term architecture for mobile platforms.Provide mentorship and guidance to peers through code reviews and shared ownership.Ideal Candidate Attributes:Proficient in Android development using Kotlin/Java, modern architectural patterns, and Jetpack.A passion for enhancing mobile UX and optimizing system performance.Thrives in dynamic, collaborative environments.We are open to candidates in San Francisco, New York, and Seattle.About OpenAIOpenAI is a pioneering AI research and deployment organization dedicated to ensuring that general-purpose artificial intelligence serves the interests of humanity. We actively push the boundaries of AI capabilities while prioritizing safety and human-centric design in all our products. Our commitment to inclusivity drives our mission to incorporate diverse perspectives, experiences, and voices, reflecting the full spectrum of humanity.We are proud to be an equal opportunity employer, embracing diversity and ensuring that all individuals are treated fairly, regardless of race, religion, gender, or any other protected characteristic.
About the TeamAt ChatGPT, we are at the forefront of innovation, continuously enhancing our system with new capabilities and adapting to ever-evolving user needs. To sustain our rapid pace of development, we require a robust infrastructure capable of managing real-world production challenges, such as high concurrency and unpredictable traffic patterns.The mission of the ChatGPT Infrastructure team is to design and maintain the foundational platforms that facilitate swift iterations without compromising on performance or reliability. We create the shared systems, data pathways, rollout procedures, and reliability measures that enable teams to deploy changes to ChatGPT efficiently and at scale.Our focus is on high-impact infrastructure: we develop fundamental systems and streamlined processes that leverage hard-earned operational insights, ensuring that engineers do not have to repeatedly navigate similar challenges and pitfalls as they innovate.About the RoleWe are seeking experienced Senior and Staff Software Engineers to architect and construct the underlying infrastructure that supports ChatGPT, amplifying the productivity of teams working on user experience.This role transcends mere maintenance; it is about building platforms: you will define interfaces, develop essential abstractions, and create tools that promote safe and rapid iterations. Your contributions will lead to reduced friction, fewer regressions, enhanced performance, and systems that scale seamlessly as our product grows.Where You Can Make a DifferenceAs part of our team, you may engage with one or more of the following areas:Platform Foundations & Frameworks: Craft core libraries, service frameworks, and shared components that standardize system development and integration.Scalability & Performance Primitives: Develop patterns and infrastructure aimed at minimizing latency, boosting throughput, and maintaining cost efficiency as demand increases.Reliability Guardrails: Implement design mechanisms to prevent outages, including rate limiting, load shedding, and safe fallbacks.Developer Productivity via Golden Paths: Establish streamlined workflows that make common processes fast, safe, and user-friendly.Observability & Debugging Systems: Create instrumentation and metrics models to enhance debugging capabilities.
About Our Dynamic TeamJoin the Growth team at OpenAI, where we are dedicated to amplifying the reach and impact of our innovative AI products, including ChatGPT. Our mission is to optimize user acquisition, subscriptions, and engagement, ensuring millions of users experience lasting value from our offerings.In this role, you will collaborate cross-functionally with Product, Data Science, Design, and Marketing teams to develop growth-oriented features, enhance conversion funnels, personalize user experiences, and drive significant business outcomes. Through data analysis, experimentation, and AI-powered insights, we strive to create engaging and scalable user journeys.Key Responsibilities:Lead and grow a high-impact engineering team focused on initiatives that drive growth.Boost user growth and subscriptions by enhancing sign-up processes, onboarding experiences, and subscription models.Enhance user retention and engagement through the development of features that improve user experience and personalization.Conduct data-driven experiments (A/B testing, pricing strategies, referral programs) to refine key growth metrics.Work collaboratively with Product, Marketing, Design, and Data Science teams to align on growth strategies.Utilize AI insights to craft personalized user experiences that enhance conversion rates and lifetime value.Ensure engineering solutions are scalable and performant to support growth initiatives.
About Our TeamAt OpenAI, we are developing a robust Swift platform, from architecture to continuous integration, that empowers hundreds of engineers and their agents to innovate within ChatGPT, Atlas, Sora, and various other rapidly advancing applications. Our mission is to enable teams to expedite development while enhancing both performance and reliability.Role OverviewWe are seeking experienced Staff+ iOS engineers with diverse backgrounds in the iOS platform. We value expertise in performance optimization, build/CI systems, or enhancing AI developer productivity. Whether your specialty lies in UI frameworks or automated testing, your contributions will lead to reduced friction, minimized complexity, fewer regressions, and improved overall performance.If you are passionate about scaling Swift development alongside advanced AI tools, we invite you to apply for this exciting role.This position is open in San Francisco, New York, and Seattle.Key ResponsibilitiesDevelop and enhance foundational Swift frameworks utilized across multiple applications and platforms (caching, state management, observability, navigation, and component systems).Boost application performance (startup speed, responsiveness, memory efficiency, battery life) through profiling, increased visibility, regression prevention, and architectural patterns.Enhance reliability by systematically reducing crashes, improving error handling, and driving improvements in the release process.Collaborate closely with product engineering teams to identify pain points and emerging infrastructure requirements.Create internal tools and automation (Bazel, CI, testing frameworks, Codex agent skills) to accelerate engineering velocity.Simplify complex real-world constraints into clean abstractions through intuitive APIs, enforceable contracts, and safe defaults.Ideal Candidate ProfileStrong fundamentals in iOS engineering (Swift, concurrency, networking, Xcode ecosystem, familiarity with UIKit/SwiftUI) and an understanding of contemporary features and best practices.A penchant for tackling platform challenges: frameworks, architecture, performance, and tools that empower and accelerate other engineers.Demonstrated ownership: proactively identifying risks and opportunities, driving projects to completion, and adapting quickly.A strong focus on measurement: able to instrument, define metrics, conduct experiments, and iterate based on data-driven insights.
About Our TeamThe Technical Success team at OpenAI is pivotal in ensuring the seamless and secure deployment of ChatGPT and OpenAI API applications for both developers and enterprises. Acting as trusted advisors, we empower our customers to maximize the value derived from our innovative models and products.Within this team, the AI Deployment Engineering group is dedicated to supporting high-impact and strategic partners. We engage collaboratively to overcome technical challenges, provide in-depth expertise, and co-create groundbreaking ecosystem experiences that showcase the enhanced benefits of partnering with OpenAI.About the RoleWe are seeking an AI Deployment Engineer to play a crucial role in shaping the future of OpenAI’s partner ecosystem within the ChatGPT domain across various applications and commerce. As the primary technical resource for our strategic partners, you will lead solution design efforts and co-develop innovative experiences that fully leverage our platform's capabilities.You will provide expert technical guidance while collaborating with our Partnerships, Product, Engineering, and Go-To-Market teams, reporting directly to the Technical Success department.This position is based in our San Francisco or New York City office, utilizing a hybrid work model of three days in-office each week, with relocation assistance available for new hires.Key Responsibilities:Deliver outstanding partner experiences by providing technical expertise, scoping use cases, and aiding in the development of applications within ChatGPT and commerce checkout processes alongside technical stakeholders and strategic partners.Collaborate with our Partnerships team to design and support ecosystem integrations, ensuring technical feasibility and impactful launches.Advise partners on best practices for utilizing the OpenAI API and ChatGPT to develop secure, scalable, and unique experiences.Act as the initial point of contact for inquiries related to design, security, compliance, and architecture, escalating complex requirements to internal experts when necessary.Develop and sustain documentation, implementation guides, and FAQs addressing common partner requirements and technical hurdles.Collect partner feedback to represent the ecosystem's voice within internal teams, influencing product roadmaps and future partner initiatives.
Join Our Innovative TeamThe ChatGPT Ecosystem team plays a pivotal role in our mission to transform ChatGPT into a comprehensive super-assistant. Our ambition is to develop a platform that empowers both developers and end-users to enhance ChatGPT's capabilities, making it significantly more effective for a variety of real-world applications.We are constructing the groundwork for a vibrant ecosystem of applications, integrations, and tools that will expand the functionalities of ChatGPT. This team operates at the intersection of product engineering, platform design, and developer experience, collaborating closely with product management, research, security, and trust & safety teams to enable a global community of developers to innovate on top of ChatGPT.Your Role as a Full Stack EngineerIn the role of Full Stack Engineer on the ChatGPT Ecosystem team, you will contribute to the design and development of the platform that underpins ChatGPT Apps and developer integrations.Your responsibilities will span the entire stack—from user-friendly developer interfaces to robust backend systems—shipping features that simplify the process for developers to create, deploy, and scale applications within ChatGPT. You will collaborate closely with product managers, researchers, and other cross-functional partners to translate evolving model capabilities into effective, developer-friendly tools.We seek engineers who are passionate about building platforms, prioritize developer experience, and are eager to shape the foundational elements of a new application ecosystem.This position is based in San Francisco or New York City, operating under a hybrid work model with three days in the office weekly. Relocation assistance is available for new hires.Key ResponsibilitiesDevelop comprehensive platform features that support the ChatGPT Apps SDK and the broader developer ecosystem.Design and implement APIs, services, and interfaces enabling developers to create and integrate applications within ChatGPT.Enhance developer workflows, tooling, and documentation to ensure building on ChatGPT is intuitive and scalable.Collaborate with Product, Security, Trust & Safety, and Research teams to deliver high-quality, safe, and reliable platform capabilities.Iterate rapidly based on feedback from developers to continually enhance the platform and ecosystem experience.Contribute to defining the technical direction and architecture of the ChatGPT Ecosystem as it scales.Qualifications for SuccessA passion for building high-quality product and platform experiences.Strong experience in full-stack development with a focus on both frontend and backend technologies.Ability to collaborate effectively with cross-functional teams.A proactive approach to seeking feedback and iterating on product features.Commitment to enhancing developer experience and building intuitive tools.
About Our TeamThe ChatGPT team is at the forefront of innovation, blending research, engineering, product design, and user experience to extend OpenAI’s groundbreaking technology to users worldwide.As part of the Growth Partnerships team, our mission is to broaden reach, discover new channels for user acquisition, and create impactful integrations that bring ChatGPT to users wherever they are. We collaborate closely with external partners and internal teams to craft product experiences, APIs, and growth strategies that promote adoption while ensuring trust and safety are paramount.Our work is interdisciplinary, combining product development, engineering excellence, and business acumen to turn partnerships into sustainable growth engines for ChatGPT.About the RoleWe are in search of a skilled Full Stack Engineer to join our ChatGPT Growth Partnerships team. This role is pivotal in establishing the technical infrastructure that supports partner-driven growth. You will be responsible for developing comprehensive product experiences that encompass frontend applications, backend services, APIs, experimentation, and data analytics, all aimed at facilitating seamless integrations, onboarding, user activation, and monetization through our partners.This high-impact position is perfect for engineers who excel in dynamic, fast-paced environments, can take projects from conception to deployment, make informed product and technical decisions, and deliver results swiftly while adhering to high engineering standards.Key ResponsibilitiesDesign and implement full-stack product experiences that support partner integrations, onboarding processes, activation pathways, and growth opportunities.Create and maintain backend services and APIs to ensure scalable and secure partner interactions.Collaborate with product managers, partnership leads, designers, data scientists, and researchers to translate strategic objectives into tangible products.Lead experimentation initiatives, including A/B testing and metrics development, to analyze factors that drive user adoption, retention, and value through partnerships.Identify key leverage points where small technical enhancements can create significant growth opportunities.Establish best practices for building scalable and partner-friendly systems.Foster a culture of ownership, transparency, inclusivity, and constructive dialogue within the engineering team.You Will Excel in This Role If YouHave a proven track record of delivering full-stack features that drive user acquisition, activation, or monetization.Thrive in a collaborative environment where innovative ideas are welcomed and encouraged.Possess a knack for problem-solving and a passion for creating impactful user experiences.
About the TeamAt OpenAI, our mission is to harness the power of general-purpose artificial intelligence to benefit humanity as a whole. The ChatGPT team is dedicated to integrating our cutting-edge technology into the lives of users worldwide. We cater to a broad spectrum of individuals who rely on ChatGPT daily, whether in personal or professional contexts.We are committed to learning from our deployments and ensuring the responsible and safe use of AI technologies. Safety takes precedence over unregulated expansion.As a Product Designer, your role will be pivotal in crafting intuitive and accessible technology to fulfill our mission. We are looking for an innovative designer to develop user-friendly, aesthetically pleasing products that challenge the norms of design. Joining us at this formative stage means you will play a significant role in shaping our product vision and design ethos.About the RoleIn this position, you will collaborate with a dynamic team of product designers and cross-functional partners to create exceptional product experiences that address current and future user needs. You will oversee the comprehensive design process for new features and enhancements across a variety of OpenAI's consumer and business products. Your contributions will help guide our evolution as we advance our technology, products, and design philosophies.This position is located at our San Francisco headquarters. We provide relocation assistance for new hires.Key ResponsibilitiesInfluence the overall design and product strategy for OpenAI offerings.Design and launch high-quality products and enhancements, from initial concepts to polished prototypes and visuals.Collaborate closely with engineering, product management, AI researchers, and fellow designers to shape both long-term strategies and immediate initiatives.Conduct user research to deepen our understanding of user needs and refine our offerings.Contribute to the development and enhancement of our design system.Help cultivate a strong design culture and expand our team.Ideal Candidate Profile4+ years of experience delivering large-scale software products.A portfolio that demonstrates exceptional UX, UI, and interaction design capabilities, with a commitment to quality and craftsmanship.A passion for solving complex interaction design challenges.Strong communication and collaboration skills suited for a team environment.
About Our TeamAt OpenAI, our mission is to ensure that artificial intelligence benefits all of humanity. The ChatGPT for Work initiative is a crucial part of this mission, as it empowers individuals to harness the full potential of AI in their daily tasks. By minimizing time spent on repetitive duties and coordination, we enable our users to focus on meaningful and impactful work. We are developing an AI-driven workspace where AI serves as a super-assistant for routine tasks and as a collaborative partner that users can delegate work to, review, edit, and approve with confidence.Our approach ensures that organizations can trust our solutions, as we ground our experiences in the appropriate company context and systems, providing safety and reliability.Role OverviewAs a Data Scientist for ChatGPT for Work, you will play a pivotal role in shaping our product strategy through data insights. Your responsibilities will include identifying the most pressing user problems, developing sharp hypotheses to enhance team and business outcomes, and influencing future developments by presenting compelling, evidence-based recommendations. You will be the Directly Responsible Individual (DRI) for the insight → strategy → experiment → decision loop, defining success metrics for teams, identifying critical adoption and retention barriers, and translating signals into actionable product direction.Collaboration will be key, as you will work closely with Product, Engineering, Research, and Finance teams to ensure our metrics are reliable, our experimentation is thorough, and our insights lead to tangible product improvements.This position is based in San Francisco, utilizing a hybrid work model that requires three days in the office each week. We also offer relocation assistance to new employees.Key Responsibilities:Develop and own the core KPI framework for ChatGPT for Work, covering onboarding, activation, engagement, retention, and expansion, along with quality and trust metrics.Create end-to-end funnels to analyze where individuals and teams excel or encounter challenges, from initial workspace setup to sustained usage and long-term value creation.Define and operationalize metrics related to “time-to-value” and collaboration loops, linking them to significant business outcomes.Design and assess experiments and rollouts to measure the impact of product modifications across key workflows.Collaborate with product and engineering teams to enhance data instrumentation, quality, and metric definitions, ensuring rapid and accurate decision-making.Translate complex analyses into clear and persuasive insights that will help shape our product strategy and roadmap.
About Our TeamAt OpenAI, our Marketing team plays a crucial role in advancing our mission to promote the responsible and widespread adoption of artificial intelligence. We focus on creating and implementing strategies that enhance awareness, engagement, and utilization of our innovative products and platform among key audiences. By employing a data-driven methodology, we gain insights into our customers' needs and challenges, ensuring their perspectives are integrated into product development and messaging. Our collaborative efforts with Product, Engineering, Research, Communications, and Design teams aim to deliver a seamless customer experience across various channels. Beyond merely showcasing product features, we strive to provide valuable insights and resources that empower our users to maximize the benefits of AI technologies.About the RoleAs the Product Marketing Manager for ChatGPT for Work, you will spearhead marketing initiatives that drive adoption and engagement for one of the world's leading AI products. We are seeking a strategic marketer adept at crafting compelling narratives through a deep understanding of ChatGPT’s capabilities and the diverse needs of our user base. Reporting directly to the Head of Product Marketing, this role presents a unique opportunity to influence the roadmap and market positioning for products such as ChatGPT. The ideal candidate will be based in San Francisco, CA, with a hybrid schedule of three days in the office or can work remotely from the West Coast, U.S., with potential travel requirements.Your Responsibilities Will Include:Formulating and executing go-to-market strategies for new features and enhancements within ChatGPT for Work.Collaborating with product and research teams to simplify complex technical concepts into clear, user-focused messaging.Utilizing market research and competitive analysis to refine product positioning and pinpoint growth opportunities.Designing and overseeing marketing campaigns, content, and resources that foster engagement, adoption, and revenue growth.Assessing product usage data and user feedback to enhance marketing strategies and elevate user experience.You Would Excel in This Role If You Have:Over 10 years of experience in consumer or product marketing, with a strong emphasis on technology.A solid understanding of AI technologies and the ability to effectively communicate their value to users.
Role overview The Performance Modeling Engineer II position at OpenAI centers on building and applying performance models to enhance the efficiency of advanced AI systems. Based in San Francisco, this role contributes to the reliability and speed of OpenAI’s technologies. What you will do Develop and implement performance models for AI systems Collaborate with data scientists and engineers to refine performance metrics Support the efficiency and rigorous standards of OpenAI’s technologies
Join Crusoe as a Senior Systems Performance Engineer, where you will play a crucial role in optimizing and enhancing our systems for superior performance. You will be responsible for diagnosing performance bottlenecks, implementing solutions, and ensuring that our infrastructure can scale efficiently. Work in a dynamic environment that encourages innovation and professional growth.
About Our TeamThe Ecosystem team is pivotal in transforming ChatGPT into an exceptional super-assistant. We are constructing a robust platform that will empower millions of developers—and ultimately users—to enhance ChatGPT's functionalities through innovative tools, SDKs, and platform capabilities that foster a vibrant ecosystem of ChatGPT applications.Role OverviewWe are in search of a passionate Full-Stack Software Engineer who thrives at the intersection of front-end and back-end development, ready to deliver comprehensive features on the ChatGPT Apps platform and SDK. You will collaborate closely with Product, Security, Trust & Safety, Research, and fellow engineers to create exceptional developer experiences and broaden the capabilities of ChatGPT Apps.Key ResponsibilitiesDesign, develop, and deploy essential features within the ChatGPT Apps SDK and the larger platform.Work alongside Product Management, Security, Trust & Safety, Research, and Engineering to craft developer-centric experiences.Take ownership of features from conception through to delivery: guiding the design, technical strategy, implementation, launch, and iterative improvements.Develop scalable services and APIs while creating intuitive UI experiences that facilitate publishing, discovery, and enhanced app functionalities.Help establish a long-term technical vision—laying the groundwork for new product areas as they evolve.Contribute to the team's best practices regarding testing, architecture, and overall software development lifecycle (SDLC) quality.You Will Excel If YouAre a versatile full-stack engineer—confidently navigating between front-end and back-end to deliver holistic features (experience with React and Python is advantageous, but not mandatory).Have experience building developer tools or platforms (SDKs, integrations with tools like Slack, open-source libraries, or time spent at developer-focused companies).Possess a product-oriented mindset and relish collaborating with cross-functional teams to enhance user and developer experiences.Enjoy shipping features rapidly while maintaining high standards for quality, security, and performance.Are eager to create foundational platform capabilities that will scale alongside a fast-growing ecosystem.About OpenAIOpenAI is at the forefront of AI research and deployment, committed to ensuring that general-purpose artificial intelligence serves the greater good for humanity. We strive to push the boundaries of AI capabilities while prioritizing the safe and responsible deployment of these technologies in the real world.
About UsAt Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI.We are in the process of developing a state-of-the-art, portable compiler that empowers developers to 'build once, deploy anywhere.' This technology ensures seamless cross-platform integration, allowing for model training in the cloud and deployment at the edge, all while maximizing resource efficiency and scalability.If you are passionate about scaling AI sustainably and are eager to make AI development more powerful and accessible, we invite you to join our team at Lemurian Labs. Together, we can build a future that is innovative and responsible.The RoleWe are seeking a Senior ML Performance Engineer to take charge of designing and leading our Performance Testing Platform from inception. In this pivotal role, you will be recognized as the technical expert in measuring, validating, and enhancing the performance of large language models (including Llama 3.2 70B, DeepSeek, and others) prior to and following compiler optimization on cutting-edge GPU architectures.This is a critical position that will significantly impact our product quality and customer success. You will work at the intersection of Machine Learning systems, GPU architecture, and performance engineering, constructing the infrastructure that substantiates the value of our compiler.
OpenAI is seeking a Performance Modeling Engineer based in San Francisco. This role centers on building and improving models that enhance the performance and efficiency of AI systems. The work directly supports the technical backbone of OpenAI’s products. Key responsibilities Develop and refine models aimed at optimizing the performance of AI systems. Collaborate with engineers and data scientists to tackle technical challenges as they arise. Contribute to projects that improve the efficiency of large-scale AI infrastructure. Role overview This position offers the chance to work on foundational technology that underpins OpenAI’s products. The focus is on practical improvements and close teamwork with technical colleagues to advance the capabilities and efficiency of AI at scale.
We are seeking a talented Performance Engineer to join our dynamic team at usm2. This is an exciting opportunity for local professionals who are passionate about optimizing system performance and enhancing user experience. As a Performance Engineer, you will play a crucial role in analyzing performance metrics, identifying bottlenecks, and implementing solutions to ensure our applications run smoothly and efficiently.
At Genmo, we are at the forefront of advancing artificial intelligence through innovative research in video generation. Our mission is to construct open, cutting-edge models that will ultimately contribute to the realization of Artificial General Intelligence (AGI). As part of our dynamic team, you will play a pivotal role in redefining the future of AI and expanding the horizons of video creation.We are looking for a skilled GPU Performance Engineer who can extract maximum performance from our H100 infrastructure and fine-tune our model serving stack to achieve unparalleled efficiency. If you are passionate about optimizing performance, particularly at the microsecond level, and thrive on pushing hardware to its limits, this is the perfect opportunity for you.Key ResponsibilitiesUtilize advanced profiling tools such as Nsight Systems and nvprof to analyze and enhance GPU workloads.Develop high-performance CUDA and Triton kernels to optimize essential model functions.Reduce cold start latency from seconds to mere milliseconds in our serving infrastructure.Optimize memory access patterns, implement kernel fusion, and maximize GPU utilization.Collaborate closely with machine learning engineers to optimize model implementations.Diagnose and resolve performance issues throughout the application and hardware stack.Implement custom memory pooling and allocation strategies to enhance performance.Promote performance optimization techniques and foster a culture of excellence across teams.
About Our TeamThe Training Runtime team is at the forefront of developing a cutting-edge distributed machine learning training runtime, enabling everything from pioneering research to large-scale model deployments. Our mission is to empower researchers while facilitating growth into frontier-scale operations. We are crafting a cohesive, modular runtime that adapts to researchers’ evolving needs as they progress along the scaling curve.Our focus is anchored in three key areas: optimizing high-performance, asynchronous data movement that is aware of tensor and optimizer states; building robust, fault-tolerant training frameworks that incorporate comprehensive state management, resilient checkpointing, deterministic orchestration, and advanced observability; and managing distributed processes for enduring, job-specific, and user-defined workflows.We aim to seamlessly integrate proven large-scale capabilities into a developer-friendly runtime, enabling teams to iterate rapidly and operate reliably across various scales. Our success is gauged by both the enhancement of training throughput (the speed of model training) and researcher throughput (the pace at which ideas transform into experiments and products).About the RoleAs a Training Performance Engineer, you will be instrumental in driving efficiency enhancements throughout our distributed training architecture. Your responsibilities will include analyzing extensive training runs, pinpointing utilization gaps, and engineering optimizations that maximize throughput and system uptime. This position merges a profound understanding of systems with practical performance engineering—analyzing GPU kernel performance, collective communication throughput, and investigating I/O bottlenecks, while also implementing model sharding techniques for large-scale training.Your efforts will ensure our clusters operate at peak performance, enabling OpenAI to develop larger and more sophisticated models within existing compute budgets.This position is located in San Francisco, CA, utilizing a hybrid work model with three days in the office each week, and we offer relocation assistance for new hires.Key Responsibilities:Analyze end-to-end training runs to detect performance bottlenecks across computation, communication, and storage.Enhance GPU utilization and throughput for large-scale distributed model training.Collaborate with runtime and systems engineers to boost kernel efficiency, scheduling, and collective communication performance.Implement model graph transformations to enhance overall throughput.Develop tools for monitoring and visualizing metrics such as MFU, throughput, and uptime across clusters.
Oct 16, 2025
Sign in to browse more jobs
Create account — see all 5,224 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.