About the job
About the Team
Join OpenAI’s innovative Hardware organization, where we are dedicated to developing state-of-the-art silicon and system-level solutions tailored for the complex demands of advanced AI workloads. Our team is at the forefront of crafting the next generation of AI-native silicon, collaborating closely with software and research partners to create hardware that seamlessly integrates with AI models. We not only deliver production-grade silicon for OpenAI’s supercomputing infrastructure but also engineer custom design tools and methodologies that foster innovation and optimize hardware specifically for AI applications.
About the Role
We are seeking a talented RTL Engineer to take charge of designing and implementing critical compute, memory, and interconnect components for our bespoke AI accelerator. You will engage closely with architecture, verification, physical design, and machine learning engineers to translate AI workloads into highly efficient hardware structures. This is an exciting hands-on design role that offers significant ownership over definition, modeling, and implementation aspects.
This position is based in San Francisco, CA, and follows a hybrid work model requiring 3 days in the office weekly. We provide relocation assistance for new employees.
Responsibilities
Develop clean, production-quality microarchitecture and RTL for key accelerator subsystems.
Contribute to architectural studies, including performance modeling and feasibility assessments.
Collaborate with software, simulator, and compiler teams to ensure effective hardware/software co-design and optimal workload integration.
Work with design verification (DV) and physical design (PD) teams to guarantee functional correctness, timing closure, area/power targets, and smooth integration.
Construct and evaluate performance and functional models to validate design intentions.
Engage in design reviews, documentation, and support throughout the entire silicon lifecycle.
You Might Excel In This Role If You Have:
Advanced research or industry experience in computer architecture and AI/ML hardware-software co-design, encompassing workload analysis, dataflow mapping, or optimization of accelerator algorithms.
Proficiency in writing production-quality RTL in Verilog/SystemVerilog, with a proven track record of successfully delivering complex blocks to tape-out.
Experience in developing hardware design models or architectural simulators, preferably for AI/ML or high-performance computing systems.
Familiarity with industry-standard design tools (lint, CDC/RDC, synthesis, STA) and methodologies.
Strong analytical skills and a collaborative mindset.

