About the job
About the Team
The Preparedness team plays a crucial role within the Safety Systems organization at OpenAI, adhering to our Preparedness Framework.
While frontier AI models promise to bring significant benefits to humanity, they also introduce substantial risks. The Preparedness team is dedicated to ensuring that the development of advanced AI models fosters positive outcomes. Our mission includes identifying, monitoring, and preparing for catastrophic risks associated with these technologies.
Key Mission Objectives:
- Monitor and predict the evolving capabilities of frontier AI systems to identify misuse risks that could significantly impact society.
- Establish concrete procedures, infrastructure, and partnerships to mitigate these risks and ensure the safe development of powerful AI systems.
This fast-paced and impactful role connects capability assessment, evaluations, internal red teaming, and mitigations for frontier models, facilitating coordination on AGI preparedness.
About the Role
As a Threat Modeler, you will spearhead OpenAI's comprehensive approach to identifying, modeling, and forecasting risks from frontier AI systems. Your work will ensure that our evaluation frameworks, safeguards, and classifications are robust, comprehensive, and future-focused. You will help articulate the rationale behind our most stringent risk-prevention strategies, influencing prioritization and mitigation across various domains. This position acts as a central hub, integrating technical, governance, and policy considerations regarding our approach to frontier AI risks.
Key Responsibilities
- Develop and maintain comprehensive threat models across various misuse areas (biological, cyber, attack planning, etc.).
- Create plausible threat models addressing loss of control, self-improvement, and other potential risks associated with alignment from frontier AI systems.
- Forecast risks by merging technical foresight, adversarial simulation, and current trends.
- Collaborate closely with technical partners on capability evaluations and risk assessments.

