About the job
About the Role
Perplexity is seeking a talented Model Behavior Architect to join our innovative AI team in San Francisco. In this role, you will be instrumental in developing and evaluating AI products that enhance user experiences across various domains. Collaborating closely with both research and product teams, you will design strategies for prompt and context engineering that ensure high-quality interactions.
This position uniquely blends creativity and analytical skills. You will gain a profound understanding of our answer engine by rigorously testing model capabilities and working with our AI infrastructure, including system prompts, tool prompts, skills, and evaluations, to create an exceptional product experience for our users.
As the go-to expert on prompting, model quality, and behavioral consistency, you will be pivotal in the deployment of new product features and model releases.
Key Responsibilities
Context Engineering: Create, test, and refine context strategies and system prompts that influence answer engine behavior across various products, features, and use cases.
Evaluation Systems: Develop automated and semi-automated evaluation pipelines to assess model quality, detect regressions, and scale across product surfaces.
Model Launch Support: Collaborate with research and engineering teams to validate model behavior prior to and during rollouts, ensuring seamless transitions without any degradation.
Research & Analysis: Identify inconsistencies and potential failure modes in model outputs through meticulously designed research initiatives for both internal and production-facing systems.
Cross-functional Collaboration: Work closely with design, product, and research teams to translate product objectives into specific model behavior requirements.
Knowledge Sharing: Assist engineers across teams in developing a strong understanding of prompt design, context engineering, and evaluation best practices.
Staying Current: Keep abreast of the latest alignment, evaluation, and prompting techniques from both industry and academia, and integrate the best ideas into the team.

