About the job
The Machine Learning Observability team at Datadog develops tools that monitor, interpret, and improve AI systems deployed in production environments, with a special focus on Large Language Models (LLMs) and generative AI technologies. Our solutions provide comprehensive and scalable observability for AI workloads, including drift detection, model evaluation, and behavior tracing, empowering our customers to deploy AI confidently.
As a Staff Software Engineer, you will be instrumental in driving the development of new features and core capabilities within Datadog’s LLM Observability product. You will influence product strategy, lead experimental initiatives, and leverage your extensive knowledge of AI systems and software engineering to tackle complex challenges in the rapidly evolving AI domain. Your contributions will have a significant impact on how our customers monitor, diagnose, and optimize LLM-powered applications in production.
Join us in creating the essential tools that ensure AI systems are observable, comprehensible, and dependable in real-world applications.
At Datadog, we value our office culture, which fosters relationships, collaboration, and creativity. We operate within a hybrid work model to enable our Datadogs to achieve a work-life balance that suits them best.

