About the job
Antimetal is on the lookout for a Research Engineer to enhance the intelligent systems that drive our innovative solutions. In this role, you'll be tasked with prototyping cutting-edge methodologies, conducting experiments, and overseeing the transition from research to real-world application. Collaboration with our platform and product teams will be key as you help shape the capabilities of our AI agents and contribute to the development of evaluation methodologies.
Modeling infrastructure and its observability is a challenging domain. The telemetry we handle is often high-volume, chaotic, and transient, with ground truth that is often only approximate. Our goal is to develop AI agents capable of understanding this intricacy and reasoning about operational dynamics, including implementing code and configuration modifications.
Research Focus Areas
Infrastructure Intelligence: Developing models to comprehend infrastructure behaviors and the underlying reasons, such as anomaly detection, issue forecasting, telemetry analysis across logs, metrics, events, and traces, and understanding causal relationships. These foundational capabilities enable our agents to reason effectively about infrastructure.
Autonomous Agents: Designing long-running, parallel agents that can identify, diagnose, and remediate infrastructure problems, including the ability to fix code and configuration issues. Enhancements in multi-step reasoning, orchestration, context management, memory, and reinforcement learning will be essential for this role.
Evaluation: Ensuring the performance of our agents and guiding improvements by collaborating with the platform team to establish evaluation methodologies, generate synthetic data, analyze past incidents, and model the domain.
About Antimetal
Antimetal aims to redefine the future of infrastructure management. Our mission is to create a platform that identifies, resolves, and prevents issues, allowing engineers to focus on what they excel at: developing exceptional products.
Your Responsibilities:
Experiment, Evaluate, Iterate, Ship: Conduct experiments across our research domains, analyze outcomes, validate effective methods, and transition successful strategies into production.
Build Evaluation Infrastructure: Work alongside the platform team to establish live and offline evaluation systems, benchmarks, and synthetic data generation tools that facilitate continuous improvement.

