About the job
At Scale AI, we are at the forefront of the AI revolution, providing the essential data infrastructure that empowers organizations to create and implement robust AI applications. Our partnerships with top enterprises and government entities accelerate their AI goals through innovative data annotation platforms, generative AI solutions, and comprehensive enterprise AI capabilities.
Discover the General Agents Team
The General Agents team, an integral part of Scale's Enterprise division, is dedicated to developing advanced general agents tailored for diverse customer applications. We operate at the cutting edge of agent technology, transforming sophisticated reasoning and agentic capabilities into dependable, production-ready systems that deliver substantial economic benefits. Our agents are designed for scalability, focusing on recurring enterprise challenges, with a strong emphasis on generalization, extensibility, and widespread deployment.
Your Impact in This Role
As a Senior/Staff Machine Learning Engineer on the General Agents team, you will be pivotal in architecting, building, and deploying production-grade AI agents that address significant enterprise challenges. Your role will encompass the entire agent lifecycle—from system design and model evaluation to deployment and iterative refinement—effectively merging cutting-edge agent techniques with the practicalities of real-world customer settings.
You will:
- Create and implement comprehensive agent systems that integrate LLM reasoning, memory, tool usage, and control logic to tackle recurring enterprise challenges.
- Develop scalable and reliable agent architectures that can adapt to a variety of customer data and tools.
- Establish evaluation frameworks, datasets, environments, and metrics to assess agent performance, reliability, and business outcomes in live settings.
- Collaborate with product managers, clients, data annotators, and engineering teams to translate enterprise needs into robust agent designs.
- Transition cutting-edge agent techniques (e.g., planning, multi-step reasoning, tool utilization, multi-agent collaboration) into maintainable and observable systems.
- Oversee the deployment, monitoring, and iterative enhancement of agent systems, including failure analysis and continuous improvement based on actual usage.
- Guide the technical direction and architectural practices for general agent development, with increased scope and leadership at the Staff level.

