Technical Staff Member - Infrastructure for AI Systems

Chakra Labs, Brooklyn
On-site, Full-time

Qualifications

We are looking for a candidate who is not only technically adept but also passionate about pushing the boundaries of AI technology.

About the job

At Chakra Labs, we are dedicated to creating innovative environments for AI agents, focusing on systems that enhance and measure their productive capabilities.

Key Responsibilities

  • Agent Orchestration at Scale: Manage hundreds of simultaneous agent executions, each with its own stateful environment. You'll own the dispatch layer, including SQS, concurrency management, and failure recovery.

  • Environment and Task Design: Develop realistic environments and challenging scenarios that push agents to their limits. Your role will involve constructing new evaluations and designing meaningful tasks that assess critical performance metrics.

  • Exploring New Frontiers: Stay on the cutting edge of agent evaluation by supporting new environment modalities and integrating external orchestration frameworks.

  • Observability: Implement Prometheus and OpenTelemetry across services, create Grafana dashboards, and manage structured logging.

Qualifications

  • Container Orchestration: Proficient in managing Kubernetes or similar technologies in production environments, including auto-scaling, pod lifecycle management, persistent storage, and networking.

  • Distributed Systems: Experience in building or maintaining message-driven architectures (e.g., SQS, Kafka). You understand how to manage job flows, implement retries without duplication, and handle failures gracefully.

  • LLM Infrastructure: Familiarity with running LLM workloads at scale, including token instrumentation, rate limit management, prompt caching, and multi-provider routing.

  • Experience: Approximately 3-5 years of relevant experience, though we are open to candidates outside that range who have the required skills and knowledge.

Why Join Us?

  • Unique Focus: While this role is centered on infrastructure, the workload is AI agents. You'll monitor model behavior alongside pod health and analyze token throughput alongside network performance.

  • Engaging Clients: Collaborate directly with AI researchers and labs, contributing to the advancement of agent capabilities and building the foundational infrastructure they rely on.

  • Dynamic Team Environment: Take ownership of entire systems rather than just tasks, with opportunities to impact various projects and initiatives.

About Chakra Labs

Chakra Labs specializes in developing environments that empower AI agents to optimize their performance and expand their capabilities. We are at the forefront of innovation in AI infrastructure, dedicated to supporting researchers and labs in their pursuit of excellence.
