About Fluidstack
At Fluidstack, we are at the forefront of creating the infrastructure that powers advanced artificial intelligence. We collaborate with leading AI laboratories, governmental entities, and major enterprises—including Mistral, Poolside, and Meta—to deliver computing capabilities at unprecedented speeds.
Our mission is to bring Artificial General Intelligence (AGI) to fruition, and our team is driven by a strong commitment to excellence in building world-class infrastructure. We view our customers' successes as our own, and we take immense pride in the systems we develop and the trust we cultivate. If you are fueled by purpose, dedicated to exceptional standards, and eager to work diligently to shape the future of intelligence, we invite you to join us in this transformative journey.
About the Team
The Data Center Operations team is pivotal in supporting our rapid growth by deploying and managing large-scale datacenters. We take complete on-site responsibility for each facility and oversee the entire lifecycle of our hardware fleet, delivering scalable and reliable infrastructure solutions and services.
About the Role
As a Site Manager, you will lead and expand a team of over 10 technicians and specialists across multiple datacenter facilities within a designated campus or region. You will drive operational strategy, team development, and execution excellence while managing the complete lifecycle of datacenter infrastructure, ensuring that our 24/7 operations consistently meet or exceed our service level agreements (SLAs).
Key Responsibilities
Recruit, develop, and oversee cross-functional operations teams, including Facility Managers, Operations Technicians, and Logistics Specialists across various sites; enhance organizational capabilities through performance management, coaching, and career development.
Determine optimal staffing levels and organizational structures to meet current and future business needs through effective headcount forecasting, resource allocation, and succession planning.
Maintain 24/7 operational accountability, ensuring uptime, reliability, security, and availability meet or exceed SLAs; oversee incident and change management processes and act as the escalation point for critical incidents.
Ensure effective management of ticket queues, work prioritization, shift scheduling, and on-call rotations across facility, operations, and logistics teams.
Drive operational KPIs and metrics including uptime, Mean Time to Recovery (MTTR), Mean Time Between Failures (MTBF), inventory accuracy, capacity utilization, and cost efficiency; lead post-incident reviews with root cause analysis and implement necessary corrective actions.

