About the Job
As a global leader in data resilience, Veeam empowers businesses to manage their data seamlessly, providing cutting-edge solutions in data backup, recovery, portability, security, and intelligence. Headquartered in Seattle, Veeam serves over 550,000 customers worldwide and helps keep their operations running smoothly. Join us in shaping the future of data resilience. Let's move forward together, learning and making impactful contributions for some of the world's leading brands.
About the Role
Veeam is expanding its Site Reliability Engineering (SRE) organization to enhance our service offerings. As the SRE Team Leader, you will build and lead a dynamic team that collaborates with product, platform, and security engineering to ensure our systems are reliable, scalable, and observable from the ground up. You'll partner with fellow engineering leaders to embed reliability into our service roadmaps.
In this role, you will champion the adoption of SRE principles such as SLIs/SLOs and error budgets, while managing a healthy, daytime-only follow-the-sun on-call model in collaboration with other regions. You will guide your team in improving the operability, reliability, resilience, and security of the services we support.
What You’ll Do
People & Team Leadership
- Recruit, onboard, and develop your SRE team.
- Foster a culture that values learning and engineering over fault-finding and crisis management.
- Maintain sustainable operational coverage; oversee on-call health and workloads.
Reliability Strategy & Governance
- Establish and operationalize SLIs/SLOs and error budgets in collaboration with service owners.
- Conduct reliability reviews and ensure accountability for outcomes.
- Define reliability standards, runbooks, readiness checklists, and alerting patterns, including SLO-based alerting.

