About the job
Veeam Software develops solutions for data resilience and security posture management, supporting more than 550,000 customers worldwide. Headquartered in Seattle and operating in more than 30 countries, Veeam helps organizations understand, protect, and strengthen their data and AI systems. The company brings together identity, data, security, and AI risk management to enable safer, scalable AI adoption.
Role overview
The Senior Platform Engineer - Cloud Workloads joins the Workload team within Veeam’s R&D department. This position centers on maintaining observability infrastructure, improving incident response, and expanding proactive support for cloud-based workloads.
Main responsibilities
- Design, build, and operate observability pipelines using the Elastic Stack (Elasticsearch, Kibana, Fleet) for Azure and AWS environments.
- Create and manage SLO/SLI dashboards and error budget reports for Backup-as-a-Service (BaaS) platform services.
- Lead incident response for distributed, multi-tenant cloud workloads, and maintain and improve the associated runbooks.
- Develop and refine proactive support tools, such as pattern analysis, tenant correlation dashboards, and baseline-deviation alerts, to reduce reactive support.
- Administer Elastic Fleet agent policies, monitor enrollment health, and manage log streaming pipelines across Azure and AWS worker fleets.
- Collaborate with SRE, R&D, and Proactive Support teams to address observability gaps, including tenant identification workflows and admin portal integrations.
Location
This role is based in San Jose, CA, USA.

