About the job
About Rootly
At Rootly, we are dedicated to revolutionizing how organizations manage incidents. Our mission is to provide a reliable incident management platform that empowers companies to respond swiftly and effectively when challenges arise. Our innovative approach has established us as a leader in a new multibillion-dollar segment, and we are seeking exceptional talent to help us achieve our ambitious goals.
Our customers, including industry giants like NVIDIA, Figma, Canva, and Tripadvisor, trust Rootly for their critical incident management needs. They appreciate our user-friendly platform and unique partnership approach, which have earned us a 5-star rating on G2. Join us in creating a reliable future for organizations worldwide.
We are backed by investors ranging from Y Combinator to key operators in tech. We prioritize transparency and involve the whole team in our financial health: we conduct monthly business reviews and share updates through our weekly changelog.
About the Role
As a Senior Site Reliability Engineer at Rootly, you will play a crucial role in shaping our technical infrastructure. You will thrive in a dynamic environment where each day presents new challenges and opportunities for growth. This position is ideal for people who seek ownership, enjoy tackling complex technical problems, and are driven by a mission to enhance reliability. The work will be demanding, but it promises to be one of the most rewarding experiences of your career.
In this role, you will:
- Collaborate with product teams to enhance the observability, reliability, and performance of services.
- Take ownership of our CI/CD pipelines, observability tools, monitoring systems, and incident response processes.
- Develop tools and automation to reduce manual toil, enhance engineering velocity, and improve developer experience and system reliability.
- Engage deeply with engineering teams to gain insights into system performance and identify cross-functional reliability and scaling concerns.
- Design and scale our infrastructure while ensuring top-notch performance and operational excellence.

