companyMaintainX logo

Site Reliability Engineer at MaintainX | Montreal & Toronto

MaintainXMontreal & Toronto
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

What You Will Do: Evaluate service maturity and provide insightful guidance to development teams. Collaborate with development teams to implement best practices for observability. Empower development teams to take ownership of their service deployment, support, and infrastructure. Mentor developers on reliability practices, focusing on fostering self-sufficiency. Act as the liaison for the Platform Division teams to promote the adoption of tooling and practices across development teams. About You: In-depth understanding of observability practices within distributed system environments and their impact on system design and team dynamics. Hands-on experience with SRE concepts, including SLOs, error budgets, and incident management. 3-5+ years of experience in software engineering, SRE, DevOps, or production engineering roles with a track record of operating production systems. Proficient in cloud-native platforms and infrastructure-as-code concepts and tools. Familiarity with at least one programming language.

About the job

Join MaintainX, the foremost Asset and Work Intelligence platform designed for industrial and frontline environments. As a modern, IoT-enabled, cloud-based tool, we ensure the reliability, safety, and operational efficiency of physical equipment and facilities. Trusted by over 12,000 businesses, including industry giants like Duracell, Shell, and McDonald's, MaintainX is at the forefront of operational excellence.

Having recently secured $150 million in Series D funding, we now boast a total of $254 million in funding, elevating our company valuation to $2.5 billion.

We are on the lookout for a Site Reliability Engineer (SRE) to enhance MaintainX’s reliability, observability, and developer autonomy as we expand our platform.

In this pivotal role, you will collaborate closely with product and platform engineering teams to bolster the stability, resilience, and operational readiness of our services. Your contributions will include designing for reliability from the outset, establishing clear ownership and standards, and developing shared tooling that empowers teams to manage their services confidently.

Moreover, you will play a crucial role in shaping company-wide initiatives that define our approach to reliability engineering, including the establishment of observability standards, incident response practices, and service health metrics, thereby facilitating the adoption of proven industry practices at scale.

This position is ideal for an engineer who thrives on cross-team collaboration, influences technical direction through robust engineering practices, and transforms reliability principles into practical, scalable systems.

About MaintainX

MaintainX is the leading Asset and Work Intelligence platform tailored for industrial and frontline environments. Our cutting-edge, IoT-enabled, cloud-based solutions drive operational excellence across a diverse range of industries, ensuring the reliability and safety of physical equipment and facilities. With a distinguished client base and substantial funding, MaintainX is poised for continued growth and innovation.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.