Infrastructure Reliability Engineer

Tecsys Inc. | Remote, Montreal, Quebec, Canada

Remote Full-time

Qualifications

We are looking for candidates who possess a strong background in reliability engineering, cloud infrastructure, and automation. Ideally, you should have:

Experience with AWS and Kubernetes environments.
A solid understanding of monitoring tools, particularly Datadog.
Skills in scripting and automation to enhance operational efficiency.
Strong analytical and problem-solving capabilities.
Excellent collaboration and communication skills.

About the job

About Tecsys

Tecsys is a rapidly growing innovator in supply chain solutions for leading healthcare systems, hospitals, pharmacies, distributors, retailers, and 3PLs. We collaborate with industry leaders to transform their supply chains through technology. If you thrive on tackling challenges and seek continuous learning opportunities, we invite you to join our dynamic team!

Position Overview

We are in search of an Infrastructure Reliability Engineer to join our Network Operations and Security Center (NOC) team, which is pivotal to the reliability of our critical SaaS platforms. In this role, you will contribute to the maintenance, optimization, and assurance of the reliability and performance of the systems that drive our cloud infrastructure on AWS and Kubernetes. A strong focus will be placed on automation, observability, and continuous improvement.

This position combines reliability engineering with incident management, placing you in a key role responsible for availability, performance, and innovation. You will be part of a highly skilled team that values creative problem-solving, operational excellence, and the continuous enhancement of resilience through automation and engineering.

Your Responsibilities

Collaborate with engineering teams to support services prior to their launch through activities such as systems design consultation, platform and software framework development, capacity planning, and launch reviews.
Continuously innovate by identifying weaknesses, proposing creative solutions, and driving initiatives that simplify, scale, and strengthen the platform.
Maintain services post-launch by measuring and monitoring availability, latency, and overall system health.
Ensure optimized observability: enhance and expand monitoring and alerting using Datadog; define SLOs/SLIs and create actionable dashboards that yield reliability outcomes.
Develop and enhance...

About Tecsys Inc.

Tecsys is a trailblazer in supply chain management, delivering innovative solutions to healthcare providers and various industries. Our focus on technology-driven transformation positions us at the forefront of supply chain innovation.

Similar jobs

Senior Site Reliability Engineer

MongoDB, Inc.

Full-time|CA$144K/yr - CA$200K/yr|Hybrid|Montreal; Toronto

The Storage Layer Services (SLS) team at MongoDB is embarking on an innovative journey to re-architect our cloud storage layer, forming the core of our next-generation cloud storage architecture. This newly established team is dedicated to creating high-performance, multi-tenant distributed storage services that not only enhance our current Atlas storage stack but also enable more efficient customer workloads. As a Senior Site Reliability Engineer, you will collaborate closely with teams responsible for these storage services to establish Service Level Objectives (SLOs), develop capacity plans, and guarantee the reliability, durability, and operational safety of the foundational storage layer supporting Atlas. By joining our small team of seasoned SREs, you will play an integral role in executing a multi-year roadmap for MongoDB’s cloud storage architecture. This position is open to candidates based in our Toronto or Montreal offices or those working remotely from anywhere in Canada, provided they are located in the Eastern or Central time zones.

Apr 8, 2026

Senior Site Reliability Engineer - Data

Lightspeed Commerce

Full-time|On-site|Montreal, Quebec, Canada

Data is the new gold! At Lightspeed, we empower our data teams to construct and manage cutting-edge data and AI infrastructure platforms, alongside a robust governance framework that ensures seamless data flow across the organization. Our focus lies in data security, reliability, and high availability.

Please note: As a global organization serving clients beyond Quebec, proficiency in English is a prerequisite for this role.

Role:

Work collaboratively with cross-functional data teams to design and implement scalable, reliable cloud infrastructure solutions that prioritize security and cost-efficiency.
Advocate for a comprehensive security approach that encompasses infrastructure, supply chain, and third-party integrations, ensuring the protection of our entire data ecosystem.
Contribute significantly to the development of self-service workflows for data and infrastructure, enabling teams to access resources efficiently and boost productivity.
Promote and uphold best practices in Infrastructure as Code (IaC), ensuring High Availability, Disaster Recovery, and Security are integral to all design and deployment efforts.

Additionally, you will:

Engage in daily support and troubleshooting activities.
Collaborate with the wider team to achieve organizational goals, even if it involves tasks outside your primary responsibilities.

Mar 31, 2026

Site Reliability Engineer (SRE)

MaintainX

Full-time|On-site|Montreal & Toronto

Join MaintainX, the world's leading platform for asset management and work intelligence tailored for industrial and frontline environments. We provide a modern IoT tool, powered by cloud-based computing, focusing on reliability, safety, and operations for physical equipment and facilities. Our services enhance operational excellence for over 12,000 businesses, including renowned names like Duracell, Univar Solutions Inc, Titan America, McDonald's, Brenntag, Cintas, Xylem, and Shell. Recently, we secured $150 million in Series D funding, bringing our total funding to $254 million and valuing the company at $2.5 billion. Role Overview: Assess the maturity level of services and provide recommendations to development teams. Collaborate with development teams to implement best practices in observability. Empower development teams to take ownership in deploying, supporting, and operating their services and infrastructure. Mentor developers on reliability practices, focusing on fostering their independence. Work closely with other platform division teams to promote the adoption of practices among development teams.

Feb 12, 2026

Site Reliability Engineer at MaintainX | Montreal & Toronto

MaintainX

Full-time|On-site|Montreal & Toronto

Join MaintainX, the foremost Asset and Work Intelligence platform designed for industrial and frontline environments. As a modern, IoT-enabled, cloud-based tool, we ensure the reliability, safety, and operational efficiency of physical equipment and facilities. Trusted by over 12,000 businesses, including industry giants like Duracell, Shell, and McDonald's, MaintainX is at the forefront of operational excellence. Having recently secured $150 million in Series D funding, we now boast a total of $254 million in funding, elevating our company valuation to $2.5 billion. We are on the lookout for a Site Reliability Engineer (SRE) to enhance MaintainX’s reliability, observability, and developer autonomy as we expand our platform. In this pivotal role, you will collaborate closely with product and platform engineering teams to bolster the stability, resilience, and operational readiness of our services. Your contributions will include designing for reliability from the outset, establishing clear ownership and standards, and developing shared tooling that empowers teams to manage their services confidently. Moreover, you will play a crucial role in shaping company-wide initiatives that define our approach to reliability engineering, including the establishment of observability standards, incident response practices, and service health metrics, thereby facilitating the adoption of proven industry practices at scale. This position is ideal for an engineer who thrives on cross-team collaboration, influences technical direction through robust engineering practices, and transforms reliability principles into practical, scalable systems.

Feb 12, 2026

Senior Production Reliability Engineer

Unity Technologies

Full-time|On-site|Montreal, Canada

Unity Technologies is seeking a Senior Production Reliability Engineer based in Montreal, Canada. This role centers on keeping production systems stable and responsive, directly supporting the products and services that power Unity's platform.

Key responsibilities

Diagnose and resolve production issues quickly to reduce downtime and service disruptions.
Work closely with engineering, operations, and product teams to improve system reliability.
Develop and refine processes that help maintain a stable, high-performing infrastructure.
Support Unity's growth by contributing to scaling strategies as services and user numbers increase.

Role impact

This position plays a vital part in Unity's ability to deliver reliable products. Efforts in this role help ensure systems can handle both current operations and future expansion, keeping performance strong as demand grows.

Apr 27, 2026

Infrastructure Reliability Engineer

Tecsys Inc.

Full-time|Remote|Remote — Montreal, Quebec, Canada

At Tecsys, we recognize the transformative power of remote work on employee well-being and the environment. Our commitment to remote work fosters enhanced employee morale, productivity, and reduced commuting times. We are proud to be a remote-first organization, supported by cutting-edge technologies and programs that create a fantastic foundation for our team. Our flexible remote environment, complemented by well-located offices and collaborative workspaces, empowers our staff to work in ways that maximize their productivity.

About Tecsys

Tecsys is a rapidly growing innovator in supply chain solutions for leading healthcare systems, hospitals, pharmacies, distributors, retailers, and 3PLs. We collaborate with industry leaders to transform their supply chains through technology. If you thrive on tackling challenges and seek continuous learning opportunities, we invite you to join our dynamic team!

Position Overview

We are in search of an Infrastructure Reliability Engineer to join our Network Operations and Security Center (NOC) team, which is pivotal to the reliability of our critical SaaS platforms. In this role, you will contribute to the maintenance, optimization, and assurance of the reliability and performance of the systems that drive our cloud infrastructure on AWS and Kubernetes. A strong focus will be placed on automation, observability, and continuous improvement.

This position amalgamates reliability engineering with incident management, placing you in a key role responsible for availability, performance, and innovation. You will be part of a highly skilled team that values creative problem-solving, operational excellence, and the continuous enhancement of resilience through automation and engineering.

Your Responsibilities

Collaborate with engineering teams to support services prior to their launch through activities such as systems design consultation, platform and software framework development, capacity planning, and launch reviews.
Continuously innovate by identifying weaknesses, proposing creative solutions, and driving initiatives that simplify, scale, and strengthen the platform.
Maintain services post-launch by measuring and monitoring availability, latency, and overall system health.
Ensure optimized observability: enhance and expand monitoring and alerting using Datadog; define SLOs/SLIs and create actionable dashboards that yield reliability outcomes.
Develop and enhance...

Dec 16, 2025

Civil Engineering Site Inspector

Artelia

Full-time|On-site|Montreal

Join Artelia as a Civil Engineering Site Inspector, where you will play a crucial role in ensuring the successful execution of construction projects. As an integral part of our team, you will monitor site activities, enforce compliance with safety standards, and oversee the quality of materials used in projects. Your keen eye for detail and commitment to excellence will help us maintain our reputation for delivering high-quality engineering solutions.

Jan 5, 2026

Network Reliability Specialist

Squarepoint Capital

Full-time|$120K/yr - $120K/yr|On-site|Montreal, New York

Position Overview: The Network Reliability Specialist plays a pivotal role in maintaining the stability and integrity of Squarepoint's global trading network. This encompasses regional data centers, offices, and various cloud service providers. Key responsibilities include overseeing projects related to data, voice, video, network security, storage area networks, wireless networks, network monitoring, and capacity management. The Specialist will be involved in the installation, monitoring, maintenance, support, optimization, and documentation of all network hardware, software, and communication links. This role includes managing multiple projects and configuring both internal network services and those integrated with Internet-based services. Collaboration with the Core Networking team and trading systems developers is essential to ensure the technology team delivers an optimal trading platform. Design and implement highly available enterprise networks that include data centers, WANs, and offices. Lead project delivery initiatives related to data centers, WANs, and office setups. Ensure the optimization of network services and infrastructure to support trading operations.

Mar 10, 2026

Technician in Construction Site Surveillance

CIMA+

Full-time|On-site|Montreal

Join our specialized Infrastructure team as a Construction Site Surveillance Technician. You will primarily be responsible for inspecting construction sites and ensuring the rehabilitation of municipal infrastructures across various project scales.

Main Responsibilities:

Monitor the compliance of ongoing work and ensure adherence to plans and specifications, drafting site memos when necessary.
Conduct required measurements, perform quantity calculations, master the description of items in the bill of quantities, and prepare cumulative reports for monthly payment requests.
Be proactive in identifying discrepancies between the construction work and the plans, proposing solutions through communication with the project manager.
Coordinate the work performed by the laboratory and participate in site meetings.
Prepare construction reports, TQC plans, comment on additional work requests from the contractor, and deliver the required outputs to clients within stipulated deadlines.
Ensure compliance with occupational health and safety standards on construction sites and report any hazardous situations.

Mar 12, 2026

Urban Infrastructure Site Technician

Artelia

Full-time|On-site|Montreal

Join Artelia as an Urban Infrastructure Site Technician! In this exciting role, you will be responsible for overseeing construction sites, ensuring compliance with safety regulations, and collaborating with engineers and project managers. Your contributions will be vital in maintaining the quality and integrity of urban infrastructure projects. This position offers an opportunity to work in a dynamic environment where your skills will directly impact the community.

Feb 2, 2026

Senior Data Engineer

MaintainX

Full-time|On-site|Montreal, Toronto

MaintainX is the world’s leading platform for asset management and work intelligence in industrial and frontline environments. We offer a modern cloud-based IoT tool that enhances the reliability, safety, and operations of physical equipment and facilities. Our platform drives operational excellence for over 12,000 companies, including Duracell, Univar Solutions Inc, Titan America, McDonald’s, Brenntag, Cintas, Xylem, and Shell. Recently, we secured Series D funding of $150 million, bringing our total funding to $254 million and valuing the company at $2.5 billion. We are looking for a Senior Data Engineer to join our rapidly growing team. You will play a crucial role in the development and maintenance of the data platform that directly supports the MaintainX product while enabling internal analytics and business intelligence. Additionally, you will enhance the platform's capabilities and the tools used daily by our engineering teams. Your contributions will span the entire data platform, including batch, micro-batch, and streaming pipelines, deployment, governance, and development tools.

Jan 12, 2026

Project Manager - Site Supervision for Bridges and Infrastructure

CIMA+

Full-time|Hybrid|Montreal

Join our dynamic and growing team at CIMA+, where we are seeking a dedicated Project Manager specializing in site supervision for major infrastructure projects, including bridge rehabilitation and road construction. Our passionate team of over 50 professionals is currently involved in significant projects such as the refurbishment of the Louis-Hippolyte Lafontaine Tunnel, the extension of Highway A19, the Urbanova interchange, and various major repair initiatives in the Greater Montreal area.

We offer a flexible hybrid work model that allows you to balance working from home and in one of our offices in the metropolitan region.

The Transport team at CIMA+ has built a stellar reputation in the transportation sector over the past 30 years, successfully managing thousands of projects. Our talented engineers lead projects of all complexities and sizes, drawing from a diverse pool of experience that includes planning, design, inspection, construction management, and infrastructure reconstruction in both urban and rural environments. We take a holistic approach to transportation projects, prioritizing environmental sustainability, promoting resilient communities, and supporting sustainable growth. Our innovative solutions integrate various modes of transportation, ensuring safe and efficient coexistence for motorists, pedestrians, and cyclists alike. Join us in a collaborative, innovative environment where you will have a genuine impact. Together, we will push boundaries and rise to the challenges of tomorrow, building a better world!

Mar 11, 2026

Senior Process Engineer

cima2

Full-time|On-site|Montreal

Join cima2 as a Senior Process Engineer, where you'll be at the forefront of innovation and technology. As a key member of our engineering team, you will be responsible for developing and optimizing processes that enhance production efficiency and sustainability.

Apr 7, 2026

General Maintenance and Electrician – Remote Sites (55822001)

Sodexo Canada Ltd.

Full-time|On-site|Montreal

Role Overview

Sodexo Canada Ltd. is hiring a General Maintenance and Electrician for remote sites, based out of Montreal. This position plays a key part in keeping facilities running smoothly and safely. The work supports dependable operations for our clients in challenging locations.

What You Will Do

Perform general maintenance and repairs on building systems and equipment.
Handle electrical maintenance tasks to keep facilities safe and efficient.
Support day-to-day operations by addressing upkeep needs as they arise.

Location

This role is based in Montreal and serves remote sites as needed.

Apr 17, 2026

Senior Data Engineer

Plusgrade

Full-time|On-site|Montreal, Quebec

About Us

At Plusgrade, we believe that travel is not just about reaching a destination, but about enjoying the experiences created along the way. We partner with over 200 airlines, hospitality companies, cruise lines, railways, and financial services to transform everyday journeys into extraordinary experiences. Driven by our values of ambition, innovation, and collaboration, we constantly push boundaries and believe that we achieve more together. Join us in shaping the future of travel by unlocking the potential of data.

Role Overview

We are seeking a Senior Data Engineer to play a pivotal role in the evolution of our data platform. As part of our Data Engineering team, you will design and implement scalable, reliable, and high-performance data pipelines and platforms that support analytics, AI/ML, and customer-focused applications. In this role, you will serve as a technical leader, mentoring mid-level engineers, guiding solution design, and ensuring adherence to best practices. You will collaborate closely with a Staff Engineer and a Data Architect, participating in architectural decision-making while managing complex implementations and deliveries.

Your Responsibilities

Lead the design, development, and implementation of scalable, resilient, and cost-effective data pipelines for batch and streaming systems.
Collaborate with Staff Engineers and Data Architects to translate the overall architecture into concrete technical solutions.
Promote best practices in data modeling, pipeline design, code quality, and testing.
Establish and enhance automated monitoring, alerting, and data quality frameworks for critical pipelines.
Take ownership of complex, high-impact data engineering projects from conception to deployment and monitoring.
Guide the team in designing event-driven, real-time, and microservices-oriented data flows.
Optimize existing data infrastructure for performance, scalability, and cost-effectiveness in the cloud.
Mentor and coach mid-level engineers, conduct reviews, and provide feedback to strengthen team capabilities.
Collaborate with analysts, data scientists, and product teams to co-design data products that fuel advanced analytics and ML models.
Contribute to the evolution of data governance, ensuring compliance and best practices.

Apr 7, 2026

Senior Mechanical Lead Engineer

Segula Technologies

Full-time|On-site|Montreal

Join our team at Segula Technologies as a Senior Mechanical Lead Engineer. In this pivotal role, you will lead and manage mechanical engineering projects, ensuring innovative solutions that meet client specifications and industry standards. You will collaborate with cross-functional teams to develop cutting-edge mechanical systems, while mentoring junior engineers and contributing to the overall success of our technical direction.

Jan 7, 2026

Senior Software Engineer

Vention Inc.

Full-time|On-site|Montreal

About the Role

Vention Inc. is looking for a Senior Software Engineer to help build and improve software that supports manufacturing businesses. This position is based in Montreal.

What You Will Do

Design and develop software solutions that help clients optimize their manufacturing processes.
Work closely with colleagues from different teams to deliver reliable, user-focused products.
Contribute ideas that shape both the technology stack and the direction of new features.

Who You’ll Work With

This role is part of the engineering team and involves frequent collaboration across departments to improve product quality and user experience.

Apr 20, 2026

Senior Platform Engineer

Stay22

Full-time|On-site|Montreal HQ

About Stay22

At Stay22, we are transforming the way users convert online. Our AI-powered affiliate platform enables publishers, ticketing platforms, and content creators to unlock new revenue streams while enhancing the user experience for their audiences. With Stay22, our partners not only earn more but also provide greater value. Join us in making a significant impact in the affiliate marketing landscape.

About the Role

As a Senior Platform Engineer at Stay22, you will be instrumental in designing, enhancing, and evolving our robust, scalable, and highly reliable infrastructure. The Platform team plays a crucial role in building and maintaining the technical foundations that allow our product teams to innovate swiftly and confidently.

In this role, you will collaborate closely with engineering teams to enhance the performance, reliability, and scalability of our systems while streamlining deployments and improving the developer experience.

Main Responsibilities

Design, build, and maintain our cloud infrastructure and distributed systems.
Approach platform development with a product-centric methodology, treating it as an essential internal product.
Enhance CI/CD pipelines to ensure rapid, reliable, and secure deployments.
Implement and maintain observability tools, including monitoring, logging, and alerting.
Collaborate with product teams to improve performance and resilience of services.
Optimize the management of environments, configurations, and secrets.
Ensure security, reliability, and scalability of the platform.
Participate in process automation and enhance the developer experience.
Mentor developers and promote best engineering practices.

Mar 27, 2026

Senior Electrical Engineer - Hydroelectric Equipment

cima2

Full-time|On-site|Montreal

cima2 is seeking a Senior Electrical Engineer in Montreal who specializes in hydroelectric equipment. This role centers on designing, developing, and supervising the installation of electrical systems that support hydroelectric power generation.

Key responsibilities

Design and develop electrical systems tailored for hydroelectric projects.
Supervise installation activities and verify that systems meet required standards.
Assess the performance of electrical equipment and completed installations.
Work closely with multidisciplinary teams to deliver effective project outcomes.
Maintain compliance with all relevant industry standards throughout project phases.

Requirements

Significant experience in electrical engineering, with a focus on hydroelectric equipment.
Background in sustainable energy solutions.
Demonstrated success delivering projects in the hydroelectric field.

Apr 20, 2026

Lead Senior Systems Engineer

Spreedly

Full-time|On-site|Montreal, Quebec

Join Our Team:

At Spreedly, we pride ourselves on being the premier Open Payments Platform globally, strategically positioned at the heart of a network that processes over $50 billion in Gross Merchandise Volume (GMV) every year. Our Payments Orchestration platform enhances digital transaction efficiency by providing the most comprehensive marketplace for payment services. Leveraging our PCI-compliant architecture, the Advanced Vault solution delivers a modern feature set along with rule-based configurations, ensuring an optimized vaulting experience for all payment methods. Major global enterprises and rapidly growing companies accelerate their digital transformation by trusting our platform, which securely handles card data in our PCI-compliant vault, facilitating over $45 billion in annual transaction volumes with various payment services.

Our vision is to cultivate a diverse and inclusive payment ecosystem that enriches the world. We aim to accelerate commerce through an open, secure, and adaptable payment platform that embraces all participants in the payment landscape. Our team members are instrumental in realizing this vision by fostering a culture of autonomy, transparency, and collaboration within our dynamic, high-growth organization.

Key Product Offerings:

Spreedly offers an open payments platform that enhances connectivity for optimal payment performance. Our key products and services include:

Payment Gateway Integration: Seamlessly connects merchants, platforms, and marketplaces to numerous payment gateways and services.
Tokenization: Securely manages and stores payment data through a universal tokenization service.
Transaction Routing: Facilitates intelligent transaction routing to enhance both success rates and cost efficiency.
Payment Vault: A robust storage solution designed for sensitive payment information.
Fraud Tools Integration: Connects with various fraud prevention tools to bolster transaction security.

Role Overview:

As a Senior Systems Engineer at Spreedly, you will spearhead ambitious projects, engage in in-depth architectural discussions, and provide mentorship to team members. Your expertise will be essential in driving innovation and ensuring the robustness of our systems.

Jan 26, 2026