About the job
Join Transform9, where we are revolutionizing healthcare access and enhancing patient communication with our cutting-edge conversational agent platform. Our goal is to deliver seamless experiences for patients and healthcare professionals alike. We are currently looking for a skilled Site Reliability Engineer to safeguard the health, performance, and reliability of our systems. In this pivotal role, you will collaborate with our development and operations teams to design and maintain a scalable infrastructure, automate processes, and improve service availability. Your expertise will play a crucial role in fostering a robust environment that supports our ambitious growth in the healthcare industry.
Responsibilities
- Design, implement, and maintain scalable and reliable systems that support the Transform9 platform and its services.
- Monitor system performance, manage incidents, and troubleshoot issues to maintain high uptime and reliability.
- Build and maintain CI/CD pipelines that enable smooth deployments and automate workflows.
- Work with development teams to establish best practices in system architecture, deployment, and monitoring.
- Implement observability solutions to gain insights into system performance and user experience.
- Participate in on-call rotations to respond to system alerts, conduct root cause analyses, and carry out remediation.

