About the job
About Future Secure AI
Future Secure AI develops AI Co-Workers that automate operational tasks for enterprise clients. The team focuses on production-ready systems built for scale and real-world demands. Reliability, resilience, and disciplined engineering guide daily work. The company values individual growth and fosters a collaborative culture shaped by its BRAVER values. Leadership is entrepreneurial and accessible, supporting employees as individuals.
Role Overview: Site Reliability Engineer
Based in Sydney, the Site Reliability Engineer will design, build, and maintain the infrastructure supporting AI Co-Workers. This role involves close collaboration with product, AI, and engineering teams and offers the chance to take ownership of reliability throughout the system lifecycle.
Key Responsibilities
- Design, build, and manage reliable production infrastructure for AI Co-Workers.
- Oversee Kubernetes-based platforms for deploying and running AI workloads.
- Create and maintain infrastructure as code using Terraform.
- Implement and manage Helm-based deployment workflows.
- Define, measure, and improve system reliability using SLIs, SLOs, and SLAs.
- Participate in on-call rotations, handle incident response, conduct root cause analysis, and contribute to post-mortem reviews.
- Reduce operational toil through automation and engineering improvements.
- Develop and enhance observability, including monitoring, logging, and alerting.
- Work with engineers to ensure systems remain resilient, scalable, and secure.
- Manage tasks across build, deploy, and operate phases of the software lifecycle.

