About the job
Job Specification: Cloud Infrastructure Manager
About Us
At AMCS Group, we are leaders in sustainability software, headquartered in Ireland, with a global presence in Europe, the USA, and Australasia. Our team of over 1,300 talented professionals across 22 countries is dedicated to providing innovative technology solutions that pave the way for a carbon-neutral future.
Our Mission
We focus on developing cutting-edge SaaS solutions that enhance efficiency and promote sustainability in resource-heavy industries. With more than 5,000 clients spanning 23 countries, our Performance Sustainability software delivers tangible benefits, improving both profitability and environmental sustainability worldwide.
Position Overview
We are on the lookout for a strategic and experienced Cloud Infrastructure Manager to spearhead and enhance our Site Reliability Engineering (SRE), Cloud Operations, and FinOps teams. This key position is responsible for ensuring the infrastructure and operations are reliable, secure, and cost-effective to support our extensive product delivery. You will lead operational strategy, foster organizational growth, and ensure cross-functional collaboration, while taking charge of both team leadership and the technical direction in these vital areas.
Key Responsibilities
Strategic Leadership:
Develop and implement an integrated strategy for SRE, Cloud Operations, Cloud Security, and FinOps.
Create and execute a cloud optimization roadmap that prioritizes cost-effectiveness while maintaining high reliability and performance standards.
Provide guidance, mentorship, and performance assessments for multi-disciplinary teams.
Work closely with senior engineering leaders to align platform and reliability efforts with overall business goals.
Site Reliability Engineering:
Lead the SRE teams focused on ensuring reliability, availability, performance, and operational excellence.
Advance observability strategies, including metrics, logs, traces, dashboards, and alerting quality.
Promote SRE best practices such as SLIs, SLOs, error budgets, and toil reduction.
Cloud Operations:
Oversee cloud infrastructure provisioning, governance, and cost optimization across Azure, AWS, and GCP.
Encourage automation-first operational models to minimize manual processes.
Cloud Security:
Implement secure-by-default cloud architecture, governance, and controls.
Collaborate with IT security teams to integrate policy-as-code and identity-based access models.

