About the job
As a Manager of Data Centre & Cloud Operations, you will take charge of the design, execution, and upkeep of the infrastructure supporting our data center operations. This role requires an in-depth knowledge of data center systems encompassing hardware, networking, cooling, power, and security. You will also possess strong leadership capabilities to manage teams, streamline processes, and uphold the facility's operational excellence.
Key Responsibilities:
- Infrastructure Management:
- Oversee the management and maintenance of both physical and virtual infrastructures, including servers, storage, network devices, and power and cooling systems.
- Ensure the data center infrastructure remains secure, reliable, scalable, and aligned with the organization’s business and operational requirements.
- Plan, implement, and upgrade infrastructure to satisfy performance and capacity expectations.
- Team Leadership and Collaboration:
- Lead, supervise, and collaborate with other departments, such as IT, network operations, security, and facilities, to ensure seamless data center operations.
- Establish and uphold operational standards and best practices for data center personnel.
- Capacity Planning and Optimization:
- Monitor data center performance and capacity to forecast future infrastructure requirements and proactively tackle potential challenges.
- Formulate and execute strategies to optimize resource utilization within the data center, including space, power, cooling, and network bandwidth.
- Project Management:
- Direct the planning and execution of various infrastructure projects, including data center builds, expansions, and migrations.
- Oversee timelines, budgets, and resources to guarantee successful project outcomes.
- Track and report progress on significant milestones, ensuring projects remain on schedule and within budget.
- Security and Compliance:
- Ensure the data center adheres to industry standards and complies with applicable security, regulatory, and environmental regulations.
- Implement physical security measures and collaborate with security teams to safeguard data integrity and access controls.
- Develop disaster recovery and business continuity plans for data center operations.
- Troubleshooting and Incident Management:
- Lead troubleshooting initiatives and guarantee swift resolution of any infrastructure issues or outages.
- Develop and maintain incident response procedures to minimize downtime and meet recovery objectives.
- Conduct root cause analyses and continuous improvement processes for infrastructure incidents.
- Cost Management:
- Oversee the budget for data center infrastructure, identifying opportunities for cost-saving measures.

