About the job
flatgigs is hiring a Senior Machine Learning DevOps Engineer & IT Support specialist for its Dubai office. This onsite position focuses on building, securing, and maintaining the core systems that power our AI infrastructure.
Role Overview
This role combines MLOps, cloud networking, and IT service management. The position covers both hardware and software frameworks supporting machine learning workloads. Reliable support for production GPU environments and internal connectivity is essential to keep our team productive.
Main Responsibilities
- Manage and scale multi-cluster GPU environments and high-compute AI workloads.
- Provide hands-on IT support, including user access, device management, and internal systems.
- Design and maintain secure cloud network architectures on Azure and GCP.
- Support MLOps pipelines and help deploy machine learning models.
- Administer internal IT infrastructure, including identity and access management (IAM), single sign-on (SSO), and mobile device management (MDM).
- Set up monitoring and alerting for GPU health, system performance, and security.
- Apply cloud security best practices across infrastructure and endpoints.
- Troubleshoot and resolve system, deployment, and connectivity issues.
- Work closely with AI engineers to optimize infrastructure for machine learning tasks.
Location
This is a full-time, onsite role based in Dubai, United Arab Emirates.

