About the job
Huawei Canada's Distributed Scheduling and Data Engine Lab in Markham has contributed to Huawei Cloud's technical growth since 2014. The team specializes in cloud-native databases, intelligent SQL engines, AI and agent infrastructure, and large language model evaluation. Close collaboration with industry experts shapes both new product development and the continuous improvement of cloud platforms.
Role overview
The Senior Engineer - Cloud AI Infrastructure role focuses on building and refining infrastructure to support AI and agentic workloads. This position blends research, systems engineering, and product delivery to advance cloud AI capabilities.
What you will do
- Develop infrastructure for AI and agent workloads, combining technical research with hands-on engineering.
- Track trends in large language models, agentic AI, and multi-step agent workflows to inform infrastructure decisions.
- Identify and address performance bottlenecks related to GPU/NPU usage, data transfer, memory management, and distributed execution.
- Design and implement system-level architectures for agent execution, multi-model orchestration, and large-scale inference.
- Evaluate AI workload requirements in cloud and hybrid environments and optimize for cost, performance, and scalability.
- Analyze the infrastructure stack, including distributed schedulers, inference pipelines, caching, and data access patterns.
- Work with engineering and product teams to prototype and deliver solutions based on research findings.
- Translate emerging AI trends and workload patterns into scalable infrastructure designs.
Location
Markham, Ontario, Canada

