About the job
Join gsstech-group as a Technology Engineer specializing in AI/ML platforms, MLOps, and GenAI infrastructure. In this role, you will design, build, and scale AI systems. We are looking for a candidate with strong experience in containerized environments, model serving, and cloud-based AI architecture, with an emphasis on performance, scalability, and resilience.
Key Responsibilities:
- Design, build, and maintain containerized applications leveraging technologies such as OpenShift, OpenShift AI, Kubernetes, and Helm Charts.
- Deploy and optimize AI inference engines like Triton Inference Server and vLLM for high-performance model serving.
- Lead end-to-end model lifecycle management, including deployment, monitoring, scaling, and retraining workflows.
- Implement comprehensive monitoring, logging, and alerting systems using Prometheus and Grafana.
- Collaborate on GenAI and LLM-based projects, particularly agentic AI solutions.
- Develop and automate CI/CD pipelines with tools such as Jenkins (with Groovy), Ansible, and Terraform.
- Create automation scripts and internal tools using Python.
- Architect and manage AI/ML solutions on AWS, preferably using services such as SageMaker and Bedrock.
- Enhance AI platforms across on-premises and cloud environments.
- Ensure systems are highly scalable, fault-tolerant, and optimized for performance.
- Contribute to architecture design, platform roadmap, and strategic technical decisions.

