Qualifications
Key Responsibilities
- Design and implement real-time data streaming pipelines using Kafka and associated technologies
- Build and sustain scalable data engineering solutions on the Cloudera platform
- Develop and enhance ETL/ELT pipelines for extensive structured and unstructured data
- Collaborate with data scientists to deploy and manage ML models utilizing MLOps/LLMOps frameworks
- Establish and oversee CI/CD pipelines for data and ML workflows
- Utilize OpenShift/Kubernetes for containerized deployments
- Ensure adherence to data quality, governance, and security best practices
- Monitor, troubleshoot, and optimize data workflows and infrastructure
- Engage with cross-functional teams including product, engineering, and analytics

Required Skills & Experience
- 6+ years of experience in Data Engineering
- Extensive hands-on experience with Apache Kafka (Real-Time Streaming - RTS)
- Expertise in Cloudera Data Platform (CDP) or similar big data ecosystems
- Experience with CI/CD pipelines (Jenkins, GitLab, etc.)
- Hands-on experience with OpenShift/Kubernetes
- Strong programming skills in Python/Scala/Java
- Experience with distributed data processing frameworks (Spark, Hive, etc.)
- Solid understanding of data modeling and data warehousing concepts

Preferred Skills
- Experience in AIOps, MLOps, or LLMOps
- Familiarity with AI/ML lifecycle management and deployment
- Knowledge of cloud platforms (AWS/Azure/GCP) is advantageous
- Experience in the banking or financial services domain

Soft Skills
- Strong problem-solving and analytical thinking
- Exceptional communication and stakeholder management skills
- Able to thrive in agile and fast-paced environments

About the job
Join gsstech-group as a Senior Data Engineer within our dynamic Data Engineering chapter in the Technology Platform domain. We seek a talented individual with a robust background in real-time data streaming, contemporary data platforms, and the operationalization of AI/ML technologies (AIOps, MLOps, LLMOps).
In this pivotal role, you will design, build, and optimize scalable data pipelines that empower advanced analytics and AI-driven solutions.