About the job
Cloudera Data Engineer
As a Cloudera Data Engineer, you will play a pivotal role in designing, developing, and maintaining robust data pipelines and platforms, with a primary focus on the Cloudera Hadoop ecosystem. You will collaborate closely with data analysts, scientists, and business stakeholders to guarantee data accessibility, integrity, and security.
Your Responsibilities:
- Design, build, and manage the Cloudera Hadoop Distribution (CDH/CDP) environment.
- Develop and maintain ETL pipelines using tools such as Apache NiFi, Hive, Spark, and Impala.
- Optimize and manage HDFS, YARN, Kafka, HBase, and Oozie workflows.
- Monitor and troubleshoot cluster performance, utilizing strong problem-solving and debugging skills.
- Work in collaboration with DevOps and Data Science teams to integrate data platforms into applications and analytics workflows.
- Ensure data governance, security, and compliance by leveraging tools such as Apache Ranger, Atlas, and Kerberos.
- Mentor and provide guidance to a team of data engineers to deliver comprehensive data solutions.

