Qualifications
Key Responsibilities:
Design and implement scalable data pipelines that manage unstructured data formats using Snowflake and GCP.
Utilize Snowflake’s unstructured data capabilities (Directory Tables, Scoped URLs, Snowpark) to make 'dark data' accessible and actionable.
Develop and oversee cloud-native ETL/ELT processes with BigQuery, Cloud Storage, and Dataflow, ensuring smooth integration between GCP and Snowflake.
Incorporate AI tools (OCR, NLP entity extraction, Document AI) into the engineering workflow to transform unstructured data into structured insights.
Optimize complex SQL queries and Python-based processing jobs to run efficiently in petabyte-scale environments.
About the job
Join Tiger Analytics, a dynamic and rapidly expanding advanced analytics consulting firm recognized for its expertise in Data Science, Machine Learning, and Artificial Intelligence. We partner with leading Fortune 100 companies to unlock the potential of their data, driving substantial business value. Acknowledged by Forrester and Gartner for our leadership in the analytics space, we are on the lookout for exceptional talent to help build the world’s premier analytics consulting team.
We are currently seeking a skilled Data Engineer proficient in Dataiku to join our innovative data team. In this role, you will design, construct, and maintain data pipelines, integration processes, and infrastructure. Working closely with data scientists, analysts, and other key stakeholders, you will facilitate efficient data flow and empower data-driven decision-making throughout the organization.
About Tiger Analytics
Tiger Analytics is a leader in advanced analytics consulting, providing deep expertise in Data Science, Machine Learning, and AI. We are committed to helping organizations harness the power of their data to drive strategic business outcomes.