About the job
Summary
At Storable, we are dedicated to revolutionizing the storage industry. Our advanced platform empowers businesses to efficiently manage, track, and expand their self-storage operations. We are in search of a talented Data Engineering Manager to lead our data-centric team. Storable is committed to harnessing the latest technologies to enhance data efficiency, accessibility, and insights, enabling our team to make informed decisions that drive growth.
As the Data Engineering Manager, you will be instrumental in directing and refining our data operations, ensuring that data is well-organized, accessible, and effectively managed throughout the organization. You will lead a skilled team, collaborate with cross-functional departments, and spearhead strategies to improve data quality, availability, and security.
Key Responsibilities:
- Lead Data Management Strategy: Develop and implement the data management vision, strategy, and best practices, ensuring they align with Storable's business objectives.
- Oversee Data Pipelines: Design, implement, and maintain scalable data pipelines with industry-standard tools to efficiently process and manage large-scale datasets.
- Ensure Data Quality & Governance: Establish data governance policies and frameworks to maintain data accuracy, consistency, and compliance across the organization.
- ETL Development: Construct, optimize, and manage ETL pipelines for ingesting, transforming, and delivering extensive datasets from diverse sources.
- Workflow Orchestration: Manage and schedule complex workflows utilizing Apache Airflow.
- Query Engines & Processing Frameworks: Utilize Trino (Presto), Apache Spark, and other tools.
- Manage Cross-Functional Collaboration: Work alongside engineering, product, and business teams to ensure data is accessible and actionable, driving informed decision-making.
- Optimize Data Infrastructure: Employ modern data tools and platforms (e.g., AWS, Apache Airflow, Apache Iceberg) to establish an efficient, reliable, and scalable data infrastructure.
- Monitor & Improve Performance: Actively oversee data processes and workflows, troubleshoot issues, and refine performance to ensure high reliability.

