About the job
Join Thumbtack and Empower Homeowners!
At Thumbtack, we are dedicated to helping millions of individuals confidently manage and improve their homes. Our comprehensive app serves as the ultimate resource for personalized guidance, AI tools, and an unparalleled hiring experience. Every day, across every county in the U.S., homeowners rely on Thumbtack for urgent repairs, seasonal maintenance, and significant renovations. We empower homeowners to determine which projects to undertake, when to tackle them, and how to connect with our growing network of 300,000 local service businesses. If you are motivated by the prospect of making a meaningful impact, we invite you to join us and envision what we can achieve together.
About the Data Engineering Team
Our Data Engineering team at Thumbtack is a pivotal, centralized group that collaborates closely with engineers, analysts, data scientists, and machine learning engineers to design and curate data sets from both internal and external sources. We are committed to meeting current and future data needs and will continue building a cohesive data warehouse while integrating data best practices across the entire software development lifecycle (SDLC).
The Challenge
With terabytes of data and unique challenges across Thumbtack's teams, this role is essential for cleaning and organizing data and for measuring performance metrics. You will collaborate with engineers, data scientists, managers, and others to understand their data needs and create datasets that address these challenges effectively.
What You'll Do
- Collaboratively refine and promote a comprehensive framework for integrating data-driven thinking into the software development lifecycle for product teams.
- Design, architect, and maintain essential marketplace datasets, data marts, and feature stores that support a combination of established products and rapidly evolving features, in partnership with analytics, data science, and machine learning teams.
- Work alongside product engineers, analysts, data scientists, and machine learning engineers across Thumbtack to understand their data requirements and help design datasets with the same engineering rigor as other software projects.
- Champion data quality and best practices across diverse business areas.
- Contribute to the development of the next generation of data products at Thumbtack, utilizing real-time data solutions built on Apache Kafka.

