About the job
About ClickHouse
Featured in the prestigious 2025 Forbes Cloud 100 list, ClickHouse stands out as a pioneering and rapidly expanding private cloud company. With a robust customer base exceeding 3,000 and an impressive Annual Recurring Revenue (ARR) growth of over 250% year-on-year, ClickHouse is at the forefront of real-time analytics, data warehousing, observability, and AI workloads.
The company’s remarkable progression was recently affirmed through a $400M Series D funding round. In just the past three months, notable clients like Capital One, Lovable, Decagon, Polymarket, and Airwallex have either embraced our platform or enhanced their existing deployments. These organizations join a distinguished roster of AI trailblazers and global leaders, including Meta, Cursor, Sony, and Tesla.
Join us as we revolutionize the way businesses harness the power of data!
Note: This position is available for remote work in any country where ClickHouse has a hiring presence.
About the Team
The ClickPipes - Database Integrations team builds the platform that replicates data from databases to ClickHouse in real time at petabyte scale.
As part of this team, you will tackle intricate database and distributed-systems challenges, including optimizing snapshot strategies by understanding database internals, managing schema evolution during live replication, ensuring data type compatibility across various systems, maintaining low end-to-end latency under unpredictable loads, and using durable execution frameworks to guarantee data consistency over unreliable networks. Our work is transparent: our database integrations are built on PeerDB, an open-source Change Data Capture (CDC) platform that we actively support and enhance.
Explore some of our recent achievements:
- ClickPipes for Postgres now supports failover replication slots
- MongoDB CDC to ClickHouse with Native JSON Support
- Under the Hood: Building MySQL Change Data Capture in ClickPipes
Your Responsibilities:
Develop data-centric systems
- Design and implement high-throughput integrations with various databases (Postgres, MySQL, MongoDB), data lakes (Iceberg, Delta Lake), and data warehouses (BigQuery, Snowflake).
