companyElliptic logo

Senior Data Reliability Engineer at Elliptic | London

EllipticLondon, United Kingdom
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Candidates should possess a strong background in site reliability engineering and data reliability, with experience in developing frameworks for data quality and observability. Familiarity with incident command processes and monitoring tools is essential.

About the job

What You'll Accomplish:

As a Senior Data Reliability Engineer, you will spearhead the integration of Site Reliability Engineering (SRE) across all engineering practices. Your leadership will ensure that every engineer and team is dedicated to crafting software that is not only resilient but also exceptionally reliable. You will collaborate with a diverse, cross-functional team of subject matter experts and on-call engineers, focused on maintaining high performance of our platform around the clock.

Overseeing a comprehensive suite of products, you will be responsible for the reliability of enterprise-grade applications that process thousands of queries per second. Elliptic is acclaimed for its extensive and dependable datasets, and your role will be pivotal in establishing a market-leading infrastructure for data quality and governance. This involves creating the processes, culture, and frameworks that will enhance observability, data quality, lineage, and remediation, forming a crucial backbone of our data and intelligence platform.

Your Responsibilities:

This role spans multiple teams, and you will receive full support from leadership and engineering while showcasing exemplary standards. Your main tasks will include:

  • Promote the principles of SRE and DRE throughout the engineering teams.

  • Lead the development of a data quality framework that assures our clients of the accuracy of our data and supports marketing and revenue initiatives.

  • Define and manage the on-call process within the SRE function:

    • Quickly gain an in-depth understanding of our systems.

    • Lead incident management.

    • Conduct post-incident reviews.

    • Ensure timely completion of follow-up actions.

    • Assess and enhance our existing end-to-end on-call processes.

  • Participate in the on-call rotation, approximately every 4 to 5 weeks, ensuring 24/7 coverage.

  • Evaluate, manage, and improve our current monitoring, alerting, paging, and documentation solutions.

  • Provide reports on system uptime, availability, and performance across our product range.

  • Draft post-mortem reports for both internal and external stakeholders.

  • Represent the SRE and DRE functions during discussions with top-tier enterprise financial institutions.

About Elliptic

Elliptic is a leader in cryptocurrency intelligence, providing insights and analytics to help businesses and financial institutions navigate the complexities of the crypto landscape. Committed to quality and reliability, we offer a robust platform empowered by vast datasets.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.