companyRealtor.com logo

Staff Site Reliability Engineer

Realtor.comAustin, Texas, United States
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

To excel in this role, you should have a robust background in Site Reliability Engineering, with extensive experience in AWS environments, particularly in EKS and Fargate. A proficiency in CI/CD practices and a deep understanding of API gateways and GraphQL is essential. You should also demonstrate leadership capabilities and a passion for mentoring others in the engineering field.

About the job

For over 25 years, Realtor.com® has stood as the premier online platform trusted by real estate professionals, seamlessly connecting buyers, sellers, and renters with invaluable insights and expert advice to discover their ideal home. Our comprehensive suite of tools not only transforms the real estate landscape, but also aids consumers in navigating one of life's most significant decisions—making it simple, intuitive, and empowering.

Join us in our mission to enable more individuals to find their way home by dismantling barriers, fostering meaningful connections, and instilling confidence with expert guidance.

About the Role

We are on the lookout for a Staff Site Reliability Engineer to become a vital member of our newly established Operations Excellence organization, reporting directly to the Director of Operations Excellence. This pivotal position will define the reliability, observability, and operational excellence of our platform infrastructure that serves millions of users. As a Staff SRE, you will take on a technical leadership role, mentoring others and establishing best practices, while influencing architectural decisions to empower our team of 600+ engineers in delivering outstanding customer experiences.

You will engage with crucial platform systems, including EKS infrastructure, Skyway (CI/CD), Frontdoor (Tyk API Gateway), Pantheon (Apollo GraphQL Federation), and our observability stack, all while implementing chaos engineering practices and spearheading cost optimization initiatives that yield measurable ROI.

We are committed to employing the best tools to expedite problem-solving. You will be expected to adeptly utilize AI coding assistants and LLMs to enhance development speed, generate boilerplate code, and troubleshoot intricate debugging scenarios. In addition to basic usage, this role demands the critical judgment to assess AI-generated outputs for security, performance, and accuracy. You should be comfortable incorporating AI tools into your daily tasks to minimize repetitive work, allowing you to concentrate on high-impact architectural and strategic engineering challenges.

What You'll Do

Platform Reliability & Infrastructure

  • Design and maintain highly available AWS infrastructure, including EKS clusters, Fargate (ECS), and multi-region architectures.
  • Take ownership of the reliability of essential services: Skyway (CI/CD), Frontdoor (Tyk), Pantheon (Apollo GraphQL), and associated infrastructure.
  • Establish SLIs, SLOs, and error budgets for Tier 1/2/3 systems; lead architectural reviews focused on reliability and cost-efficiency.
  • Drive...

About Realtor.com

Realtor.com® is the leading online destination for real estate professionals, offering an array of tools and resources that empower users to navigate the real estate market with confidence. Our commitment to innovation and user experience has made us a trusted partner for millions of buyers, sellers, and renters across the nation.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.