companyCrunchyroll, LLC logo

Senior Reliability Engineer

Crunchyroll, LLCMexico City, Mexico City, Mexico
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

QualificationsBachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience. Minimum of 8 years of professional experience in reliability engineering or a similar field. Strong analytical and problem-solving skills. Experience with monitoring and alerting systems. Proficient in data analysis and visualization tools. Excellent communication skills, both written and verbal.

About the job

About Crunchyroll

Founded by anime enthusiasts, Crunchyroll is dedicated to delivering the rich art and culture of anime to a vibrant community. We cater to over 100 million dedicated anime and manga fans across more than 200 countries and territories, helping them connect with the captivating stories and beloved characters they cherish. Whether through online streaming, theatrical releases, gaming, merchandise, or live events, we are driven by our passion for anime content.

Join our dynamic team and help us shape the future of anime!

Crunchyroll, LLC operates as a joint venture between Sony Pictures Entertainment in the US and Aniplex, a subsidiary of Sony Music Entertainment in Japan, both entities under the umbrella of Sony Group Corporation based in Tokyo.

About the Role

We are assembling a new Partner Reliability Engineering (PRE) team dedicated to proactively ensuring the health and quality of our living room device ecosystem and integrated partner payment systems. Our mission is to identify and rectify issues before they impact our customers by leveraging data, automation, and close collaboration with internal teams and global partners.

This position is perfect for a multifaceted engineer who thrives in operational settings, enjoys diving into data, and is passionate about creating tools that enhance system observability and reliability. You will operate at the intersection of site reliability engineering, analytics, automation, and external partner support, ensuring our users enjoy a flawless experience every time they hit "play."

What You Will Be Doing

  • Lead incident response, on-call support, and mitigation for device and payment-related issues in production environments.
  • Develop and enhance monitoring, alerting, and triage tools to maximize detection and resolution times.
  • Analyze data to detect anomalies, establish performance baselines, and uncover patterns that enhance quality.
  • Create automations, internal tools, and efficient data pipelines to support operational workflows and observability.
  • Collaborate with engineering, product, and analytics teams to implement systemic fixes and learn from postmortems.
  • Engage directly with external partners (e.g., Smart TV and device manufacturers, ISPs, payment providers) to investigate and resolve ecosystem issues.
  • Effectively communicate with both technical and non-technical stakeholders.

About Crunchyroll, LLC

Crunchyroll is a pioneering platform that connects anime fans around the globe. With a robust portfolio that includes streaming, theatrical releases, and merchandise, we are committed to enhancing the anime experience for our community.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.