About Crunchyroll
Founded by anime enthusiasts, Crunchyroll is dedicated to delivering the rich art and culture of anime to a vibrant community. We cater to over 100 million dedicated anime and manga fans across more than 200 countries and territories, helping them connect with the captivating stories and beloved characters they cherish. Whether through online streaming, theatrical releases, gaming, merchandise, or live events, we are driven by our passion for anime content.
Join our dynamic team and help us shape the future of anime!
Crunchyroll, LLC operates as a joint venture between US-based Sony Pictures Entertainment and Japan's Aniplex, a subsidiary of Sony Music Entertainment (Japan). Both are part of Tokyo-based Sony Group Corporation.
About the Role
We are assembling a new Partner Reliability Engineering (PRE) team dedicated to proactively ensuring the health and quality of our living room device ecosystem and integrated partner payment systems. Our mission is to identify and rectify issues before they impact our customers by leveraging data, automation, and close collaboration with internal teams and global partners.
This position is perfect for a multifaceted engineer who thrives in operational settings, enjoys diving into data, and is passionate about creating tools that enhance system observability and reliability. You will operate at the intersection of site reliability engineering, analytics, automation, and external partner support, ensuring our users enjoy a flawless experience every time they hit "play."
What You Will Be Doing
- Lead incident response, on-call support, and mitigation for device and payment-related issues in production environments.
- Develop and enhance monitoring, alerting, and triage tools to minimize detection and resolution times.
- Analyze data to detect anomalies, establish performance baselines, and uncover patterns that point to quality improvements.
- Create automations, internal tools, and efficient data pipelines to support operational workflows and observability.
- Collaborate with engineering, product, and analytics teams to implement systemic fixes and learn from postmortems.
- Engage directly with external partners (e.g., Smart TV and device manufacturers, ISPs, payment providers) to investigate and resolve ecosystem issues.
- Communicate effectively with both technical and non-technical stakeholders.

