About the job
About Carvana
At Carvana, we're revolutionizing the automotive retail experience. With a bold vision and an innovative approach, we aim to make buying and selling cars enjoyable, swift, and equitable. As the fastest-growing automotive retailer in history, we have expanded our operations nationally, gone public on the New York Stock Exchange, celebrated the sale of our 1 millionth vehicle, and entered the Fortune 500 list, all in just eight years.
Currently serving over 4 million retail customers, Carvana stands as the fastest-growing and most profitable public automotive retailer, with even greater ambitions ahead in the largest consumer sector.
Joining our team means being part of a dynamic environment that welcomes change, encourages creative problem-solving, and continually strives for improvement. At Carvana, you’ll face meaningful challenges, gain rapid learning experiences, and contribute to shaping the future of automotive retail. If you’re eager to grow and make a difference within a collaborative team, you will thrive here. Hear what it's like to be part of our team from the people who live it every day.
This is a 100% on-site position (Thursday through Monday).
About the Team and Position
We pride ourselves on being approachable and always willing to go the extra mile for our Carvana family. Whether it’s connecting a monitor or fine-tuning a flux capacitor to exactly 1.21 gigawatts, we expect intelligent, proactive individuals who bring innovative ideas and can handle multiple assignments. In return for your commitment, you’ll have the chance to work at one of the most rapidly growing and creatively driven tech companies, while contributing to the promotion of a life-changing product and the development of a world-class team every day.
What You’ll Be Doing
- Monitor and analyze existing systems and services
- Execute runbook functions to resolve alerts within defined SLAs
- Escalate alerts to designated on-call teams when the NOC is unable to resolve them
- Maintain services post-launch by responding swiftly to incidents
- Measure and monitor availability, latency, and overall system health
- Identify repetitive tasks for automation to support ongoing work intake
- Develop proficiency in utilizing monitoring and automation tools

