About the job
At Roblox, we pride ourselves on being a platform where tens of millions of users come together to explore, create, and connect in immersive 3D experiences crafted by our talented global community of developers and creators.
Our mission is to empower our community with the tools and platform to bring their imaginative ideas to life. We strive to redefine social interaction, enabling connections across the globe, regardless of device. We are on a journey to unite a billion people with positivity and respect, and we need exceptional talent like you to help us achieve this goal.
Joining Roblox means you will be at the forefront of transforming how people interact, tackling unique technical challenges at scale, and contributing to creating safer, more respectful shared experiences for all.
As a Lead Senior Data Center Engineer, you will play a pivotal role in scaling our Core and Edge Data Centers and enhancing our hardware infrastructure during a period of remarkable growth for our business. At Roblox, you will have unlimited potential to influence the future of our Imagination Platform™ while demonstrating your commitment to delivering innovative solutions to a global audience. If you possess the expertise to develop and manage hardware infrastructure capable of supporting millions of concurrent players year-round, and if you are as passionate about play as we are, you will seamlessly integrate into our highly skilled and expanding engineering team. You will report directly to the Technical Lead Data Center Engineer.
Your Responsibilities:
- Design and maintain the Core and Edge Data Center and hardware infrastructure to accommodate the extensive scale and real-time demands of our Imagination Platform™, ensuring our community enjoys an exceptional experience globally. This encompasses all facets of server, network infrastructure, power, and environmental life cycles.
- Lead initiatives to monitor and resolve systemic issues hindering hosts from returning to service.
- Diagnose and resolve critical issues while preventing recurrence through root cause analysis and providing recommendations for enhanced automation.
- Collaborate with colleagues to establish and maintain best practices in break-fix, installation, decommissioning, and all aspects of data center operations.
- Develop, influence, and refine the operational processes to ensure optimal performance and resilience of our infrastructure.

