About the job
ABOUT VETERINARY EMERGENCY GROUP
Founded in 2014, the Veterinary Emergency Group (VEG) is dedicated to transforming the emergency care experience for pets and their owners. With a vision to redefine norms and improve the ER experience, we have rapidly expanded our network of hospitals that operate 24/7/365 across the nation. Our commitment to understanding the needs of pets and their families drives our continuous innovation. We prioritize not only the wellbeing of our patients but also of our team members (VEGgies), empowering them to achieve greatness and fostering a culture of growth and belonging.
At VEG, we are reimagining emergency care in every aspect—from hospital operations to the support systems for our teams. Our headquarters team is pivotal in this transformation, whether it's through developing innovative technology to enhance hospital efficiency, recruiting exceptional talent, or effectively showcasing our brand through marketing. Our headquarters team ensures that our hospitals are equipped with the necessary resources to deliver outstanding care to pets and their families.
VEG has been recognized as a Great Place to Work® for 2025 and 2026.
THE ROLE
We are seeking a Senior Site Reliability Engineer who recognizes the critical importance of reliability at VEG; our proprietary platform, DogByte, is essential to the survival of pets. As the primary architect of our platform's resilience, you will engineer our infrastructure to be self-healing, enabling our medical teams to provide life-saving care around the clock. Your role will be a blend of high-level architectural strategy and hands-on technical execution, ensuring our engineering teams can rapidly develop while maintaining a solid foundation.
Your efforts will focus on evolving and enhancing existing systems to support VEG’s hospital expansion, ensuring that our infrastructure is never a limiting factor in our ability to open new hospitals or deliver medical care. You will take ownership of DogByte's ongoing stability, scaling it into a robust enterprise platform where individual hospital traffic is isolated to prevent impact on others.
This position offers the flexibility to work at our headquarters in White Plains or remotely.
KEY RESPONSIBILITIES
- Develop short- and long-term strategies to ensure DogByte can handle increasing volume year-over-year, particularly addressing traffic isolation between hospitals.
- Collaborate with engineering teams to ensure that data flows—from client to API to database—are optimized for high availability and performance.

