About the job
Join our dynamic team as an AWS DevOps Engineer where you'll tackle intricate, real-world infrastructure challenges at scale.
At bystadium, we are dedicated to creating innovative products that foster connection and delight for organizations across the globe. Our platform seamlessly integrates creativity, data, and technology to enhance every interaction, offering solutions from rewards and recognition to personalized gifting and brand experiences. We expertly manage fluctuating traffic demands, processing hundreds to hundreds of thousands of requests per minute, making this role essential for ensuring our systems remain fast, reliable, and secure.
In this position, you will have complete ownership of our cloud infrastructure, overseeing everything from architecture and automation to deployment, monitoring, security, performance, scalability, and cost efficiency. Collaboration with software engineers is key as you define infrastructure standards and deployment practices. We expect you to navigate ambiguous challenges, question assumptions, and craft practical, production-ready solutions.
Your Responsibilities
You will be tasked with:
- Designing, constructing, and managing secure, scalable, and highly available AWS infrastructure.
- Architecting systems capable of accommodating sudden and significant traffic increases.
- Taking ownership of architectural decisions, trade-offs, and outcomes in production settings.
- Ensuring fault tolerance, high availability, and disaster recovery.
- Constructing and maintaining infrastructure via Infrastructure as Code (Terraform).
- Automating provisioning, scaling, recovery, and operational workflows.
- Designing and upholding robust CI/CD pipelines for safe, repeatable, and zero-downtime deployments.
- Collaborating with engineering teams to enhance infrastructure and deployment standards.
- Implementing monitoring, alerting, and observability for proactive issue detection.
- Leading incident response, root cause analysis, and long-term system enhancements.
- Managing cloud security posture and enforcing best practices.
- Monitoring and optimizing cloud costs while ensuring performance and scalability.

