About the job
The Role
Join Pave Bank, where we are pioneering the future of programmable banking by merging traditional banking services with digital assets on a single, regulated platform. We are seeking a dynamic Site Reliability Engineer (SRE) to play a critical role in ensuring our core systems are consistently available, scalable, and high-performing as we expand.
As a Site Reliability Engineer at Pave Bank, you will collaborate closely with our Engineering, Product, Security, and Operations teams to develop robust infrastructure, automate operational tasks, and uphold reliability across all services. Your contributions will significantly influence the safety, performance, and scalability of our banking platform, enabling customers to place their trust in Pave Bank for their financial needs.
Key Responsibilities
Oversee, maintain, and enhance the reliability, availability, and performance of our production systems and services.
Design and maintain infrastructure as code (IaC), deployment pipelines, and automation that support continuous delivery, scalability, and disaster recovery.
Respond to incidents, conduct root-cause analyses, and lead postmortems so that lessons learned turn into concrete follow-up actions.
Establish and uphold operational best practices including observability, logging, metrics, alerting, capacity planning, failover strategies, and backups.
Collaborate with Engineering, Product, Compliance, and Operations teams to ensure that our infrastructure aligns with reliability, compliance, and security standards.
Support service scaling, database operations, cloud infrastructure (preferably GCP), networking, and microservices orchestration.
Document operational runbooks, on-call procedures, and system architecture to support maintenance, knowledge sharing, and compliance.
Qualifications
Technical Skills and Experience
Proficiency in a programming or scripting language such as Go, Python, or Bash for automation and tooling.
Hands-on experience with cloud infrastructure, preferably Google Cloud Platform (GCP).
Familiarity with containerization and orchestration technologies (Docker, Kubernetes, etc.).
Experience with infrastructure-as-code tools (Terraform, Cloud Deployment Manager, etc.).

