Join the forefront of online privacy with the world’s most advanced VPN.Are you a proactive problem-solver who thrives in dynamic environments? Become a vital member of the team that developed Threat Protection Pro, the NordLynx protocol, and the fastest VPN worldwide—innovations that empower individuals with privacy, security, and control over their digital lives.Your Contribution: Enable millions to regain their online security, privacy, and data management.At NordVPN, we safeguard millions of users every day through an expansive global edge infrastructure comprising thousands of servers across numerous countries. The platform engineering team is responsible for building and maintaining the internal backend services that facilitate this protection.We are seeking a Staff Site Reliability Engineer (SRE) to design, build, and enhance these critical systems. This role demands a high level of ownership—you will architect solutions and deploy them to production. Colleagues will depend on you when it comes to rethinking architecture or scaling services from inception to global deployment.Key ResponsibilitiesDesign and manage on-demand, globally distributed backend services.Make strategic architectural decisions regarding the integration and scalability of internal services.Oversee the entire lifecycle: planning, implementation, monitoring, incident response, and postmortems.Enhance infrastructure tooling and automation processes.Contribute to our engineering standards, documentation, and operational maturity.Assess and incorporate AI tools (including LLMs, Claude Code, and model integrations) into engineering workflows.Essential QualificationsExperience in designing and operating globally distributed systems.Proficiency in systems architecture, service communication, data flow, and resilience patterns.Extensive Linux administration experience at scale (including systemd, kernel tuning, and debugging production systems).Expertise in Docker—building, shipping, and running containers in production environments.Familiarity with databases such as PostgreSQL, MySQL, Redis, OpenSearch, and VictoriaMetrics.Experience with web servers, load balancing, and failover mechanisms (e.g., Nginx, HAProxy).
Apr 12, 2026