About the job
Company Background
Censys is dedicated to building the most comprehensive and reliable map of the Internet. Our mission is to empower users with real-time Internet intelligence and actionable threat insights, catering to global governments, over 50% of the Fortune 500, and leading threat intelligence providers worldwide.
Location
This is a fully remote position within the United States.
Role Summary
As a Senior Site Reliability Engineer (SRE) on the Infrastructure and Operations team, you will play a crucial role in designing, building, and deploying tools that enhance the efficiency of our development teams and production applications. We are seeking skilled engineers who are passionate about cloud-native technologies and committed to improving our microservice architecture's reliability and operational maturity.
Focusing on Developer Efficiency and Experience, you will help streamline engineering workflows, support our Software Development Life Cycle (SDLC), and empower developers to confidently build, deploy, and manage their services within the platform.
What You'll Do
- Develop and maintain tools to support applications running on Kubernetes and Google Cloud Platform.
- Collaborate with development teams to facilitate the building, shipping, and deploying of services and applications, ensuring resilience and reliability.
- Monitor and ensure the smooth operation of our production environments, assisting developers in debugging complex issues and capturing the four golden signals of performance.
- Contribute to the creation of a self-service platform that accelerates developer velocity, including service catalogs, repository tooling, and comprehensive documentation.
- Participate in a shared on-call rotation, embracing end-to-end service ownership alongside development teams.
