Senior Site Reliability Engineer at Intuition Machines | Remote
Intuition Machines, Inc.
Full-time|Remote|Remote — São Paulo, State of São Paulo, Brazil At Intuition Machines, we harness the power of AI and machine learning to develop cutting-edge enterprise security solutions. Our innovative approach is evident in our flagship product, the hCaptcha security suite, which serves hundreds of millions of users globally. With a talented, geographically diverse team, we prioritize low overhead, small teams, and rapid iteration to deliver impactful results.As a Senior Site Reliability Engineer, you will be instrumental in engineering robust solutions that enhance system performance, availability, security, and cost-effectiveness. These non-functional attributes are not just goals; they are essential to our mission and our customers' satisfaction. You will engage with various layers of our extensive internet-scale architecture, including infrastructure, data, and application logic, and lead the development of effective solutions.Your Responsibilities:Engage with large-scale systems that handle millions of requests per second, providing seamless service to millions of users across diverse cloud platforms.Innovate solutions aimed at optimizing performance, availability, security, and cost-efficiency.Ensure high uptime and speed, while enhancing the productivity of our development teams through continuous improvement of system performance, quality, security, and customer engagement metrics.Rapidly source and assess improvement opportunities based on customer feedback, internal insights, and system metrics.Foster a creative environment where your contributions directly enhance customer value and experience.Qualifications:Proficiency in Kubernetes, with a strong focus on managing and optimizing containerized applications.Extensive experience in monitoring applications, infrastructure, and network environments.Solid background in software engineering with a focus on backend development in Kubernetes-based systems.Strong programming skills in languages such as Python, JavaScript, Go, C++, or Rust.Comprehensive understanding of networking concepts, proxies, and content delivery networks (e.g., Cloudflare).Experience with multi-cloud environments, including virtual networking, load balancing, and web application firewalls.Strong familiarity with CI/CD methodologies.Hands-on experience in developing and orchestrating high-scale, high-availability systems.A minimum of six years of practical experience in engineering, DevOps, or Site Reliability Engineering roles.Knowledge of distributed systems, including queue-first architectures and sharding principles.Demonstrated engineering acumen, encompassing requirement gathering, problem-solving, and effective decision-making.Preferred: Knowledge of security frameworks, attack vectors, botnets, and impact analysis. What We Offer:A fully remote position with flexible working hours.Collaboration with an inspiring, global team.Modern development workflows that promote frequent shipping of code.High impact: engage in significant projects that shape our products and services.
Nov 8, 2024