Qualifications
Proven experience in Linux system administration and network troubleshooting. Strong background in automation and scripting (e.g., Python, Bash). Experience with cloud infrastructure and services. Ability to work collaboratively in a fast-paced environment. Excellent problem-solving skills and attention to detail.
About the job
As a Staff Site Reliability Engineer at Zscaler, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based security services. You will engage in troubleshooting complex Linux and network issues while implementing automation solutions to enhance operational efficiency.
Your expertise will contribute to our mission of delivering unparalleled security solutions to our clients.
About Zscaler
Zscaler is a leading cloud security company that enables organizations to securely transform their networks and applications. Our innovative solutions are trusted by thousands of customers globally. Join us to be part of a pioneering team that is redefining the future of security.
Sumo Logic seeks a Staff Site Reliability Engineer based in Bangalore, Karnataka, India. The main focus of this position is to maintain and enhance the reliability and performance of company systems. Collaboration with development teams is central, especially when resolving operational issues and building solutions that keep systems stable.
Key Responsibilities
Partner with engineers to boost system reliability and maximize uptime. Create and improve monitoring and automation tools to support operational goals. Diagnose and resolve operational challenges as they occur. Contribute to optimizing performance throughout the infrastructure.
Role Overview
Black Duck Software is looking for a Senior Site Reliability Engineer in Bangalore. This role focuses on maintaining the reliability, availability, and performance of our systems. Collaboration with development teams is central to the work, with an emphasis on building and supporting scalable infrastructure.
What You Will Do
Work with developers to design, implement, and maintain scalable systems. Troubleshoot production issues and identify long-term solutions. Strengthen the resilience of our platform through process and technical improvements. Promote a culture of continuous improvement across teams.
Roles and Responsibilities
Guarantee the reliability, availability, and optimal performance of our systems and services. Automate and optimize operations and processes for greater efficiency. Continuously monitor system health, identify bottlenecks, and proactively resolve potential issues. Collaborate with development teams to enhance system architecture and performance. Conduct thorough post-incident reviews and implement necessary improvements. Develop and maintain infrastructure as code using industry-standard tools like Terraform and Ansible.
About the Role: Production Engineer
The Production Engineer at Rubrik is essential for achieving operational excellence. This position involves managing alerts, addressing outages, and leading incident resolution as an Incident Manager. The ideal candidate will possess hands-on experience in maintaining highly available critical services across multi-cloud environments while continuously enhancing processes through automation and intelligent monitoring.
What You’ll Do:
Become a vital part of a 24/7 Production Operations team dedicated to managing and supporting critical infrastructure and services in multi-cloud environments. Supervise staging and production environments to ensure optimal uptime and reliability. Implement and uphold comprehensive observability solutions for real-time monitoring, alerting, and metrics collection. Lead incident management initiatives by promptly responding to alerts and outages, coordinating teams for timely resolutions. Investigate recurring incidents to identify root causes, minimize toil, and enhance system resilience. Design and develop automation tools to proactively detect, triage, and remediate production issues. Maintain and update runbooks to facilitate incident response and address recurring issues. Exhibit strong decision-making skills under pressure, effectively managing critical situations with urgency and composure.
Veeam is a leading provider of data and AI solutions, dedicated to helping organizations protect and manage their data effectively. Recognized as a pioneer in data resilience and security posture management, we empower businesses to navigate the complexities of identity, data, security, and AI risk. With our headquarters in Seattle and operations in over 30 countries, Veeam proudly safeguards the operations of more than 550,000 customers globally. Join our dynamic team and be part of a transformative journey as we advance together, fostering growth, learning, and making a significant impact for renowned brands around the world.
About the Role
As a Staff Site Reliability Engineer, you will take on a pivotal role as a hands-on technical leader within our Site Reliability Engineering (SRE) team. Your expertise will guide senior engineers, influence product development efforts, and ensure our systems are constructed to be reliable, scalable, and observable from the ground up. You will spearhead strategic initiatives, mentor peers in SRE practices, and help define architectural best practices across our platform. This role is crucial for aligning teams, enforcing high standards, and scaling SRE principles globally at Veeam.
What You’ll Do
Reliability Engineering & Resilience:
Serve as a technical authority, mentoring senior engineers and guiding design decisions to enhance service reliability and resilience. Lead the establishment and enforcement of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets; ensure adherence across engineering teams. Collaborate with fellow staff members across teams to unify strategy and promote shared reliability standards and objectives. Engage with development and product teams to proactively design for failure, construct resilient architectures, and operationalize reliability from inception.
Observability & Operational Excellence:
Promote the organization-wide adoption of observability best practices and tools. Ensure that metrics, logs, and traces yield deep, actionable insights throughout systems. Lead complex incident responses, conduct postmortems, and drive systemic reliability enhancements. Encourage and uphold a blameless culture of learning and continuous improvement.
About the Role: Production Engineer
The Production Engineer at Rubrik is pivotal in ensuring operational excellence, managing alerts, addressing outages, and spearheading incident resolution as an Incident Manager. This position demands hands-on expertise in maintaining highly available critical services across multi-cloud environments while fostering continuous improvements through automation and intelligent monitoring.
What You Will Do:
Become a key member of a 24/7 Production Operations team dedicated to managing and supporting vital infrastructure and services across multi-cloud environments. Supervise staging and production environments to guarantee maximum uptime and reliability. Deploy and maintain comprehensive observability solutions for real-time monitoring, alerting, and metrics collection. Lead incident management initiatives by promptly responding to alerts and outages, coordinating teams for swift resolution. Investigate recurring incidents to identify root causes, mitigate toil, and enhance system resilience. Design and develop automation tools to proactively detect, triage, and rectify production issues. Update and maintain runbooks to facilitate incident response and address recurring issues. Exhibit strong decision-making abilities under pressure, managing critical situations with urgency and composure.
Join the UiPath Team
The team at UiPath is passionate about harnessing the transformative potential of automation to redefine the way the world operates. We are dedicated to developing industry-leading enterprise software that empowers organizations. To realize this vision, we seek individuals who are inquisitive, motivated, generous, and authentic. We value those who thrive in a dynamic, fast-paced environment and who genuinely care about their colleagues, the mission of UiPath, and the broader impact of our work. Are you ready to make a difference?
Your Role
As a Principal Site Reliability Engineer at UiPath, you will play a pivotal role in enhancing the reliability of our expansive, cloud-native systems. This position requires a comprehensive understanding of the full reliability spectrum, going beyond any single domain. You will define and drive the architecture, scalability, measurement, and automation of reliability across our systems. This role focuses on shaping the reliability practices at UiPath rather than merely reacting to outages or coding. You will collaborate with engineering and platform teams to integrate reliability into our systems, workflows, and organizational culture. Your contributions will elevate our standards for monitoring, automation, and ensuring our systems can withstand real-world loads and failures. You will take ownership of service reliability, observability, automation, and continuous improvement initiatives, partnering with teams in Romania and India as necessary.
Your Responsibilities at UiPath
Comprehensive Reliability Ownership: Develop and refine the reliability strategy for our distributed systems, ensuring a balance of availability, performance, velocity, and cost through well-defined SLIs/SLOs and error budgets.
Incident Management & Operational Excellence: Lead and actively participate in high-severity incidents, driving structured troubleshooting in uncertain situations and ensuring sustainable systemic enhancements.
Observability & Operational Insights: Advocate for robust observability practices to make service health and performance risks visible and actionable.
Automation, Tooling & Engineering Discipline: Automate manual operational tasks through effective tooling and self-service options while applying disciplined engineering methodologies.
Infrastructure, Cloud & IaC: Champion reliable and scalable cloud infrastructure utilizing Infrastructure as Code, collaborating with platform teams to establish best practices.
Technical Leadership & Organizational Impact: Influence strategic decisions to improve reliability outcomes and mentor team members to foster a culture of excellence.
Please note that we will only accept candidates who possess the appropriate rights and documentation for employment in India.
About Us:
Axi is a premier global provider specializing in margin and deliverable Foreign Exchange, Contracts for Difference (CFDs), and Financial Spread Betting. Our evolution into a world-class, multifaceted brokerage is marked by a presence across six regions and significant investments in cutting-edge trading technology, designed to deliver the most comprehensive trading experience for clients ranging from novices to institutional investors.
Your Role:
As a Site Reliability Engineer, you will be pivotal in ensuring the availability, reliability, and operational excellence of Axi's technology infrastructure. You will design, implement, and maintain sophisticated monitoring, alerting, and log management solutions. Collaborating closely with Technology teams throughout the Development and Operations phases, your goal is to proactively identify and address any business-impacting incidents before they are reported by affected users, ensuring thorough observability and analysis through effective log management.
Your Responsibilities:
Act as the Product Owner for Monitoring and Observability within Axi's Technology Operations Environment. Evaluate the current environment and propose a roadmap for optimizing product offerings while managing the lifecycle of existing products. Support technology delivery teams through all product delivery phases by gathering requirements, producing detailed designs, conducting PoCs, and architecting solutions. Tweak and refine health rules while maintaining existing monitoring solutions. Minimize toil by documenting and automating repeatable processes. Communicate ideas and designs effectively to both technical and non-technical stakeholders. Consistently document processes and maintain an up-to-date knowledge base of your product expertise.
At Emergent Labs Inc., we are pioneering the future of software development by creating autonomous coding agents that revolutionize traditional programming methods. Our innovative systems can generate, test, and deploy production applications directly from plain-language commands, allowing for a seamless development experience. Since our public launch, we have achieved remarkable milestones, reaching $100 million in Annual Recurring Revenue (ARR) within just 8 months. Our platform has empowered over 6 million users across more than 190 countries to build over 6.5 million applications. With the backing of renowned investors like Khosla Ventures, SoftBank, Google, Lightspeed, Prosus, Together, and Y Combinator, we have raised over $100 million to further our mission. We are committed to tackling the complexities of AI-driven software creation, ensuring correctness, reliability, security, and scalability in production environments. Our team consists of seasoned professionals, including repeat founders, Olympiad medalists, and alumni from IIT and IIM, as well as leaders from tech giants like Google, Amazon, and Dropbox. If you are a builder eager to have ownership, work at speed, and make a global impact, we want you on our team!
Veeam is recognized as the premier Data and AI Trust Company, dedicated to assisting organizations in comprehending, securing, and fortifying their data and AI systems. As the leading entity in data resilience and security posture management, Veeam is designed to address the convergence of identity, data, security, and AI risk. Our headquarters are in Seattle, and we operate in over 30 countries, safeguarding the data of more than 550,000 customers globally who rely on Veeam to maintain business continuity. Join us as we advance together, fostering growth, learning, and making a significant impact for some of the world’s most renowned brands.
We are seeking a Senior Software Engineer - Reliability to take on a pivotal role as a hands-on technical leader within our Site Reliability Engineering (SRE) team. In this position, you will mentor senior engineers, influence product development, and ensure that our operational systems are designed for reliability, scalability, and observability from the ground up. Your responsibilities will include driving strategic initiatives, mentoring others in SRE practices, and defining architectural best practices across our platform. This role is crucial for aligning teams, maintaining high standards, and scaling SRE principles globally within Veeam.
Your tasks will include:
Reliability Engineering & Resilience
Design and enhance infrastructure to ensure high availability, fault tolerance, and scalability across public clouds, starting with Azure and planning expansion to other providers. Establish and uphold Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets to define and enforce reliability goals. Lead incident response initiatives, conduct thorough analysis, facilitate blameless postmortems, and host sharing sessions to maximize learning throughout our engineering team, driving improvements across the socio-technical engineering ecosystem.
Observability & Operational Excellence
Promote deep observability practices, ensuring telemetry, logs, and metrics are effectively utilized to enhance our operational insights.
InMobi Advertising is a prominent global technology entity empowering marketers to seize pivotal moments. Our advertising platform connects with over 2 billion users across more than 150 countries, translating real-time context into impactful business results, all while upholding privacy-first principles. With the trust of over 30,000 brands and leading publishers, InMobi serves as the nexus where intelligence, creativity, and accountability unite. By integrating lock screens, applications, televisions, and the open web with artificial intelligence and machine learning, we provide attentive engagement, precise personalization, and measurable outcomes.
Through Glance AI, we are pioneering AI Commerce, re-envisioning the future of e-commerce through inspiration-driven discovery and shopping experiences. Seamlessly integrated into everyday technology, Glance AI transforms every screen into a portal for immediate, personalized, and delightful discoveries. Covering a wide array of categories such as fashion, beauty, travel, accessories, home décor, pets, and more, Glance AI offers highly tailored shopping experiences. Leveraging rich first-party data and unmatched consumer insights, it utilizes InMobi’s global scale and targeting capabilities to craft high-impact, performance-oriented shopping journeys for brands around the globe.
Recognized as a Great Place to Work and lauded by MIT Technology Review, Fast Company’s Top 10 Innovators, and others, InMobi is a workplace where innovative ideas lead to significant global impact. Supported by investors such as SoftBank, Kleiner Perkins, and Sherpalo Ventures, InMobi operates offices in San Mateo, New York, London, Singapore, Tokyo, Seoul, Jakarta, Bengaluru, and more.
At InMobi Advertising, you will have the chance to influence how billions of users interact with content, commerce, and brands worldwide. For more information, visit www.inmobi.com
Join our dynamic team of innovators at New Relic, where we are committed to redefining the future of observability. Our platform empowers organizations to excel in an AI-driven landscape by providing deep insights into their complex systems. As we broaden our global presence, we are seeking dedicated individuals passionate about optimizing digital applications for top-tier companies. Embark on your career journey with us!
Your Opportunity
As a Senior Software Engineer in the Container Fabric (CF) organization, you will play a vital role in enhancing New Relic's global internal platform. We are searching for an operations-focused engineer with 5-7 years of experience to connect high-performance Go development with expansive Kubernetes orchestration. You will assume a leadership role, guiding critical projects and mentoring junior engineers while ensuring the reliability of our global fleet.
What You'll Do
Architectural Leadership: Spearhead the design and implementation of internal tools using Golang, with a focus on Kubernetes Operators and Controllers to streamline resource management.
Platform Orchestration: Navigate complex infrastructure transitions.
Operational Excellence: Own incident responses, create comprehensive retrospectives, and implement systemic safeguards using advanced overcommit strategies.
Role overview
Netradyne is hiring a Staff Engineer - Hardware in Bangalore to design, test, and implement hardware systems for automotive technology. The position centers on developing solutions that enhance vehicle safety and performance.
What you will do
Design and develop hardware systems tailored for automotive applications. Test and validate hardware to ensure it meets safety and performance standards. Collaborate with engineers from other disciplines to deliver reliable products. Apply technical expertise to support ongoing hardware projects.
Location
This role is based in Bangalore.
About Aspora
At Aspora, we believe that people on the move deserve a banking experience that keeps pace with their lives. Since our inception in 2022, we have been dedicated to creating a seamless, borderless financial operating system that transforms money into a truly mobile and transparent asset for our users. Supported by renowned venture capital firms such as Sequoia Capital, Greylock Partners, Hummingbird Ventures, Y Combinator, and Global Founders Capital, our team of over 75 talented individuals spans across India, the UK, the UAE, the EU, and the US. We operate with a strong sense of ownership, open communication, and an unwavering commitment to customer impact. We value innovative thinkers who challenge the status quo, move swiftly, and navigate regulatory complexities to provide elegant solutions. If you aspire to redefine the future of global banking, we would be thrilled to collaborate on this journey.
About the Role
In the role of Staff Engineer at Aspora, you will play a pivotal role in shaping the backend architecture that underpins our global banking platform. This hands-on position requires you to design, implement, and scale systems that cater to millions of users across various markets. You will tackle complex, ambiguous challenges, distill them into actionable technical strategies, and work in close partnership with product, design, and operations teams to bring high-impact features to fruition. As one of the key individual contributors on the team, you will elevate the technical standards, mentor fellow engineers, and help shape the engineering culture and decision-making processes across teams. If you thrive on building systems from the ground up, solving intricate distributed systems problems, and working at a pace where ideas swiftly transition into deployed features, you will find your place at Aspora.
Zscaler is looking for a Staff Detection Engineer in Bangalore to strengthen its security offerings. This role centers on building and improving detection mechanisms that help safeguard customers from new and evolving threats.
Role overview
Work closely with a skilled team to design and implement detection strategies. The focus is on staying ahead of emerging security risks and ensuring that solutions remain effective as threats change.
Collaboration
This position involves regular teamwork with other engineers and security professionals. Sharing knowledge and ideas is a key part of developing strong detection capabilities.
Impact
The work directly contributes to the safety and trust of Zscaler’s customers by providing reliable protection against sophisticated cyber threats.
Join EnCharge AI, an innovative leader in cutting-edge AI hardware and software solutions for edge-to-cloud computing. Our advanced in-memory computing technology offers unparalleled compute efficiency and density, surpassing today's leading solutions. We are committed to making AI accessible across power, energy, and space-constrained applications. Founded in 2022, our team is composed of seasoned technologists with extensive experience in semiconductor design and AI systems.
Position Overview: We are looking for a dynamic and experienced Staff or Senior Staff Engineer to spearhead SOC Timing Convergence. This pivotal leadership position requires a proactive engineer who can adeptly handle the complexities of modern, large-scale SOC designs. You will serve as a crucial link between Architecture, RTL, DFT, and Physical Design, ensuring a reliable and high-performance pathway to tape-out.
Join SanDisk as a Staff Engineer in our Software Development Engineering team, where you will play a pivotal role in developing innovative applications. Our ideal candidate is passionate about technology, has a strong background in software engineering, and is eager to tackle complex challenges. You will work closely with cross-functional teams to design, implement, and optimize software solutions that enhance our product offerings.
Join Zscaler as a Staff Software Development Engineer, where you will play a crucial role in shaping the future of cybersecurity solutions. You will collaborate with cross-functional teams to design, develop, and implement innovative software solutions that help protect organizations from cyber threats.
Join our dynamic team at Karat as a Staff Software Engineer. In this fully remote role, you will have the opportunity to work with innovative technologies and contribute to our mission of transforming the hiring landscape. As a key player in our engineering team, you will design and implement high-quality software solutions, collaborate with cross-functional teams, and influence the architectural direction of our products.
Mar 27, 2026