Senior Infrastructure Engineer Observability Lead jobs in Singapore – Browse 2,056 openings on RoboApply Jobs

Senior Infrastructure Engineer Observability Lead jobs in Singapore

Open roles matching “Senior Infrastructure Engineer Observability Lead” with location signals for Singapore. 2,056 active listings on RoboApply Jobs.

2,056 jobs found

1 - 20 of 2,056 Jobs
Apply
companyncs3 logo
Full-time|On-site|Singapore

We are seeking a highly skilled Senior Infrastructure Engineer / Observability Lead to join our dynamic team at ncs3 in Singapore. In this pivotal role, you will spearhead our observability initiatives, ensuring robust infrastructure monitoring, performance optimization, and data-driven decision-making. You will collaborate with cross-functional teams to design and implement innovative infrastructure solutions that enhance system reliability and scalability.

Mar 30, 2026
Apply
companyfuku logo
Full-time|On-site|Singapore, Singapore, Singapore

We are seeking an experienced Observability Backend Engineer to join our team at fuku, a rapidly expanding global consumer internet platform that connects millions of users through content, community, e-commerce, and advertising. This role will be pivotal as we enhance our AI-driven infrastructure and ensure the reliability and performance of our systems across various international regions. You will be at the forefront of developing advanced observability systems designed to support both traditional distributed systems and the next generation of AI infrastructure, ensuring our platform remains stable and efficient.

Apr 9, 2026
Apply
companyfuku logo
Full-time|On-site|Singapore, Singapore, Singapore

Join our team as an Observability Backend Engineer focused on Internationalisation in Singapore!As a vital part of our backend engineering and infrastructure team, you will contribute to our rapidly expanding consumer internet platform, which serves hundreds of millions of users across various ecosystems, including content, community, e-commerce, and advertising.As we extend our global reach and enhance our AI-driven infrastructure, your role will be pivotal in developing next-generation observability systems. These systems will ensure reliability, performance, and compliance across diverse regions, supporting both traditional distributed systems and emerging AI workloads.Key Responsibilities:- Spearhead the development of our observability stack, focusing on metrics, logging, tracing, and profiling.- Establish foundational observability capabilities throughout the technology stack.- Design and implement observability platforms and systems, including: - Monitoring platforms - End-to-end distributed tracing systems - Logging services - Computation engines (e.g., streaming analytics, real-time alerting, time-series analysis) - Alerting systems - eBPF-based observability capabilities- Oversee architecture design and product-level implementation.- Ensure the observability infrastructure achieves high performance, availability, and stability under high-concurrency workloads.- Continuously refine systems and platforms through iterative enhancements.- Support observability architecture, data compliance, and infrastructure stability initiatives on both domestic and international fronts.- Promote the implementation of AI infrastructure and application observability, driving the integration of “Observability + AI” capabilities.- Enhance the stability of AI-driven systems as well as the efficiency and usability of traditional observability platforms.

Apr 9, 2026
Apply
companyncs3 logo
Full-time|On-site|Singapore

We are seeking a highly skilled and experienced Senior Infrastructure / Desktop Engineer (Team Lead) to join our dynamic team at ncs3. In this pivotal role, you will lead a talented group of engineers in managing and optimizing our IT infrastructure and desktop support services. Your expertise will play a crucial role in ensuring the availability, performance, and security of our systems.As a Team Lead, you will be responsible for mentoring junior engineers, overseeing project delivery, and collaborating with cross-functional teams to drive continuous improvement in our infrastructure processes. Your ability to communicate effectively and foster a collaborative environment will be essential in this role.

Feb 9, 2026
Apply
companyLifted An Upwork Company logo
Contract|Remote|Singapore

Lifted, an Upwork company, is seeking a Senior Software Engineer / Site Reliability Engineer based in Singapore. This role centers on observability, focusing on developing and enhancing tools and practices that improve system reliability and support smooth software delivery. Key responsibilities Design and build monitoring solutions to deliver clear insights into system health and performance. Review and interpret system performance data to spot trends, identify bottlenecks, and highlight areas that need attention. Collaborate with cross-functional teams to detect and address issues early, aiming to prevent user impact. Role focus This position places a strong emphasis on observability. The work involves both hands-on engineering and close coordination with other teams to ensure reliable, efficient software delivery.

Apr 22, 2026
Apply
companyThoughtworks logo
Full-time|On-site|Singapore, Singapore

*Thoughtworks Singapore is currently seeking applicants who possess the right to work in Singapore, specifically Singapore Citizens due to the nature of our operations.As a Lead Consultant and Infrastructure Engineer at Thoughtworks, you will play a pivotal role in assisting clients to design, build, and enhance the systems that support their software delivery and operational capabilities. You will thrive in collaborative environments with diverse teams, leveraging your technical expertise and understanding to address the unique needs of our clients. Your commitment to championing technical quality and promoting effective working methodologies will lead to improved outcomes for our clients. You will also guide organizations in adopting Agile methodologies and DevOps practices to foster collaboration and efficiency.Key ResponsibilitiesAssess client needs to develop a comprehensive technical roadmap and impactful solutions aligned with their strategic objectives.Contribute to the growth of Thoughtworks' cloud and infrastructure practice by collaborating with business development, marketing, and capability development teams.Establish and enhance controls and processes to ensure continuous delivery and evolution of infrastructure and applications, emphasizing automation throughout.Take a proactive role in monitoring project deliverables to ensure they meet technical expectations consistently.Provide expert guidance in DevOps, cloud, and infrastructure engineering both internally and at client sites.Build and maintain trusted partnerships with client leadership across engineering and commercial sectors.Lead the design and implementation of innovative solutions to overcome current business constraints and policies.

Mar 9, 2026
Apply
companyAirwallex logo
Full-time|On-site|SG - Singapore

About AirwallexAirwallex stands as a pioneering unified payments and financial platform tailored for global enterprises. Through our innovative blend of proprietary infrastructure and advanced software, we empower over 200,000 businesses worldwide, including notable names such as Brex, Rippling, Navan, Qantas, SHEIN, and more. Our fully integrated solutions facilitate the management of business accounts, payments, spend management, treasury, and embedded finance on a global scale.Founded in Melbourne, we take pride in our diverse team of over 2,000 skilled professionals across 26 offices worldwide. With a valuation of US$8 billion and support from leading investors like T. Rowe Price, Visa, Mastercard, Robinhood Ventures, Sequoia, Salesforce Ventures, DST Global, and Lone Pine Capital, Airwallex is at the forefront of shaping the global payments and financial landscape. If you're eager to embark on the most ambitious journey of your career, we invite you to join our mission.Attributes We ValueWe seek builders with an entrepreneurial spirit who are passionate about making a substantial impact, accelerating their learning, and embracing true ownership. You should possess strong role-specific expertise, analytical thinking, and be driven by our mission and operating principles. You approach challenges with urgency and sound judgment, foster curiosity, and base decisions on fundamental principles, striking a balance between speed and thoroughness.We value humility and collaboration; you will transform innovative concepts into tangible products and drive projects to completion. By leveraging AI, you'll enhance efficiency and resolve challenges promptly. Here, you'll engage with complex, high-stakes issues alongside exceptional teammates, advancing your career as we redefine the future of global banking. If this resonates with you, let’s create what’s next together.About The TeamAs our engineering division evolves, the infrastructure teams play a critical role in Airwallex's capacity to establish a globally distributed platform, boost engineering productivity, and optimize resource utilization. Our teams are dedicated to ensuring an optimal experience for our developers and customers while maintaining a secure environment that guarantees the availability of our financial solutions.Your RoleThe Data DevOps team is charged with equipping our data engineering and analytics teams with essential tools. You will oversee the development and maintenance of the data infrastructure, as well as create and enhance tools and platforms that enable the engineering team to operate with improved self-service capabilities.

Nov 6, 2024
Apply
companyNCS Pte Ltd logo
Full-time|On-site|Singapore

As an AIOps Engineer (Splunk), you will play a crucial role in designing, building, and testing the AIOps and Observability platform. Your primary focus will be on developing AIOps use cases, operationalizing them to meet customer requirements, and significantly enhancing productivity in service delivery and operations. Key Responsibilities:Architect, design, develop, deploy, and maintain the enterprise logging and observability platform utilizing Splunk or Elastic ELK.Contribute to the architectural design by assessing trade-offs related to scalability, resiliency, high availability, and security.Conduct capacity planning and solution reviews for the ELK/Splunk environments.Implement solutions for business data analysis and design data structures using the Elastic ELK/Splunk observability platform.Oversee high-volume data ingestion and real-time data flow processes.Collaborate with data log streaming platforms and tools for data ingestion from diverse systems and applications.Design and develop multi-tenant dashboard solutions.Establish and maintain operational best practices to ensure the effective functioning of the Elastic ELK/Splunk observability solution.Actively contribute to the enhancement of the Elastic ELK/Splunk observability solution.Optimize and fine-tune the Elastic ELK/Splunk observability solution to fulfill performance requirements.Work closely with developers to promote best practices for the data warehouse and analytics environment.Investigate emerging technologies and advancements to address customer needs and implement relevant upgrades.Develop, test, and operationalize AIOps use cases.Ensure platform operation meets high availability standards and aligns with customer SLA. 

Jun 12, 2025
Apply
companyThoughtWorks logo
On-site|On-site|Singapore

As a Senior Infrastructure Engineer Consultant at Thoughtworks, you will play a pivotal role in empowering our clients to design and enhance the infrastructure systems that underpin their software delivery. You will thrive in collaborative settings, working alongside diverse teams to address complex challenges and develop innovative solutions that align with organizational goals. Your blend of technical acumen and adaptive thinking will drive improvements, ensuring the highest standards of technical quality and operational efficiency. You will guide clients in adopting agile methodologies and the DevOps mindset, fostering a culture of collaboration and continuous improvement.

Feb 3, 2026
Apply
companyAirwallex logo
Full-time|On-site|SG - Singapore

Join Our Team at AirwallexAt Airwallex, we are revolutionizing the world of global payments and financial solutions. Our unique blend of proprietary technology and innovative software supports over 200,000 businesses worldwide, from giants like Brex and Qantas to vibrant startups. With our cutting-edge platform, we offer comprehensive solutions for business accounts, payments, spend management, treasury, and embedded finance.Founded in Melbourne, our diverse team of over 2,000 talented professionals spans 26 offices globally. Currently valued at $8 billion and supported by leading investors such as T. Rowe Price and Visa, we are on a mission to redefine the future of global banking. If you are seeking an opportunity to make a significant impact while growing your career, we want to hear from you!What We Look ForWe seek motivated builders with an entrepreneurial spirit who are eager to make a real impact, learn rapidly, and take ownership of their work. You will bring in-depth technical expertise, analytical thinking, and a passion for our mission and operating principles. You thrive in a fast-paced environment, using curiosity to drive deep understanding and decision-making based on first principles.As a humble team player, you will transform innovative ideas into tangible products while ensuring execution from start to finish. Leveraging AI, you will work smarter and solve challenges efficiently. Here, you will engage with high-impact projects alongside exceptional colleagues and advance your career as we shape the future of global finance.About Our TeamThe Data & AI Infrastructure team is crucial to Airwallex’s ability to create a globally distributed platform. We focus on enhancing engineer productivity and optimizing resource utilization. Our mission is to deliver an outstanding experience for both our developers and customers while maintaining a secure and reliable environment for our financial solutions.Your RoleAs a member of the Data & AI Infrastructure team, you will develop and maintain vital data infrastructure, providing our data engineering and analytics teams with essential tools. Your responsibilities will include creating and enhancing tools and platforms that empower the engineering team with improved self-service capabilities.

Mar 4, 2026
Apply
companyNCS Pte Ltd logo
Full-time|On-site|Singapore

Join our team as a Senior Infrastructure Engineer specializing in Network. In this role, you will be pivotal in designing, implementing, and maintaining robust network solutions that enhance our operational efficiency. You will work collaboratively with cross-functional teams to ensure seamless connectivity and security across our infrastructure.Key responsibilities include:Designing network architecture to support business operations.Implementing security measures to safeguard our network.Monitoring network performance and optimizing configurations.Providing technical support and troubleshooting network issues.

Aug 22, 2024
Apply
companyncs3 logo
Full-time|On-site|Singapore

Join our dynamic team as a Senior Infrastructure Engineer specializing in projects, where your expertise will be integral to the delivery and management of innovative infrastructure solutions. You will collaborate with cross-functional teams to design, implement, and optimize our infrastructure systems to enhance operational efficiency and support strategic initiatives.

Jan 26, 2026
Apply
companyOKX logo
Full-time|On-site|Singapore, Singapore

Join our team as a Senior Engineer specializing in AI Platform and Infrastructure. In this pivotal role, you will be responsible for developing and optimizing our AI infrastructure, ensuring robust performance and scalability. You will collaborate closely with cross-functional teams to design innovative solutions that enhance our AI capabilities. Your expertise will help drive the future of our AI initiatives and contribute to the overall technological advancement of our organization.

Mar 19, 2026
Apply
companyNCS Pte Ltd logo
Full-time|On-site|Singapore

Join our dynamic team at NCS as a Senior Infrastructure Engineer specializing in QRadar. In this pivotal role, you will lead the design, implementation, and optimization of infrastructure solutions to enhance our cybersecurity posture. Your expertise will be crucial in developing robust security frameworks and ensuring compliance with industry standards.

Feb 4, 2025
Apply
companyncs3 logo
Full-time|On-site|Singapore

We are seeking a highly skilled Senior System Engineer to join our dynamic team at ncs3. In this role, you will be responsible for designing, implementing, and maintaining infrastructure systems that support our hybrid IT environment. Your expertise will play a critical role in ensuring the seamless integration of cloud and on-premises resources.

Jan 26, 2026
Apply
company
Full-time|On-site|Singapore, Singapore, Singapore

Atlas is revolutionizing the restaurant industry by developing a comprehensive operating system designed to streamline the processes of starting, managing, and expanding restaurants, whether online or offline. The talented team at Atlas previously founded Grain, a venture-backed online restaurant that achieved millions in revenue. Today, Atlas empowers restaurants with innovative solutions such as online storefronts, POS systems, third-party logistics, and seamless integrations with food platforms and AI technologies.Our current clientele includes notable names like SaladStop, Killiney, and Haidilao, and we are continuously bringing new brands into our ecosystem, including Casa Vostra, Artichoke, and Wewa, adding fresh restaurants every week.Our team and investors hail from prestigious companies such as Y Combinator, Global Founders Capital, Grain, Accenture, Microsoft, Udacity, McKinsey, and Salesforce.Explore our hiring memo here.Role OverviewThe Product Infrastructure Engineers at Atlas are crucial in propelling every engineering effort forward. You will construct the systems that enhance the safety, speed, and predictability of our shipping processes.Your work will be situated at the crossroads of infrastructure and product, with the systems you design powering the fundamental experiences that span compute, databases, APIs, deployment pipelines, and measurement frameworks. Your contributions will not only support scalability; they will shape the evolution of Atlas as a product.Key ResponsibilitiesDesign and develop robust infrastructure for multi-tenant computing, databases, queuing systems, and observability tools.Enhance deployment pipelines, implement feature gating, and facilitate canary rollouts to ensure safe and rapid shipping.Scale shared services and core platform components utilized across the Atlas ecosystem.Develop internal tools for monitoring, metrics, and experimentation to foster learning and reliability.Collaborate with product engineers to ensure scalability, performance, and fault tolerance are prioritized from the outset.Reassess abstractions and defaults that could hinder speed or resilience.Required Skills and Experience6+ years of experience in Software Engineering or Site Reliability Engineering (or Infrastructure Engineering).Proficiency with container orchestration platforms and tools such as Docker and Kubernetes.Experience with infrastructure as code and configuration management tools.Strong incident management skills and experience leading incident responses.Familiarity with Google Cloud Platform services and tools.Knowledge of modern observability platforms like Prometheus, Grafana, and ScoutAPM.Experience with Ruby on Rails and PostgreSQL is a plus.Ideal Candidate AttributesYou value speed and craftsmanship equally.You have created solutions that enhance both the product and the development process.You possess a systems-oriented mindset, understanding how code, data, and infrastructure influence product development.

Oct 28, 2025
Apply
companyEsri logo
Full-time|On-site|Singapore, SG

Join Esri as a Senior GIS Solution Engineer specializing in Infrastructure. In this role, you will leverage your expertise in GIS technology to support our clients in optimizing their infrastructure projects. You will work closely with cross-functional teams to design, implement, and enhance GIS solutions that address complex challenges in infrastructure management.Your contributions will help shape the future of GIS technology in the infrastructure sector, driving innovation and efficiency in project execution.

Mar 26, 2026
Apply
companyncs logo
Full-time|On-site|Singapore

We are seeking a highly skilled Senior Infrastructure Engineer specializing in Active Directory to join our dynamic team at ncs. In this role, you will be responsible for designing, implementing, and maintaining our infrastructure systems with a focus on Active Directory services. You will collaborate with cross-functional teams to ensure optimal performance and security of our infrastructure.

Oct 14, 2024
Apply
companyOKX logo
Full-time|On-site|Singapore, Singapore

Join OKX as a Senior Staff Engineer focusing on AI Platform and Infrastructure, where you will lead the design and development of advanced AI solutions. You will be instrumental in developing large-scale infrastructure to support machine learning and generative AI applications across various business domains, including Compliance, Trading, and Business Intelligence. This role demands a hands-on technical leader who can set the strategic direction for our AI platforms, ensuring reliability and scalability as we integrate AI into mission-critical operations.

Feb 9, 2026
Apply
companyClickHouse logo
Full-time|Remote|Singapore (Remote)

About ClickHouseRanked on the prestigious 2025 Forbes Cloud 100 list, ClickHouse stands as a trailblazer in the rapidly evolving private cloud sector. With a remarkable base of over 3,000 clients and an astonishing annual recurring revenue (ARR) growth exceeding 250% year-on-year, ClickHouse excels in delivering real-time analytics, data warehousing, observability, and AI workloads.The company’s relentless growth trajectory was recently underscored by a successful $400M Series D funding round. In just the last quarter, notable clients such as Capital One, Lovable, Decagon, Polymarket, and Airwallex have embraced our platform or expanded their existing deployments. These clients join an esteemed roster of AI pioneers and global brands, including Meta, Cursor, Sony, and Tesla.Join us in our mission to revolutionize data utilization for businesses. Be part of our exciting journey!About the TeamThe Cloud Infrastructure Engineering team is responsible for constructing and overseeing the essential components of the ClickHouse Cloud data plane from end to end. This encompasses compute, networking, security, and a multi-cloud, multi-region architecture that ensures a dependable and scalable managed ClickHouse experience for our customers. We are seeking exceptionally skilled and experienced cloud infrastructure software engineers to join our team, who will play a pivotal role in designing, deploying, and maintaining our infrastructure.

Mar 5, 2026

Sign in to browse more jobs

Create account — see all 2,056 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.