Qualifications
Desired Experience: Proven experience in designing and managing large-scale infrastructure, including GPU clusters, extensive Kubernetes clusters, or cloud batch job systems. An unwavering commitment to reliability, observability, and optimization across the entire stack.
About the job
At Exa, we are pioneering a groundbreaking search engine dedicated to powering every AI application. Our team is committed to building a robust infrastructure that enables us to crawl the web, develop cutting-edge embedding models for indexing, and create exceptionally high-performance vector databases using Rust for efficient searching. Our resources include a significant $5M H200 GPU cluster, which routinely activates tens of thousands of machines.
The Infrastructure Team at Exa is responsible for creating the foundational tools and infrastructure that sustain all our systems. We are seeking talented infrastructure engineers to help us build the 'machine that builds the machine,' allowing our engineering organization to operate at maximum speed. This may involve developing GPU cluster orchestration with Kubernetes, executing map-reduce batch jobs on Ray, or crafting the most advanced observability tools available.
About Exa
Exa is a forward-thinking technology company focused on developing innovative solutions that enhance AI capabilities. Our mission is to create scalable and efficient infrastructures that empower AI applications, making a significant impact in the tech industry.
Similar jobs
1 - 20 of 2,169 Jobs
Atlas is revolutionizing the restaurant industry by developing a comprehensive operating system designed to streamline the processes of starting, managing, and expanding restaurants, whether online or offline. The talented team at Atlas previously founded Grain, a venture-backed online restaurant that achieved millions in revenue. Today, Atlas empowers restaurants with innovative solutions such as online storefronts, POS systems, third-party logistics, and seamless integrations with food platforms and AI technologies.
Our current clientele includes notable names like SaladStop, Killiney, and Haidilao, and we are continuously bringing new brands into our ecosystem, including Casa Vostra, Artichoke, and Wewa, adding fresh restaurants every week.
Our team and investors hail from prestigious companies such as Y Combinator, Global Founders Capital, Grain, Accenture, Microsoft, Udacity, McKinsey, and Salesforce.
Explore our hiring memo here.
Role Overview
The Product Infrastructure Engineers at Atlas are crucial in propelling every engineering effort forward. You will construct the systems that enhance the safety, speed, and predictability of our shipping processes.
Your work will be situated at the crossroads of infrastructure and product, with the systems you design powering the fundamental experiences that span compute, databases, APIs, deployment pipelines, and measurement frameworks. Your contributions will not only support scalability; they will shape the evolution of Atlas as a product.
Key Responsibilities
Design and develop robust infrastructure for multi-tenant computing, databases, queuing systems, and observability tools.
Enhance deployment pipelines, implement feature gating, and facilitate canary rollouts to ensure safe and rapid shipping.
Scale shared services and core platform components utilized across the Atlas ecosystem.
Develop internal tools for monitoring, metrics, and experimentation to foster learning and reliability.
Collaborate with product engineers to ensure scalability, performance, and fault tolerance are prioritized from the outset.
Reassess abstractions and defaults that could hinder speed or resilience.
Required Skills and Experience
6+ years of experience in Software Engineering or Site Reliability Engineering (or Infrastructure Engineering).
Proficiency with container orchestration platforms and tools such as Docker and Kubernetes.
Experience with infrastructure as code and configuration management tools.
Strong incident management skills and experience leading incident responses.
Familiarity with Google Cloud Platform services and tools.
Knowledge of modern observability platforms like Prometheus, Grafana, and ScoutAPM.
Experience with Ruby on Rails and PostgreSQL is a plus.
Ideal Candidate Attributes
You value speed and craftsmanship equally.
You have created solutions that enhance both the product and the development process.
You possess a systems-oriented mindset, understanding how code, data, and infrastructure influence product development.
About ClickHouse
Ranked on the prestigious 2025 Forbes Cloud 100 list, ClickHouse stands as a trailblazer in the rapidly evolving private cloud sector. With a remarkable base of over 3,000 clients and an astonishing annual recurring revenue (ARR) growth exceeding 250% year-on-year, ClickHouse excels in delivering real-time analytics, data warehousing, observability, and AI workloads.
The company’s relentless growth trajectory was recently underscored by a successful $400M Series D funding round. In just the last quarter, notable clients such as Capital One, Lovable, Decagon, Polymarket, and Airwallex have embraced our platform or expanded their existing deployments. These clients join an esteemed roster of AI pioneers and global brands, including Meta, Cursor, Sony, and Tesla.
Join us in our mission to revolutionize data utilization for businesses. Be part of our exciting journey!
About the Team
The Cloud Infrastructure Engineering team is responsible for constructing and overseeing the essential components of the ClickHouse Cloud data plane from end to end. This encompasses compute, networking, security, and a multi-cloud, multi-region architecture that ensures a dependable and scalable managed ClickHouse experience for our customers. We are seeking exceptionally skilled and experienced cloud infrastructure software engineers to join our team, who will play a pivotal role in designing, deploying, and maintaining our infrastructure.
As a leading innovator, Simular is creating the future of computer user agents, developing AI systems that can operate your computer on your behalf. Our backend infrastructure powers everything from live VM orchestration to agent planning and execution.
We seek a versatile backend/infrastructure engineer who excels in uncertain environments, possesses strong architectural instincts, and is eager to take ownership of substantial and evolving components of our system. This role is not limited to traditional 'DevOps' or 'API engineer' functions; you'll play a pivotal role in shaping our platform's scalability and evolution for years to come.
Key Responsibilities
Architect and scale the systems that support Simular Cloud and our cross-platform agents.
Oversee essential backend services, including APIs, data flows, billing systems, observability, and deployment pipelines.
Address complex infrastructure challenges such as VM orchestration, reproducibility, parallel execution, and reliability.
Identify opportunities to split or refactor services and lead that evolution.
Explore new avenues as products expand, including microVMs, multi-agent orchestration, fine-tuned model endpoints, and community task galleries.
Collaborate with product teams and researchers to transform ideas into production-ready systems.
Ideal Candidate Attributes
Proficient in programming fundamentals with solid skills in languages such as Python, Go, Rust, or similar.
Experience with some aspects of cloud infrastructure, distributed systems, virtualization, and CI/CD; most importantly, the ability to learn and adapt to new technologies as required.
Think architecturally, understanding when to hack and when to design for long-term success.
Comfortable managing large, ambiguous problems from start to finish.
Bonus: Experience with GPU scheduling, large-scale ML infrastructure, or scaling SaaS applications.
The Company
Arta Finance is on a bold and fulfilling mission to empower individuals worldwide to achieve financial success. By harnessing advanced AI and cutting-edge digital tools, which were once exclusive to ultra-high-net-worth individuals, Arta makes these resources accessible to a diverse global audience. Imagine having your own digital family office, merging intelligent investment strategies, alternative asset opportunities, private market access, and smart automation to effortlessly grow and safeguard your wealth. Our core values revolve around trust, teamwork, and adaptability.
The Role
As a Senior Cloud Infrastructure Engineer, you will be responsible for designing, developing, and maintaining a resilient infrastructure that spans the entire technology stack, from user-serving servers to data pipelines, storage systems, trading platforms, and research infrastructure. Your critical contributions will support both real-time and batch workflows, ensuring scalability, performance, and high reliability for our mission-critical systems.
This is an individual contributor position, where your expertise will guide various initiatives in collaboration with cross-functional teams.
What You Will Do
Design, develop, and maintain scalable infrastructure for real-time and batch workflows, emphasizing high-throughput and low-latency data processing for autonomous trading, quantitative research, and reliable user-serving servers.
Work closely with frontend, research, and trading teams to ensure that the infrastructure and data pipelines meet performance, precision, and usability requirements.
Integrate third-party platforms and data providers, ensuring secure and reliable API/data stream ingestion while tackling complex external data integration challenges.
Proactively identify and resolve bottlenecks and inefficiencies in distributed systems, optimizing performance and resource utilization.
Take ownership of deployment pipelines and production operations to guarantee high availability, fault tolerance, and observability of critical services.
Architect and implement scalable systems that support high-throughput, low-latency data processing essential for autonomous trading and quantitative research.
Ensure data quality, consistency, and traceability through thorough testing, validation, and monitoring pipelines.
Uphold security best practices and collaborate with internal stakeholders to meet compliance and auditing standards.
Join Our Team at Airwallex
At Airwallex, we provide a groundbreaking unified payments and financial platform tailored for global enterprises. Our innovative blend of proprietary technology and software empowers over 200,000 businesses worldwide, including industry leaders like Brex, Rippling, Navan, Qantas, SHEIN, and many more. We offer fully integrated solutions that simplify the management of business accounts, payments, spend oversight, treasury functions, and embedded finance on a global scale.
Founded in Melbourne, our vibrant team of over 2,000 talented individuals across 26 offices worldwide is dedicated to creating the future of financial services. With a valuation of US$8 billion and support from leading investors such as T. Rowe Price, Visa, Mastercard, Robinhood Ventures, Sequoia, Salesforce Ventures, DST Global, and Lone Pine Capital, Airwallex is at the forefront of revolutionizing global payments. If you’re ready to take on the most ambitious projects of your career, we invite you to join us.
What We Look For
We seek builders with a founder-like mentality who aspire to create a substantial impact, experience rapid personal growth, and take true ownership of their work. You should possess strong expertise and analytical skills, driven by our mission and operating principles. You are quick to act with sound judgment, deeply curious, and capable of making decisions based on first principles, balancing speed with thoroughness.
Collaboration and humility are key traits you bring to the table as you transform innovative ideas into tangible products. You thrive in an environment that encourages the use of AI for smarter work and faster problem-solving. Here, you'll face complex challenges alongside exceptional colleagues and advance your career as we redefine the landscape of global banking. If this resonates with you, let's embark on this journey together.
Your Role: Contribute to Our IAM & Account Infrastructure Team
Become a vital member of our Identity and Access Management (IAM) and Account team, where your contributions will significantly enhance our identity solutions and strengthen our account infrastructure. We are eager to welcome passionate engineers ready to engage in pioneering projects that improve the availability, scalability, and security of our platform. Collaborate globally with product managers, designers, and fellow engineers to deliver outstanding IAM solutions that protect businesses and optimize our account infrastructure for Airwallex's remarkable growth.
Join OKX as a Senior/Staff Android Software Engineer and play a vital role in shaping the future of crypto. You will be responsible for developing and maintaining our core OKX app, which serves millions of users daily. Collaborating with cross-functional teams, you will identify customer needs and implement high-quality features through rapid iterations. This position offers a unique opportunity to gain insights into the complete lifecycle of crypto mobile applications, including professional and retail trading, asset management, and wallet functionalities.
About Airwallex
Airwallex is a revolutionary unified payments and financial platform catering to global enterprises. With our distinct blend of proprietary infrastructure and cutting-edge software, we empower over 200,000 businesses worldwide (including industry leaders like Brex, Rippling, Navan, Qantas, and SHEIN) to seamlessly manage everything from business accounts and payments to spend management, treasury services, and embedded finance solutions on a global scale.
Founded in Melbourne, our diverse team of over 2,000 talented innovators operates across 26 global offices. With a valuation of US$8 billion and support from top-tier investors such as T. Rowe Price, Visa, Mastercard, Robinhood Ventures, Sequoia, Salesforce Ventures, DST Global, and Lone Pine Capital, Airwallex is at the forefront of shaping the future of global payments and finance. Are you ready to embark on the most ambitious journey of your career? Join us!
Attributes We Value
We seek passionate builders who embody a founder-like spirit, aiming for tangible impact and accelerated growth. You possess strong expertise and sharp critical thinking, driven by our mission and operating principles. You thrive in fast-paced environments, combining sound judgment with a deep curiosity that allows you to make data-informed decisions while maintaining a balance between speed and thoroughness.
With a collaborative and humble mindset, you turn innovative concepts into tangible products and ensure projects are executed from start to finish. Leveraging AI, you work efficiently to resolve challenges swiftly. Here, you’ll engage with complex, high-stakes issues alongside exceptional teammates and advance your career as we redefine the landscape of global banking. If this resonates with you, let’s build the future together.
Role Summary
We are in search of a technically adept and execution-focused individual to spearhead the global expansion of our Issuing Product. In this pivotal role, you will serve as the vital link between our external partners (including networks, banks, and vendors) and our internal Solution Engineering (SE) and Product teams. You will be accountable for ensuring the technical success of our regional growth and for developing products that enhance our infrastructure's scalability.
Key Responsibilities
1. Regional Expansion & Strategic Launches
End-to-End Technical Setup: Lead the technical initiatives for launching the Issuing Product in new markets while enhancing our issuing infrastructure capability map.
Join our dynamic AI Product Team as a Senior Software Engineer, where you will be instrumental in revolutionizing the way analysts interpret and process data. Your role will place you at the cutting edge of applied artificial intelligence, developing advanced systems that improve sense-making, information discovery, and decision support for national security. In this position, you will be responsible for designing and implementing AI-powered applications such as retrieval-augmented generation (RAG) systems and intelligent automation workflows. You will also develop the underlying web systems and data processing pipelines that empower these functionalities. Your contributions will span the entire technology stack, from backend services and data retrieval layers to user-friendly frontend interfaces that deliver AI capabilities. Our team prioritizes the application and integration of AI technologies, focusing on innovation, rapid experimentation, and making a tangible impact in the real world.
As a Senior Infrastructure Engineer Consultant at Thoughtworks, you will play a pivotal role in empowering our clients to design and enhance the infrastructure systems that underpin their software delivery. You will thrive in collaborative settings, working alongside diverse teams to address complex challenges and develop innovative solutions that align with organizational goals. Your blend of technical acumen and adaptive thinking will drive improvements, ensuring the highest standards of technical quality and operational efficiency. You will guide clients in adopting agile methodologies and the DevOps mindset, fostering a culture of collaboration and continuous improvement.
Join our team as a Senior Infrastructure Engineer specializing in Network. In this role, you will be pivotal in designing, implementing, and maintaining robust network solutions that enhance our operational efficiency. You will work collaboratively with cross-functional teams to ensure seamless connectivity and security across our infrastructure.
Key responsibilities include:
Designing network architecture to support business operations.
Implementing security measures to safeguard our network.
Monitoring network performance and optimizing configurations.
Providing technical support and troubleshooting network issues.
Join our dynamic team as a Senior Infrastructure Engineer specializing in projects, where your expertise will be integral to the delivery and management of innovative infrastructure solutions. You will collaborate with cross-functional teams to design, implement, and optimize our infrastructure systems to enhance operational efficiency and support strategic initiatives.
We are seeking a highly skilled Senior Infrastructure Engineer / Observability Lead to join our dynamic team at ncs3 in Singapore. In this pivotal role, you will spearhead our observability initiatives, ensuring robust infrastructure monitoring, performance optimization, and data-driven decision-making. You will collaborate with cross-functional teams to design and implement innovative infrastructure solutions that enhance system reliability and scalability.
Join our team as a Senior Engineer specializing in AI Platform and Infrastructure. In this pivotal role, you will be responsible for developing and optimizing our AI infrastructure, ensuring robust performance and scalability. You will collaborate closely with cross-functional teams to design innovative solutions that enhance our AI capabilities. Your expertise will help drive the future of our AI initiatives and contribute to the overall technological advancement of our organization.
Join our dynamic team at NCS as a Senior Infrastructure Engineer specializing in QRadar. In this pivotal role, you will lead the design, implementation, and optimization of infrastructure solutions to enhance our cybersecurity posture. Your expertise will be crucial in developing robust security frameworks and ensuring compliance with industry standards.
We are seeking a highly skilled Senior System Engineer to join our dynamic team at ncs3. In this role, you will be responsible for designing, implementing, and maintaining infrastructure systems that support our hybrid IT environment. Your expertise will play a critical role in ensuring the seamless integration of cloud and on-premises resources.
Join our Engineering Enablement team at Motional as a Senior Software Engineer and play a pivotal role in enhancing developer productivity across the organization. In this position, you will be responsible for designing and evolving internal platforms, services, and toolchains that are essential for our engineering workflows. Your collaboration with various teams within the company will be crucial in identifying systemic bottlenecks and implementing scalable, self-service developer platforms. This will help to alleviate cognitive load, streamline CI/CD workflows, and empower teams to transition from code to production with both speed and confidence. Your primary focus will be on our internal developer platform and productivity tools, which are designed to enable engineers to deliver reliable, high-quality code more rapidly and efficiently.
Role overview
CoreWeave is hiring a Senior Production Engineer in Singapore. This role focuses on improving and maintaining production systems that support the company's operations. Collaboration with teams across the organization is central to the work.
What you will do
Optimize and support production systems to keep operations running smoothly.
Work with colleagues from different functions to address challenges and deliver improvements.
Identify and implement solutions that boost system performance and reliability.
Join Esri as a Senior GIS Solution Engineer specializing in Infrastructure. In this role, you will leverage your expertise in GIS technology to support our clients in optimizing their infrastructure projects. You will work closely with cross-functional teams to design, implement, and enhance GIS solutions that address complex challenges in infrastructure management.
Your contributions will help shape the future of GIS technology in the infrastructure sector, driving innovation and efficiency in project execution.
Join Airwallex as a Product Director, Infrastructure
At Airwallex, we are redefining the payments landscape for businesses around the globe. As the only unified financial platform designed specifically for global enterprises, we leverage our proprietary infrastructure and software to empower over 200,000 businesses, including industry leaders like Brex, Rippling, Navan, Qantas, and SHEIN. Our solutions encompass everything from business accounts and payments to spend management and treasury, all integrated seamlessly to facilitate business growth on a global scale.
Founded in Melbourne, our diverse and innovative team of over 2,000 talented professionals operates across 26 offices worldwide. Our valuation stands at an impressive US$8 billion, supported by world-class investors such as T. Rowe Price, Visa, Mastercard, Robinhood Ventures, and more. If you are ready to embrace the most ambitious work of your career, we invite you to join our mission.
What We Look For
We seek proactive builders who share our founder-like energy and desire to create meaningful impact. You should possess strong expertise in your role, coupled with a sharp analytical mindset. You are driven by our mission and operating principles. Your ability to make quick yet judicious decisions, paired with a deep curiosity to explore challenges, will be vital in your role.
As a team player, you will transform innovative concepts into tangible products and ensure efficient execution from start to finish. Utilizing AI to enhance productivity and expedite problem-solving is part of our culture. You will face complex, high-visibility challenges alongside exceptional teammates, advancing your career as we innovate the future of global banking. If this resonates with you, let’s create the future together.
Note: This position is located in Singapore and requires proficiency in Mandarin.
About the Team
You will collaborate with the Global Treasury Payment Network (GTPN) product team, a dynamic group focused on developing the infrastructure that facilitates international money movement at Airwallex. The team plays a crucial role in enabling our customers to transfer funds worldwide efficiently, while directly contributing to our growth objectives and strategic initiatives. We prioritize curiosity, customer focus, and rapid cross-functional execution, aiming to deliver reliable, scalable products while fostering individual growth through mentorship and continuous learning opportunities.
Jan 23, 2026