Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Manager
Qualifications
Proven experience in engineering management, particularly within AI or data-centric environments. Strong background in software development and architectural design. Excellent leadership and team-building skills, with a focus on mentoring engineers. Ability to communicate effectively with technical and non-technical stakeholders. Experience with Agile methodologies and project management tools. A passion for innovation and continuous improvement.
About the job
Decagon is seeking an Engineering Manager to lead its AI & Data Infrastructure team in San Francisco. This role centers on guiding engineers as they develop AI solutions and robust data frameworks to advance Decagon’s technology roadmap.
Role overview
The Engineering Manager will oversee a team dedicated to AI and data infrastructure initiatives. The position involves hands-on leadership, ensuring projects move forward and align with company objectives.
What you will do
Lead and mentor engineers working on AI and data infrastructure projects
Drive project execution to enhance product capabilities
Foster a collaborative and supportive team environment
Oversee strategic planning and allocate resources for the team
Manage team performance and encourage professional growth
Requirements
Experience leading technical teams in AI and data infrastructure
Strong leadership and clear communication abilities
Skill in strategic planning and resource management
Dedication to building technology solutions that make a difference
This position offers the chance to shape Decagon’s products and technology direction through AI and data-driven work.
About Decagon
At Decagon, we are at the forefront of technological innovation, specializing in AI and data-driven solutions. Our team is dedicated to pushing the boundaries of what is possible, creating products that not only meet the needs of our clients but also set new industry standards. We value creativity, collaboration, and a commitment to excellence.
Full-time|On-site|San Francisco, CA, US; Palo Alto, CA, US
About the Role Pinterest is looking for an Engineering Manager II to guide the Infrastructure team. This group builds and maintains the systems that keep Pinterest running smoothly and reliably at scale. What You Will Do Lead a team of engineers focused on infrastructure projects Shape technical strategy and direction for core systems Work to ensure high availability and strong performance across services Location This role is based in San Francisco, CA or Palo Alto, CA.
Full-time|$190K/yr - $253.8K/yr|On-site|Mountain View, California; San Francisco, California
P-931 At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world—from revolutionizing transportation to fast-tracking medical innovations. We achieve this by developing and managing the foremost data and AI infrastructure platform, enabling our clients to leverage profound data insights to enhance their enterprises. Founded by engineers with a customer-centric approach, we seize every chance to resolve technical challenges, from crafting next-generation UI/UX for data interactions to scaling our services and infrastructure across millions of virtual machines. And we’re just getting started. Within Databricks, the Compute Infrastructure organization is responsible for building and operating the essential framework that supports all Data, AI, and stateful workloads across major cloud platforms. Our system launches tens of millions of VMs daily, manages thousands of Kubernetes clusters, and must deliver exceptional elasticity, reliability, and cost-effectiveness. We are in search of an Engineering Manager to lead a team focused on pivotal components of this platform. Your contributions will significantly impact product delivery speed, customer satisfaction, and our company's scalability. The impact you will have: Own and enhance the compute platform to support all Databricks workloads, enabling engineers to create top-tier products with high velocity and superior performance. Recruit exceptional engineers and nurture their development through guidance, feedback, and career advancement opportunities. Elevate the technical and operational standards through robust design practices, rigorous testing, and a culture of engineering excellence and platform thinking. Collaborate with engineering and product leadership to establish long-term strategies and roadmaps. Lead cross-functional initiatives encompassing both product and infrastructure domains. Influence architectural decisions that extend beyond your immediate team.
Join Our Mission at WatershedWatershed is a pioneering enterprise sustainability platform, trusted by industry leaders such as Airbnb, Carlyle Group, FedEx, Visa, and Dr. Martens. Our platform empowers organizations to manage climate and ESG data effectively, generate audit-ready metrics for voluntary and regulatory compliance including CSRD, and actively pursue significant decarbonization efforts. We are seeking passionate team members who are eager to contribute to product development in a mission-driven startup environment, and who are excited to help shape a thriving team culture.With offices in San Francisco, New York, Denver, London, Paris, Berlin, Sydney, Mexico City, and a robust remote workforce across the US and Europe, we aim to create a dynamic and inclusive workplace that welcomes your participation!Your RoleAre you passionate about developing both cutting-edge technology and high-performing teams? Watershed is scaling its world-class engineering team, and we’re looking for an Engineering Manager to lead our Cloud Infrastructure team. This team plays a critical role in enabling our growth by managing the foundational infrastructure that supports Watershed’s multi-region deployments on GCP.We seek a leader who will articulate an inspiring vision for the team, tackle a diverse array of complex challenges, and empower each engineer to achieve their best work. As one of the initial managers in a rapidly growing organization, you will not only influence engineering culture but also the broader culture at Watershed. Together with our leadership and HR teams, you will help foster a diverse, inclusive, and dedicated workforce.Key ResponsibilitiesLead the Cloud Infrastructure team as the primary manager, balancing people management, technical leadership, and product strategy.Inspire and guide the team through significant growth phases ($100M+), including recruiting new talent and adapting the team's vision and objectives as business needs evolve.Collaborate closely with engineering peers and cross-functional partners, including Security and Finance, to align direction, set priorities, and ensure seamless collaboration.
About the Role Crusoe Technologies is looking for an Infrastructure Engineer & Lab Manager in San Francisco, CA. This role oversees the daily operations of lab environments, making sure systems stay reliable and ready for technical projects. The position combines hands-on infrastructure work with lab management responsibilities to keep teams productive and projects moving forward. Main Responsibilities Lead infrastructure projects from planning through execution Manage lab operations, equipment, and workflows Monitor and maintain system performance to support ongoing technical initiatives
Decagon seeks an Engineering Manager to lead its Platform Infrastructure team in San Francisco. This position shapes the technical foundation behind Decagon’s scalable applications, focusing on both performance and reliability. The role involves hands-on leadership and a commitment to building infrastructure that supports the company’s growth. Role overview This manager will oversee a group of engineers dedicated to platform infrastructure. The team’s work underpins the systems that allow Decagon’s products to scale smoothly and operate dependably. What you will do Guide and support engineers working on key infrastructure projects Direct the development and maintenance of systems that power Decagon’s applications Encourage solutions that boost platform performance and reliability
About Our TeamThe Infrastructure Engineering team operates within the IT department, dedicated to the reliable construction, deployment, and management of critical on-premises and hybrid environments that empower our internal services and vital research and development projects.This newly established team is committed to implementing rigorous Site Reliability Engineering (SRE) practices in environments where uptime, safety, recoverability, and security are paramount. We aim to replace unique, one-off infrastructure with standardized infrastructure-as-code components that enhance reliability and operational efficiency as OpenAI continues to grow.About This RoleWe are in search of an Infrastructure Engineering Lead who will architect, build, and maintain reliable, secure, and scalable infrastructure that supports identity, access, endpoint, and shared platform services throughout the organization.You will take full ownership of infrastructure and identity systems from conceptual design and provisioning to policy enforcement, upgrades, recovery, and ongoing operations. Your goal will be to develop robust, production-grade platforms that minimize operational hurdles, enforce security by default, and empower teams to work more effectively and confidently.This position is ideal for a senior engineer who excels in navigating ambiguity, relishes the challenge of overseeing complex systems from start to finish, and enhances reliability and security by transforming fragile implementations into standardized, repeatable infrastructure.This role is based at our San Francisco headquarters and requires in-office attendance.Key Responsibilities:Define and refine infrastructure patterns for on-prem and hybrid environments, including self-hosted platforms, vendor-supported systems, and lab settings.Establish standardized, production-grade deployment and operational models that replace custom-built solutions.Collaborate with IT, Security, Identity, and Network teams to ensure infrastructure is designed to meet reliability, security, and access standards.Design and enhance the production architecture for Identity and Access Management (IAM) adjacent platforms, such as Microsoft Entra, utilizing SRE principles.Develop common management protocols and shared resources within Azure subscriptions to ensure uniformity and policy compliance in operations.
About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.
Full-time|Remote|San Francisco, CA, New York, NY, Portland, OR, or Remote within Canada or United States
Join Mercury as a Senior Infrastructure Engineer, where you will be pivotal in shaping the infrastructure that supports our innovative financial solutions. You will work closely with cross-functional teams to design, implement, and maintain scalable and reliable infrastructure systems. This role is ideal for individuals who thrive in a fast-paced environment and are passionate about leveraging technology to drive business success.
About Our TeamAt OpenAI, we believe that identity is the cornerstone of trust in a world driven by artificial intelligence. As users engage with ChatGPT, they require robust systems to safeguard their personal and organizational data, allowing secure sharing across applications and agents. By ensuring strong identity management, we unlock the potential of AI to be safe, personalized, and genuinely beneficial, empowering users while maintaining their control. Our team is dedicated to developing the trusted OpenAI account layer, which connects users, businesses, developers, and agents with OpenAI applications and the expansive AI ecosystem. Our systems serve as the gateway to OpenAI, held to the highest standards of availability, security, and performance, and are relied upon by various internal teams.About the RoleAs the Engineering Manager for Identity Infrastructure, you will spearhead a team responsible for critical identity services that demand exceptional reliability, security, and speed, even under substantial load. You will establish the vision for the team, foster its growth, and collaborate closely with engineering and security stakeholders to enhance the platform that facilitates trusted access to OpenAI's products.We seek individuals who possess a deep understanding of infrastructure combined with strong leadership skills—someone capable of guiding technical strategies, ensuring reliable operations, and cultivating a high-performing team that delivers essential systems.This position is based in San Francisco, CA, with a hybrid work model of three days in the office per week. We also offer relocation assistance to new team members.Key Responsibilities:Cultivate, lead, and enhance a high-performing engineering team. Provide coaching and mentorship to engineers and senior technical leads to maximize their impact and support their professional development.Define and take ownership of the technical strategy, roadmap, and execution of OpenAI's essential identity systems.Direct the architecture of distributed systems operating at scale, ensuring high standards for reliability, privacy, and performance.Collaborate closely with Product and Infrastructure teams to address new product requirements while maintaining elevated system standards.Ensure effective team operations, uphold strong engineering principles, and consistently achieve ambitious objectives.Establish processes and a culture that promotes rapid learning, high trust, and responsible innovation.Lead the team responsible for the authentication stack and token exchange infrastructure, ensuring high availability, robust security measures, and low latency performance.Continuously improve request validation and response handling to enhance system reliability and user experience.
Decagon is seeking an Engineering Manager to lead its AI & Data Infrastructure team in San Francisco. This role centers on guiding engineers as they develop AI solutions and robust data frameworks to advance Decagon’s technology roadmap. Role overview The Engineering Manager will oversee a team dedicated to AI and data infrastructure initiatives. The position involves hands-on leadership, ensuring projects move forward and align with company objectives. What you will do Lead and mentor engineers working on AI and data infrastructure projects Drive project execution to enhance product capabilities Foster a collaborative and supportive team environment Oversee strategic planning and allocate resources for the team Manage team performance and encourage professional growth Requirements Experience leading technical teams in AI and data infrastructure Strong leadership and clear communication abilities Skill in strategic planning and resource management Dedication to building technology solutions that make a difference This position offers the chance to shape Decagon’s products and technology direction through AI and data-driven work.
About SesameAt Sesame, we envision a world where computers can interact with us in authentic, lifelike ways—seeing, hearing, and collaborating as humans do. Our mission is to create an innovative computer interface that seamlessly integrates voice agents into everyday life. Our diverse team comprises founders from Oculus and Ubiquity6 and seasoned professionals from Meta, Google, and Apple, each bringing extensive expertise in hardware and software. Join us in pioneering a future where technology feels alive.About the RoleAs a Backend Infrastructure Engineer at Sesame, you will play a pivotal role in shaping the foundational aspects of our technology stack. This position focuses on developing high-impact infrastructure, services, and tools that are broad-reaching rather than narrowly defined. You will tackle scalability and architectural challenges across various domains, including agentic workflows, speech recognition and synthesis, IoT, large-scale training, and efficient low-latency inference. If you're driven by the challenge of creating an ultra-efficient, scalable, and reliable engineering ecosystem through a blend of tooling, services, libraries, and infrastructure, this is the perfect opportunity for you.Responsibilities:Design and develop foundational infrastructure to support serving, training, and applications at Sesame.Enhance productivity for engineering teams by automating processes and creating exceptional tools.Deliver software solutions that empower product and machine learning engineers to build secure, scalable, and dependable systems from the ground up.Your responsibilities will encompass provider, service, security, and developer infrastructure, as well as the architecture and implementation of core services and libraries.
Join our dynamic team at Bland Inc. as a Senior Infrastructure Engineer, where you will play a critical role in designing and implementing robust infrastructure solutions. You will work alongside a talented group of professionals, using cutting-edge technology to drive innovation and efficiency.
About MercorMercor operates at the cutting edge of labor markets and artificial intelligence research. Collaborating with top AI laboratories and corporations, we supply the essential human intelligence that drives AI advancement.Our extensive talent network educates state-of-the-art AI models much like teachers impart knowledge to students: by sharing insights, experiences, and contextual understanding that cannot be encoded. Currently, over 30,000 experts in our network generate more than $2 million daily.At Mercor, we are pioneering a new realm of work where expertise fuels AI progress. Achieving this ambitious vision demands a dynamic, fast-paced, and deeply dedicated team. Here, you will collaborate with researchers, operators, and AI firms at the forefront of transforming societal systems.As a profitable Series C company with a valuation of $10 billion, Mercor operates five days a week from our new headquarters in San Francisco.About the RoleIn your role as an Infrastructure Engineer at Mercor, you will be instrumental in constructing and scaling the systems that support our rapid expansion. You will ensure that our infrastructure is highly reliable, cost-efficient, and capable of accommodating surges in traffic and computational demands. Your collaboration with product, research, and operations engineers will be vital in designing scalable architectures, optimizing deployments, and enhancing observability.We are broadening our search across Infrastructure roles, including Developer Productivity Engineer, Database Engineer, and Platform Engineer. Candidates will be matched to teams after the initial screening, so we encourage applications even if your expertise is predominantly in one area.What You'll Work OnDesigning and maintaining core infrastructure across cloud environments.Creating Infrastructure-as-Code workflows to automate deployments and scaling.Enhancing monitoring, logging, and alerting systems to ensure reliability.Managing CI/CD pipelines (Github, Spacelift) for seamless deployments.Assisting in disaster recovery planning and ensuring system availability.Collaborating with product and research teams to design architectures that meet workload demands.Identifying and resolving performance bottlenecks in compute, storage, and networking.
About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.
Be part of our mission to redefine AI by shaping the narrative surrounding document understanding.Role OverviewAt LlamaIndex, our Infrastructure team lays the groundwork for our product and provides essential tools that facilitate the development, deployment, and monitoring of our code. We are tasked with designing, constructing, and scaling the core infrastructure that drives a high-capacity data platform for AI applications. We seek individuals who are passionate about creating supportive systems that enhance our engineering capabilities and contribute to our rapidly expanding product suite.Ideal candidates will have a strong background in cloud infrastructure management, navigating various scalability challenges, and enhancing the productivity of the broader Engineering team. Key traits we value in our culture include a customer-centric mindset, collaboration, diligence, and optimism. We are looking for proactive team players who are eager to help us evolve our culture as we grow.Key ResponsibilitiesCollaborate with engineering teams to develop and maintain foundational systems that empower developers and support our rapid growth.Design and execute scalable infrastructure solutions suitable for various deployment models, including SaaS, single-tenant, and private environments.Oversee and optimize cloud resources and Kubernetes clusters to ensure cost-effectiveness and high performance.Facilitate successful external customer deployments by establishing clear infrastructure guidelines and principles.Enhance the release and deployment processes to improve efficiency and reliability.Ensure compliance with applicable regulations and implement comprehensive security measures across all deployment environments.QualificationsMinimum of 5 years of engineering experience.Experience working on Platform or Infrastructure teams on substantial projects involving infrastructure components like Terraform/CDKTF, Kubernetes, Helm, testing infrastructure, release management, and observability.Proficient in optimizing cloud resource utilization.Skilled in tuning Kubernetes clusters and cloud resources for optimal performance and cost efficiency.Dedicated to cultivating LlamaIndex’s engineering culture as we expand.Ability to balance speed and pragmatism in delivering solutions.
Full-time|Remote|San Francisco, CA or Remote (USA)
Join Fieldguide as a Senior Infrastructure Engineer and be at the forefront of our innovative infrastructure solutions. In this role, you will lead the design, implementation, and maintenance of our infrastructure systems while ensuring optimal performance, security, and scalability. Your expertise will help shape our technology strategy and drive impactful projects.
About HappyRobotHappyRobot is pioneering the AI-native operating system for the real economy, bridging the gap between intelligence and action. By harnessing real-time truths, specialized AI workers, and orchestrating intelligence, we empower enterprises to manage complex, mission-critical operations with unprecedented autonomy.Our AI OS accumulates knowledge, optimizes processes at every level, and evolves continually. Our initial focus is on supply chain and industrial-scale operations, where resilience, speed, and ongoing improvement are paramount—liberating humans to engage in strategy, creativity, and other high-value endeavors.To explore our vision further, check out our Manifesto. To date, HappyRobot has successfully raised $62 million, including a recent $44 million in Series B funding in September 2025, with support from esteemed investors like Y Combinator (YC), Andreessen Horowitz (a16z), and Base10—partners dedicated to our mission of redefining enterprise operations. We are using this investment to build a world-class team of individuals with relentless drive, exceptional problem-solving skills, and a passion for pushing boundaries in a dynamic, high-intensity environment. If this resonates with you, we invite you to join us at HappyRobot.About the RoleWe are in search of an Infrastructure Engineer to spearhead the enhancement of our operational resilience as we scale. You will be responsible for the stability, observability, and debugging processes that ensure our systems operate seamlessly. As the primary troubleshooter for complex failures in real-time, you will design tools that transform chaos into clarity and assist in transitioning our operations from reactive to proactive.This role carries significant impact and trust, as you will influence how we approach reliability—reducing incident frequency, creating internal tools, and directly enhancing developer focus and system uptime. If you thrive on uncovering the root causes of challenging issues and fortifying systems (and teams), this is your opportunity.
Join Example Org, a pioneering software company revolutionizing real-time collaboration on essential workflows. Established in 2012, we proudly serve over 10,000 customers globally and have the backing of esteemed investors like Example Capital. With our Series C funding, we are valued at $750 million.As an Infrastructure Engineer, you will be an integral part of our dynamic team, reporting directly to the team manager. Your contributions will be vital in enhancing our workflows and driving our projects to success.Your ResponsibilitiesEngage in collaborative meetings to align on project deliverables.Lead innovative initiatives that push our technological boundaries.Assist in recruiting and building a strong team.Mentor and support the professional development of team members.
Are you a passionate engineer with a knack for building robust infrastructure? Join our dynamic team at fal as a Senior/Staff Infrastructure Engineer. In this pivotal role, you will design and implement innovative solutions that enhance our infrastructure's efficiency and reliability.As a key member of our engineering team, your responsibilities will include:Architecting scalable infrastructure solutions to meet our growing needs.Collaborating with cross-functional teams to identify and resolve infrastructure challenges.Implementing automation tools and frameworks to streamline operations.Monitoring performance and ensuring the security of our systems.Providing mentorship and guidance to junior engineers.We are looking for individuals who thrive in a fast-paced environment and have a deep understanding of infrastructure technologies.
Feb 23, 2026
Sign in to browse more jobs
Create account — see all 8,251 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.