Infrastructure Security Software Engineer jobs in San Francisco – Browse 5,857 openings on RoboApply Jobs

Infrastructure Security Software Engineer jobs in San Francisco

Open roles matching “Infrastructure Security Software Engineer” with location signals for San Francisco. 5,857 active listings on RoboApply Jobs.

5,857 jobs found

1 - 20 of 5,857 Jobs
Apply
company
Full-time|On-site|San Francisco

About Resolve AIAt Resolve AI, we are redefining software maintenance and production troubleshooting with our innovative, fully autonomous AI Production Engineer. This pioneering technology autonomously investigates and resolves intricate system challenges from start to finish.Founded by industry leaders Spiros Xanthos and Mayank Agarwal, who were instrumental in the creation of OpenTelemetry and have successfully led previous ventures to Splunk and VMware.We have secured over $150 million in funding from top-tier investors such as Lightspeed, Greylock, and Unusual Ventures, along with individual contributions from renowned figures like Jeff Dean (Google DeepMind), Thomas Dohmke (GitHub), Matt Garman (AWS), Reid Hoffman (LinkedIn), and Fei-Fei Li (Stanford).Your RoleTake charge of security across the entire Resolve AI platform, including application security, cloud infrastructure, internal services, integrations, and agent execution.Design robust security mechanisms tailored for autonomous systems, ensuring agents perform actions safely, audibly, and within established limits.Collaborate closely with engineers across various teams to proactively shape secure system designs from the outset.Integrate security into our software delivery processes: automate security in CI/CD, implement guardrails in our infrastructure, and establish scalable defaults.Assess and enhance our management of identity, permissions, and access across diverse customers, tools, and environments.Monitor emerging risks associated with AI-driven systems and suggest actionable, engineering-focused mitigations.Assist Resolve AI in aligning with enterprise customer expectations by clearly communicating our security posture and engaging in technical security discussions as required.Establish the foundational principles for security practices at Resolve, ensuring they are pragmatic, high-signal, and in line with real-world system operations.

Feb 28, 2026
Apply
companymercor logo
Full-time|On-site|San Francisco

Role Overview mercor is hiring a Cloud Infrastructure Security Engineer in San Francisco. This role focuses on protecting cloud systems, maintaining data integrity, and applying security protocols to guard against new threats. Collaboration with cross-functional teams is central to designing and maintaining secure cloud environments.

Apr 16, 2026
Apply
companydoppel logo
Full-time|On-site|San Francisco

Why Join Doppel?At Doppel, we are dedicated to tackling one of the most significant threats posed by AI: mass-manufactured social engineering. With scams, deepfakes, and social engineering attacks proliferating across digital platforms such as websites, social media, advertisements, encrypted messaging apps, and mobile devices, our mission is both simple and ambitious: to enhance internet safety by outsmarting the fastest-evolving digital threats.Supported by renowned investors like a16z and Bessemer, and trusted by industry leaders such as OpenAI, United Airlines, and Coinbase, Doppel is on a rapid growth trajectory. If you are passionate about addressing real-world challenges through innovative technology, we want to hear from you!What We're BuildingWe are developing an AI-driven platform to combat social engineering on a large scale. This involves creating scalable systems that monitor billions of domains, social media accounts, applications, and dark web forums, utilizing AI agents to detect and neutralize digital threats effectively.What We're Looking ForWe are in search of a skilled backend engineer to enhance the infrastructure needed for our rapidly expanding engineering team. Recent projects include:Developed a self-hosted Elasticsearch infrastructure on Kubernetes, facilitating real-time search capabilities across millions of alerts and associated metadata.Established core infrastructure using Terraform (Infrastructure as Code), enabling reproducible, version-controlled environments and expediting onboarding for new engineers.Implemented a dedicated staging environment, which enhances safety during releases, feature validation, and automated integration testing prior to production deployments.Introduced observability and tracing mechanisms (metrics, logging, distributed tracing), significantly improving our capacity to debug performance issues and sustain reliability at scale.What We Offer A mission-driven culture emphasizing low ego, high accountability, deep customer focus, and exceptional talent density. Complimentary lunch and dinner in the office. Flexible Paid Time Off (PTO). Quarterly team offsites.

Sep 12, 2025
Apply
companyOpenAI logo
Full-time|Hybrid|San Francisco

Join Our Innovative TeamAt OpenAI, security is the cornerstone of our commitment to ensuring artificial general intelligence serves all of humanity. The Identity Infrastructure Engineering team is pivotal in this mission, crafting robust identity and access management solutions that safeguard our model weights, customer data, and essential systems across diverse cloud environments. Collaborating closely with teams across Applied Engineering, Research, IT, and Security, we deliver a secure and scalable platform that empowers permissioning, orchestration, and groundbreaking AI research.Your RoleAs a Software Engineer on our Identity Infrastructure Engineering team, you will play a crucial role in designing, deploying, and managing foundational security tools and infrastructure. This position involves leveraging a wide array of technologies to support multi-cloud deployments, ensuring our researchers and engineers can securely build, test, and scale transformative AI systems. We seek individuals who are technically adept, collaborative, and passionate about integrating secure-by-default principles throughout our technology stack.We invite Software Engineers eager to address challenges in:Identity & Access Management: Develop and sustain systems and interfaces that efficiently manage user and service identities, guaranteeing consistent, fine-grained access controls across various cloud providers and internal services.Multi-Cloud Security: Architect and implement tools that protect model weights, proprietary data, and sensitive assets, seamlessly operating within AWS, Azure, GCP, and future cloud environments.Automation & Tooling: Create robust frameworks, APIs, and CLI tools that automate ongoing security tasks (such as credential provisioning and rotation), allowing teams to focus on AI innovation without compromising security.In this position, you will:Develop new features for our IAM platform that integrate seamlessly with evolving cloud services, enabling teams to operate efficiently while adhering to security best practices.Lead security innovations by designing tools, processes, and frameworks that enhance our infrastructure.This role is primarily based in San Francisco, Seattle, or New York City, with remote work options considered. We embrace a hybrid work model, requiring three days in the office weekly, and provide relocation assistance for new hires.

Feb 5, 2026
Apply
companyFable Security logo
Full-time|$160K/yr - $225K/yr|Hybrid|San Francisco, CA (Hybrid)

About Fable SecurityIn today’s digital landscape, AI-driven threats and human errors represent the most significant risks to enterprise security. Cybercriminals exploit human behavior, contributing to 70% of security breaches. At Fable, we empower individuals to transform from potential targets to active defenders with innovative tools.Fable is at the forefront of human risk management, offering a platform that effectively influences employee behavior. Our user-friendly, scalable solution analyzes complex employee data, identifies high-risk behaviors, and delivers timely interventions directly to users in their work environment.Supported by notable investors like Redpoint Ventures and Greylock Partners, and founded by former members of the Abnormal Security team, Fable is tackling one of cybersecurity's greatest challenges in a rapidly expanding market. Our team comprises alumni from esteemed organizations such as Meta, Twitter, and Flexport, as well as top universities including Waterloo, Columbia, and Stanford. This is an exceptional opportunity for you to join us at a time of rapid growth and help shape the future of security.Why Join UsBuild and scale the foundational data infrastructure that drives a groundbreaking product.Collaborate closely with engineering, data science, and product teams to operationalize data at scale.Become part of a small, high-caliber team where your contributions will have a significant impact.As part of an early-stage company, every engineer plays a crucial role in shaping the evolution of our products and the company's approach to data management.Your RoleAs a Platform and Infrastructure Engineer, you will be instrumental in developing and scaling the core systems that underpin Fable’s product and data operations.Your responsibilities will span backend systems including real-time services and data pipelines. You will ensure reliability, scalability, and optimal performance across all layers. This highly collaborative role involves working closely with data and ML teams, contributing to systems that effectively manage data ingestion, processing, and delivery.This role demands cross-functional collaboration with engineering, data, and product teams to create robust, production-grade systems that grow alongside the company.ResponsibilitiesDesign, develop, and maintain scalable backend and infrastructure systems.Collaborate with cross-functional teams to deliver high-quality software solutions.Ensure system reliability, performance, and security through rigorous testing and monitoring.

Apr 6, 2026
Apply
companymercor logo
Full-time|Remote|San Francisco or NYC

About the Role mercor is hiring a Cloud Infrastructure Security Engineer in San Francisco or New York City. This role focuses on protecting cloud-based systems and data across the company. The engineer will design, implement, and monitor security controls to defend cloud infrastructure against threats. What You Will Do Develop and maintain security measures for cloud environments Monitor cloud infrastructure for vulnerabilities and incidents Respond to security threats and support incident investigations Work to ensure compliance with industry security standards Contribute expertise to strengthen overall security posture Location This position is based in San Francisco or New York City.

Apr 15, 2026
Apply
companySierra logo
Full-time|On-site|San Francisco, CA

About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.

Oct 15, 2025
Apply
companyExa logo
Full-time|On-site|San Francisco, California

At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.

Sep 3, 2025
Apply
companyFable Security logo
Full-time|$160K/yr - $225K/yr|Hybrid|San Francisco, CA (Hybrid)

About Fable SecurityAt Fable Security, we recognize that AI-driven threats and human error pose significant risks to enterprise security. Cybercriminals exploit human behavior, which is responsible for over 70% of security breaches. Our mission is to empower individuals with the right tools, transforming them from targets into an active line of defense.We have developed a human risk platform that effectively shapes employee behavior. Our user-friendly and scalable platform integrates complex employee data, identifies risky behaviors, and automatically delivers timely, relevant interventions where employees are most engaged—in real time.Supported by renowned investors such as Redpoint Ventures and Greylock Partners, and founded by members of the Abnormal Security team, Fable is addressing one of cybersecurity’s most pressing challenges within a multi-billion-dollar market. Our diverse team includes alumni from Meta, Twitter, and prestigious universities like Columbia, Stanford, and UCLA. As we experience rapid growth, this is a prime opportunity to contribute to and influence the future of security.Why Join UsHelp us build and scale the core data infrastructure that drives a groundbreaking product.Collaborate with engineering, data science, and product teams to operationalize data effectively at scale.Be part of a small, elite team where your contributions will have a significant impact.As part of an early-stage company, every engineer plays a crucial role in shaping product functionality and evolution. You will define not only the technical architecture but also the company’s data philosophy.Your RoleIn the position of Data Infrastructure Engineer, you will be responsible for the architecture, scalability, and reliability of our data platform.You will design and construct systems that support everything from real-time product functionalities to internal analytics and machine learning processes, covering the spectrum from data ingestion to production-ready datasets. Additionally, you will establish best practices that underpin our data-driven products.This role is highly cross-functional, requiring close collaboration with engineering, data, and product teams to ensure our data foundation evolves in tandem with our growth.ResponsibilitiesDesign, develop, and sustain scalable data systems.Implement best practices for data architecture and management.Collaborate with cross-functional teams to facilitate data-driven decision-making.

Apr 6, 2026
Apply
companyThinking Machines Lab logo
Full-time|$200K/yr - $475K/yr|On-site|San Francisco

At Thinking Machines Lab, our mission is to empower humanity by advancing collaborative general intelligence. We are dedicated to building a future where everyone can access the knowledge and tools necessary to harness AI for their unique needs and objectives.We are a team of scientists, engineers, and builders who have developed some of the most widely used AI products, including ChatGPT and Character.ai, and contributed to open-weight models like Mistral, along with popular open-source projects such as PyTorch, OpenAI Gym, Fairseq, and Segment Anything.About the RoleWe are seeking an Infrastructure Engineer to take charge of evolving the security infrastructure that supports our foundational models. In this pivotal role, you will collaborate across computing, storage, networking, and data platforms to ensure our systems remain secure, reliable, and scalable. You will design controls, architecture, and tooling that embed security into the platform's core functionalities. Working closely with research and product teams, you will enable them to operate swiftly while safeguarding our models, data, and environments.Note: This is an "evergreen role" that we maintain for ongoing interest. While we receive numerous applications, there may not always be an immediate position that perfectly matches your skills and experience. We encourage you to apply, as we continuously assess applications and reach out to candidates when new opportunities arise. Feel free to reapply if you gain more experience, but please refrain from applying more than once every six months. Additionally, we occasionally post openings for specific roles to meet project or team-specific needs, and in those cases, you are welcome to apply directly in conjunction with this evergreen role.What You’ll DoDesign security patterns for platforms and services, including network segmentation, service-to-service authentication, RBAC, and policy enforcement in Kubernetes and cloud environments.Oversee identity, access, and secrets management for users and services: workload and cross-cloud identity, least-privilege IAM, and secrets management.Create secure platforms for data ingestion, processing, and curation, encompassing classification, encryption, access controls, and safe sharing practices across teams.Develop threat models and review designs with researchers and engineers to facilitate safe and scalable feature launches.Automate security checks and implement guardrails: policy-as-code, secure infrastructure baselines, CI/CD validation, and tools that streamline secure operations.

Dec 2, 2025
Apply
companySemgrep logo
Full-time|$145.5K/yr - $171K/yr|On-site|San Francisco, Boston, New York, Denver

About SemgrepSemgrep is the vanguard of code security, revolutionizing the way developers create software by enabling frictionless innovation. Our platform aids teams in identifying, flagging, and resolving real security issues prior to deployment, supported by a security system that evolves with every line of code written. Semgrep ensures the integrity of code during development and provides essential guardrails, allowing developers to work swiftly and securely. Trusted by prominent organizations such as Snowflake, Dropbox, and Figma, Semgrep enhances visibility and control for security teams while delivering solutions seamlessly within the developer workflow. Our AI technology minimizes false positives and prioritizes actionable vulnerabilities, achieving validation from 95% of security experts across over 6 million findings.Founded in San Francisco and backed by leading venture capital firms like Menlo Ventures, Felicis Ventures, Lightspeed Venture Partners, Redpoint Ventures, and Sequoia Capital, Semgrep has been recognized by Gartner in Application Security Testing. Discover more at semgrep.dev.About the RoleThe Infrastructure team at Semgrep manages the critical cloud infrastructure that supports both internal and external applications, including our flagship offerings, Semgrep Code and Semgrep Supply Chain. Our mission is to empower Semgrep employees to confidently build, operate, and maintain their systems. We achieve this by offering a cohesive platform that transitions projects seamlessly from code to cloud, utilizing CI/CD systems, Kubernetes in AWS environments, and robust tools for production software operations.In this role, you will immerse yourself in the application security domain, collaborating with seasoned engineers to develop a robust infrastructure platform and build secure, reliable, and high-performing distributed systems. Embrace Semgrep’s transparent culture, where your insights and contributions can directly shape the success of our startup. Your work will play a vital role in positioning Semgrep as a leading static-analysis project, making a lasting impact not only within our organization but also in the broader developer community.Your Responsibilities:Collaborate with senior and staff engineers to design, implement, and deploy infrastructure initiatives.Learn and apply infrastructure best practices in conjunction with team members.Take ownership of projects, platforms, and tools within the team, managing them over several weeks.

Nov 6, 2025
Apply
companyLyft logo
On-site|On-site|San Francisco, CA

At Lyft, we are dedicated to connecting people and creating a community where every team member feels valued and empowered to reach their full potential.As a pivotal player in transforming how our communities move, Lyft's engineering team is rapidly expanding. We are seeking passionate Software Engineers specialized in Security to join our dynamic Security team. Together, we will enhance our ability to deliver secure services at scale.Lyft is entrusted with the sensitive information of both drivers and passengers, and we take the responsibility of safeguarding that data seriously. Our Security team spearheads initiatives across the organization to protect our systems and uphold user trust.Our work encompasses designing and building a robust security architecture, collaborating with various teams during the development and launch of new products, anticipating potential challenges, and managing security incidents effectively. Our impact spans the entire organization, covering all aspects of the technology stack, including infrastructure, web applications, mobile apps, IT, and even autonomous vehicles. We adopt an engineering-focused approach to security, aiming to automate and streamline our processes while ensuring frequent updates. Explore more about our innovations on our blog at https://eng.lyft.com/tagged/security.The Cloud Security team is dedicated to enhancing Lyft's security posture by architecting a comprehensive security model tailored for our cloud infrastructure, protecting both our employees and intellectual property.As a Senior Software Engineer, you will play a crucial role in shaping this team and driving high-impact security initiatives. Your responsibilities will include leading security reviews, implementing detection measures, addressing vulnerabilities, enforcing the principle of least privilege, and establishing secure configurations for our multi-cloud and container environments.

Dec 26, 2025
Apply
companyServal logo
Full-time|On-site|San Francisco

Who We AreServal is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer—a system of agents that integrates across business functions, maintaining the momentum of modern companies.We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.Role OverviewAs a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.What You'll DoDesign, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.Provide technical support and guidance to enterprise customers during installation and deployment phases.

Jan 29, 2026
Apply
companyfal logo
Full-time|On-site|San Francisco

We are seeking a talented and dedicated Staff Security Engineer to join our dynamic team at fal. In this role, you will be vital in enhancing our infrastructure's security posture while ensuring compliance with industry standards and best practices.Your expertise will contribute to designing, implementing, and maintaining robust security solutions that safeguard our systems and data. You will collaborate closely with cross-functional teams to identify vulnerabilities and develop effective mitigation strategies.

Apr 6, 2026
Apply
companyImprint logo
Full-time|On-site|San Francisco

About UsAt Imprint, we are revolutionizing the world of co-branded credit cards and innovative financial solutions, focusing on smarter, more rewarding, and brand-first experiences. We collaborate with renowned brands such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to establish modern credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our robust platform integrates advanced payment technologies, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the complexities of becoming a bank.Co-branded credit cards represent over $300 billion in U.S. annual spending, yet many are still managed by outdated banking systems. Imprint stands as the modern alternative—flexible, technology-driven, and tailored for today’s consumers. Supported by notable investors like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a world-class team dedicated to reshaping payment methods and driving brand growth. If you thrive in fast-paced environments, enjoy tackling complex challenges, and aspire to make a significant impact, we would be delighted to meet you.Discover more about us on Imprint's Technology Blog.The TeamThe Tech Platform Engineering Team at Imprint is pioneering the democratization of access to advanced technologies, empowering teams across our organization to innovate and excel. Our commitment to redefining the Fintech landscape drives us to build secure, highly available infrastructures while equipping our engineers with comprehensive development tools, allowing them to rapidly create world-class products.Your RoleDesign, build, and manage cloud and web infrastructure with a strong emphasis on security, reliability, and scalability.Implement and maintain infrastructure components across computing, networking, and data platforms.Adhere to security best practices in cloud infrastructure, ensuring proper access control, network isolation, and secure communication between services.Monitor system health and engage in incident response, root cause analysis, and reliability enhancements.Collaborate with platform, security, and product engineers to deliver safe and efficient infrastructure solutions.

Jan 16, 2026
Apply
companyvooma logo
Full-time|On-site|San Francisco Office

About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.

Jul 1, 2025
Apply
companyBaseten logo
Full-time|$300K/yr - $300K/yr|On-site|San Francisco

ABOUT BASETENJoin Baseten, where we drive mission-critical AI inference for leading companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our unique blend of applied AI research, robust infrastructure, and intuitive developer tools empowers organizations at the forefront of AI innovation to deploy state-of-the-art models into production. Recently, we secured a $300M Series E funding round, backed by esteemed investors such as BOND, IVP, Spark Capital, Greylock, and Conviction. Be a part of our rapid growth and help shape the platform that engineers trust for launching AI products.THE ROLEAs an Infrastructure Software Engineer at Baseten, you will play a pivotal role in developing and maintaining our ML inference platform that powers AI applications in production. Your contributions will enhance the core infrastructure, enabling developers to deploy, scale, and monitor machine learning models with exceptional performance.EXAMPLE INITIATIVESYou will engage in innovative projects within our Infrastructure team, including:Multi-cloud capacity managementInference on B200 GPUsMulti-node inferenceFractional H100 GPUs for efficient model servingRESPONSIBILITIESDesign and develop infrastructure components for our ML inference platform, primarily using Python and Go.Implement and maintain Kubernetes deployments for optimal model serving.Contribute to the orchestration layer for model deployments.Build and enhance monitoring systems to track model performance metrics effectively.Develop efficient resource management solutions to optimize performance.

Mar 9, 2025
Apply
companySift logo
Full-time|$150K/yr - $200K/yr|On-site|San Francisco, CA

At Sift, we are revolutionizing the way cutting-edge machines are constructed, tested, and managed. Our innovative platform provides engineers with real-time visibility into high-frequency telemetry, effectively removing bottlenecks and facilitating quicker, more dependable development.Sift originated from our experience at SpaceX, contributing to projects like Dragon, Falcon, Starlink, and Starship, where the demands of scaling telemetry, debugging flight systems, and ensuring mission reliability necessitated a new kind of infrastructure. Founded by a talented team from SpaceX, Google, and Palantir, Sift is tailored for mission-critical systems where precision and scalability are imperative.As one of the pioneering engineers at Sift, your role will extend beyond just coding—you will play a crucial part in defining the architecture, shaping the product, and influencing the culture of a company dedicated to addressing real engineering challenges. If you're eager to take on intricate technical obstacles and build foundational systems that support complex machines from the ground up, we would love to connect with you.

Oct 30, 2025
Apply
companyIvo logo
Full-time|On-site|San Francisco, California

Join Ivo's Engineering Team!At Ivo, we are pioneers in the tech industry. Our engineers are innovators who have created groundbreaking solutions such as:• An AI agent that seamlessly integrates with MS Word to enhance document editing [2023]• Revolutionizing embedding models with agentic RAG technology [2023]• Advanced LLM-based legal fact extraction capabilities [2024]• A legal assistant designed to search extensive contract databases without compromising accuracy [2024]• Clustering legal documents from the same lineage [2025]• Automatic deviation analysis to uncover hidden risks in vast contract databases [2025]• Merging contracts with their amendments to create a “composite” contract timeline that has moved our clients to tears [2025]Role OverviewAs an Infrastructure Engineer at Ivo, you will lay the groundwork for our platform's future. Your responsibilities will include:• Designing and owning the future of our infrastructure, allowing you the freedom to innovate.• Managing multiple customer deployments, ensuring each receives tailored containers, databases, and VPCs.• Instrumenting our systems to identify performance bottlenecks and errors.• Aggregating metrics and logs into visually appealing dashboards and setting up pager alerts.• Leading infrastructure-related incidents and being on-call as necessary.• Enhancing our CI/CD system to reduce deployment time from ~12 minutes.If you're passionate about LLMs, you'll thrive in our engineering team, where you’ll have the opportunity to:• Develop real-time LLM evaluations to monitor the accuracy of our responses.• Collaborate with talented engineers to push the boundaries of DevOps.

Nov 20, 2025
Apply
companyAstranis logo
Full-time|On-site|San Francisco

Astranis is seeking a talented and motivated Software Engineer to join our Infrastructure team. In this role, you will be at the forefront of developing and maintaining critical software systems that support our innovative satellite technology. You'll collaborate with cross-functional teams to design, implement, and optimize our infrastructure solutions, ensuring high reliability and performance.

Apr 9, 2026

Sign in to browse more jobs

Create account — see all 5,857 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.