Experience Level
Senior
Qualifications
Key Responsibilities:
Design, build, and maintain robust and scalable infrastructure solutions to support an expanding platform and customer base.
Collaborate closely with software engineers to enhance application performance and reliability.
Implement comprehensive monitoring, alerting, and logging systems for proactive issue identification and resolution.
Automate deployment processes and optimize infrastructure management using cutting-edge DevOps tools and practices.
Advance our backend architecture/infrastructure for both cloud-based and on-premise deployments.
Work with the team to prioritize and strategize our roadmap to maximize customer impact.
Lead initiatives aimed at improving infrastructure reliability, performance, and cost efficiency.
Required Skills and Qualifications:
8+ years of experience in distributed systems focusing on designing and managing cloud-based environments (AWS, Azure, GCP).
Proficient with containerization technologies (Docker, Kubernetes) and container orchestration platforms.
Strong passion for developing and operating tools that enhance developer productivity and platform engineering.
Familiarity with CI/CD pipelines and version control systems.
Understanding of automated testing best practices and frameworks to ensure software reliability through integration, performance, and other testing methodologies.
About the job
At Siftstack, we are revolutionizing the development, testing, and operation of modern machines. Our innovative platform provides engineers with real-time observability over high-frequency telemetry, effectively removing bottlenecks and accelerating the development process.
Emerging from our groundbreaking work at SpaceX on projects including Dragon, Falcon, Starlink, and Starship, Siftstack was founded by an exceptional team with experience from SpaceX, Google, and Palantir. We are dedicated to building mission-critical systems where precision and scalability are essential.
As a senior engineer at Siftstack, you will not just write code; you will have a significant role in shaping the architecture, guiding the product's evolution, and influencing the culture of a company that tackles real engineering challenges. If you are eager to face complex technical issues and contribute to foundational systems for innovative machines, we want to connect with you.
About Siftstack
Siftstack is at the forefront of creating advanced infrastructure for modern engineering challenges, stemming from our legacy at SpaceX. With a focus on reliability and scalability, we are committed to developing solutions that empower engineers in their work with complex systems.
Similar jobs
Databricks is looking for a Senior Software Engineer focused on Compute Infrastructure in San Francisco, California. This position centers on building and improving compute architecture to support greater performance and scalability across Databricks' platform.
What you will do
Develop and optimize compute infrastructure to handle demanding data processing and analytics workloads.
Work closely with teams from different disciplines to deliver reliable, high-quality solutions for customers.
Impact
Your contributions will help define how data processing and analytics evolve at Databricks. The work directly supports customers’ ability to scale and perform complex tasks in the cloud.
Who we’re looking for
Strong background in cloud technologies and compute systems.
Enjoys tackling complex technical challenges.
Collaborative approach to problem-solving with cross-functional teams.
Team and Platform Focus
The Compute Infrastructure team at OpenAI designs, builds, and maintains the systems that support AI research at scale. This work brings together accelerators, CPUs, networking, storage, data centers, orchestration software, agent infrastructure, developer tools, and observability. The aim is to create a reliable, unified experience for researchers and product teams across the company. Projects span the full stack: capacity planning, cluster lifecycle management, bare-metal automation, and distributed systems. The team manages Kubernetes scheduling, system optimization, high-performance networking, storage, fleet health, reliability, workload profiling, benchmarking, and improvements to the developer experience. Even small improvements in communication, scheduling, hardware efficiency, or debugging can significantly accelerate research. OpenAI matches engineers to areas within Compute Infrastructure that align with their skills and interests.
Role Overview
This Software Engineer role centers on building and evolving the compute platform that supports OpenAI’s research and products. Candidates may bring expertise in low-level systems, high-performance computing, distributed infrastructure, reliability, CaaS, agent infrastructure, developer platforms, tooling, or infrastructure user experience. The most important qualities are strong analytical skills, the ability to write resilient code, and a collaborative approach that helps colleagues move faster and with more confidence.
What You Will Work On
Working close to hardware or at the user interaction layer
Developing CaaS and agent infrastructure
Managing control and data planes that connect the system
Bringing new supercomputing capabilities online
Optimizing training workloads through profiler traces and benchmarks
Improving NCCL and collective communication
Analyzing GPUs, NICs, topology, firmware, thermal dynamics, and failure modes
Designing abstractions to unify diverse clusters into a single platform
Areas of Expertise
No one is expected to cover every area listed. Some engineers focus on system performance, kernel or runtime behavior, large-scale networking protocols, RDMA, NCCL, GPU hardware, benchmarking, scheduling, or hardware reliability. Others improve the platform’s usability through APIs, tools, workflows, and developer experience. The team values strong engineering judgment and a drive to advance the field.
Full-time|$190K/yr - $253.8K/yr|On-site|Mountain View, California; San Francisco, California
P-931 At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world—from revolutionizing transportation to fast-tracking medical innovations. We achieve this by developing and managing the foremost data and AI infrastructure platform, enabling our clients to leverage profound data insights to enhance their enterprises. Founded by engineers with a customer-centric approach, we seize every chance to resolve technical challenges, from crafting next-generation UI/UX for data interactions to scaling our services and infrastructure across millions of virtual machines. And we’re just getting started. Within Databricks, the Compute Infrastructure organization is responsible for building and operating the essential framework that supports all Data, AI, and stateful workloads across major cloud platforms. Our system launches tens of millions of VMs daily, manages thousands of Kubernetes clusters, and must deliver exceptional elasticity, reliability, and cost-effectiveness. We are in search of an Engineering Manager to lead a team focused on pivotal components of this platform. Your contributions will significantly impact product delivery speed, customer satisfaction, and our company's scalability. The impact you will have: Own and enhance the compute platform to support all Databricks workloads, enabling engineers to create top-tier products with high velocity and superior performance. Recruit exceptional engineers and nurture their development through guidance, feedback, and career advancement opportunities. Elevate the technical and operational standards through robust design practices, rigorous testing, and a culture of engineering excellence and platform thinking. Collaborate with engineering and product leadership to establish long-term strategies and roadmaps. Lead cross-functional initiatives encompassing both product and infrastructure domains. 
Influence architectural decisions that extend beyond your immediate team.
Full-time|$150K/yr - $200K/yr|On-site|San Francisco, CA
At Sift, we are revolutionizing the way cutting-edge machines are constructed, tested, and managed. Our innovative platform provides engineers with real-time visibility into high-frequency telemetry, effectively removing bottlenecks and facilitating quicker, more dependable development.
Sift originated from our experience at SpaceX, contributing to projects like Dragon, Falcon, Starlink, and Starship, where the demands of scaling telemetry, debugging flight systems, and ensuring mission reliability necessitated a new kind of infrastructure. Founded by a talented team from SpaceX, Google, and Palantir, Sift is tailored for mission-critical systems where precision and scalability are imperative.
As one of the pioneering engineers at Sift, your role will extend beyond just coding: you will play a crucial part in defining the architecture, shaping the product, and influencing the culture of a company dedicated to addressing real engineering challenges. If you're eager to take on intricate technical obstacles and build foundational systems that support complex machines from the ground up, we would love to connect with you.
Full-time|$180K/yr - $250K/yr|On-site|San Francisco
Join our innovative team at fal as a Staff Software Engineer specializing in large-scale computation platforms. We are seeking a seasoned software engineer with extensive experience in developing backend systems that efficiently orchestrate workloads and manage resource constraints. Your expertise in foundational cloud infrastructure and Linux provisioning will be crucial as you work towards achieving high reliability and scalability with minimal operational overhead.
Who We Are
Serval is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.
At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.
Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer: a system of agents that integrates across business functions, maintaining the momentum of modern companies.
We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.
Role Overview
As a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.
What You'll Do
Design, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.
Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.
Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.
Provide technical support and guidance to enterprise customers during installation and deployment phases.
Netic is revolutionizing the essential services sector with our AI-driven revenue engine, empowering the backbone of the American economy.
With $43M in funding from leading investors such as Founders Fund, Greylock, Hanabi, and Dylan Field, who spearheaded our Series B, we have enabled our clients to secure hundreds of thousands of jobs across various service industries in North America. Today, numerous companies thrive entirely on an AI-first model powered by Netic.
As a member of our team consisting of innovative builders from top organizations such as Scale, Databricks, HRT, Meta, MIT, Stanford, and Harvard, you will be at the forefront of integrating frontier AI into the physical economy, where challenges are complex, data is intricate, and impacts are immediate and substantial.
In the role of a founding Product Infrastructure Engineer, you will design and scale the crucial infrastructure that supports our autonomous AI agents, addressing real-world challenges with significant, tangible outcomes. You will work alongside a passionate team of builders to develop infrastructure and processes from scratch, utilizing state-of-the-art cloud and orchestration technologies. If you excel in dynamic, ambiguous settings and are eager to set new benchmarks in the agentic domain, this is your chance to make a lasting impact.
Join our innovative team at Astranis as a Senior Software Engineer specializing in Infrastructure. In this role, you will be responsible for designing, implementing, and maintaining robust infrastructure solutions that support our cutting-edge satellite technology. Your expertise will play a crucial role in enhancing the reliability and scalability of our systems.
Full-time|$190K/yr - $280K/yr|Hybrid|San Francisco, California
About Sentry
Sentry is dedicated to eliminating poor software experiences. Our mission is to empower developers to create high-quality software swiftly, allowing everyone to enjoy technology to its fullest.
With over $217 million raised in funding and a community of over 100,000 organizations, including giants like Disney, Microsoft, and Atlassian, we are developing state-of-the-art performance and error monitoring tools. Our solutions help our partners minimize time spent on bug fixes and maximize product development.
In our commitment to collaboration, Sentry follows a hybrid work model across our global offices. We have designated Mondays, Tuesdays, and Thursdays as in-office days to foster effective teamwork. If you are passionate about building tools that enhance the digital experience, join us in creating the next generation of software monitoring solutions.
About the Role
At Sentry.io, we offer vital services for diagnosing application health issues. Our tools are crucial for organizations aiming to respond adeptly in dynamic markets. We ensure a seamless and enjoyable experience in the development and deployment of these tools through a robust continuous integration environment and an insightful deployment pipeline.
As part of the Infrastructure Engineering team, your contributions will be instrumental in supporting Sentry's growth and enabling engineering teams to operate with agility and confidence.
Your responsibilities will include designing, developing, and maintaining internal software and platform capabilities that alleviate the cognitive load associated with infrastructure and developer tooling. You will create dependable, reusable abstractions that facilitate rapid shipping of features while incorporating durability, security, and operational excellence into service development and management.
This role demands strong engineering judgment: selecting reliable technologies, planning for scalability from the outset, and crafting solutions that serve multiple teams. Your focus will be on practical systems that enhance reliability and ownership across the organization, driving adoption through comprehensive documentation, well-designed APIs, and seamless developer experiences that integrate into daily workflows.
Ultimately, you will empower engineering teams to flourish within a culture of ownership, enabling them to deploy, manage, and evolve services confidently while minimizing operational burdens.
Key Responsibilities
Design systems that scale with company growth, ensuring a balance of reliability, performance, and cost-efficiency.
Develop platform services that enhance internal operations and developer productivity.
Julius operates as an applied AI lab, developing advanced coding agents for a broad user base. The platform executes about 1 million lines of code every 36 hours, serves over 1 million users, and generates more than 3 million visualizations. All code runs in tightly managed, isolated sandboxes. Julius is a revenue-generating business backed by AI Grant, YCombinator, Bessemer Venture Partners, and founders from leading technology companies.
Role overview
This mid to senior level Software Engineer - Infrastructure role focuses on designing and scaling the code-execution sandboxes that form the backbone of Julius. The infrastructure spans cloud platforms such as AWS and GCP, orchestrating over 500,000 containers each month. The main priorities are reliability, performance, and security in a multi-tenant compute environment.
What you will do
Design and maintain secure, multi-tenant container infrastructure with rapid startup and intelligent autoscaling.
Deploy and manage cloud resources using Helm and Terraform, including SSO, network controls, and audit logging.
Enhance observability through metrics, traces, and logs.
Define SLOs and lead incident response efforts.
Optimize container images, scheduling, networking, and costs.
Develop and enforce fair-use and rate-limiting policies.
Requirements
Hands-on experience with production Kubernetes and container internals (Docker or containerd), as well as strong networking skills.
Familiarity with cloud services (AWS, GCP, or Azure) and Infrastructure as Code tools such as Terraform and Helm.
Proficiency with monitoring and logging tools like Prometheus, Grafana, OpenTelemetry, ELK, or Vector.
Understanding of security best practices for containerized, multi-tenant systems.
Preferred qualifications
Experience with technologies such as gVisor, Kata, Firecracker, Cilium, eBPF, GPU scheduling, or serverless autoscaling frameworks (KEDA, Knative, Karpenter).
Interest in AI projects, especially those involving large language models (LLMs).
Benefits and compensation
Competitive base salary
Substantial equity options
Comprehensive health and dental coverage
Gym reimbursement
Daily team meals
Commuter assistance
Julius offers the chance to work in San Francisco, CA, alongside a small and highly skilled team tackling large-scale infrastructure challenges. The systems here operate at significant scale and complexity, providing opportunities to solve demanding technical problems in a collaborative setting.
Compensation: Competitive base salary + substantial equity
Benefits: Health & dental insurance, gym reimbursement, daily team lunches, 401(K)
About Julius
At Julius, we're pioneering advancements in applied AI by developing cutting-edge coding agents. Our platform executes approximately 1 million lines of code every 36 hours, serving over 1 million users and generating 3 million+ visualizations. We manage all code in isolated remote containers. As a revenue-generating entity, we are backed by AI Grant and founders with remarkable backgrounds from companies like Vercel, Notion, Perplexity, Palantir, Replit, Zapier, Intercom, and Dropbox.
The Role
Join us in building and scaling the robust code-execution platform that powers Julius, across both cloud and on-prem environments. We orchestrate over 500,000 containers/month and the demand is growing rapidly. You will take ownership of reliability, performance, and security within our multi-tenant compute environment.
Your Responsibilities
Design and manage a secure, multi-tenant container infrastructure that ensures quick startup and intelligent autoscaling.
Implement on-prem/private cloud deployments using Helm and Terraform, integrating SSO, network controls, and audit logging.
Enhance observability (metrics, traces, logs) with well-defined SLOs and lead incident response initiatives.
Optimize images, scheduling, networking, and costs, while developing fair-use and rate-limiting controls.
Your Qualifications
Strong experience with production Kubernetes and container internals (Docker/containerd); solid understanding of networking principles.
Familiarity with cloud environments (AWS/GCP/Azure) and Infrastructure as Code (Terraform/Helm).
Proficiency in monitoring and logging tools (Prometheus, Grafana, OpenTelemetry, ELK/Vector).
Understanding of security best practices for containerized, multi-tenant systems.
Preferred Qualifications
Experience with gVisor, Kata, Firecracker; Cilium/eBPF; GPU scheduling; serverless autoscaling (KEDA/Knative/Karpenter).
Proven experience delivering on-prem or air-gapped enterprise software solutions.
A passion for AI, with experience building side projects involving LLMs.
Why Join Julius?
Be part of a small, senior team where your contributions will have a massive impact. Tackle challenging infrastructure problems at a meaningful scale.
Senior Software Engineer, Infrastructure & Platform
Role Overview
In the role of Senior Software Engineer, Infrastructure & Platform at AfterQuery, you will take on the exciting challenge of designing and constructing the essential infrastructure that drives our innovative data generation, evaluation, and agentic systems.
Your responsibilities will include developing shared platforms that empower our engineering and research teams to execute large-scale human-in-the-loop workflows, evaluation harnesses, and automated data pipelines essential for training cutting-edge AI models.
This position demands a high level of technical expertise and offers extensive ownership. You will be responsible for architecting and building the foundational infrastructure relied upon by numerous engineers, ensuring that systems are scalable, reliable, and capable of handling high-throughput workloads.
Collaboration with the founding team will be key as you define system architecture, establish best engineering practices, and create the infrastructure that supports the evolution of AI development.
Join our dynamic team at Parafin as a Senior Software Engineer specializing in Infrastructure. In this pivotal role, you will design, develop, and maintain robust infrastructure solutions that support our scalable applications. Your expertise will help us enhance system performance, reliability, and security.
We are looking for innovative thinkers who thrive in a collaborative environment. You will work closely with cross-functional teams to implement cutting-edge technologies that drive our product forward.
Full-time|$200K/yr - $200K/yr|On-site|San Francisco
Join Convex in revolutionizing application development!
At Convex, we are on a mission to redefine how software is constructed on the Internet. Our innovative platform enables developers to create swift, dependable, and dynamic applications without the need for a backend team. We offer a comprehensive full-stack application platform, meticulously designed with abstractions for databases, computing, and backend services, allowing both developers and LLMs to innovate rapidly, ensuring products that are scalable and maintain simplicity throughout their lifecycle.
About Our Team:
Our Convex team comprises engineers who have architected and built some of the largest backends globally, managing exabytes of data and millions of transactions per second. We are a friendly, collaborative group of passionate individuals who thrive on in-person collaboration in our San Francisco office.
Position Overview:
As Convex evolves, we are seeking outstanding senior or staff-level engineers to help us architect and sustain the future of our infrastructure at scale. If you have a passion for distributed systems and a robust background in designing and managing web infrastructure, we want to connect with you!
We value robust architecture, effective collaboration, and simplicity. Our team embraces high ownership and places significant emphasis on operational excellence. This role is not solely focused on operations; we seek individuals who are dedicated to designing and constructing systems in the most effective manner possible, especially in a startup environment.
Your Responsibilities:
Architect, construct, and oversee Convex’s global cloud infrastructure.
Analyze and enhance the performance and reliability of our systems.
Independently prioritize projects, collaborating closely with the engineering team and CTO.
Establish best practices and reliability standards as we expand our team and systems.
Develop sophisticated systems and database code.
Engage with feedback from leadership regarding seeking simpler and more elegant solutions.
What We Value:
A strong enthusiasm for distributed systems and backend infrastructure.
A collaborative spirit and a desire to grow with the team.
A commitment to best practices and maintaining high standards in engineering.
About Eventual
At Eventual, we are reimagining how AI applications process vast amounts of data, from images to complex datasets. Traditional data platforms are not equipped to handle the petabytes of multimodal data essential for AI, causing teams to struggle with inadequate infrastructure. Founded in 2022, our mission is to simplify data querying, making it as intuitive as working with tables while ensuring scalability for production workloads.
Our open-source engine, Daft, is specifically designed for real-world AI systems. It efficiently manages external APIs, GPU clusters, and addresses failures that traditional engines cannot handle. Daft is already integral to operations at leading companies such as Amazon, Mobileye, Together AI, and CloudKitchens.
We pride ourselves on our exceptional team, which includes talents from Databricks, AWS, Nvidia, Pinecone, GitHub Copilot, Tesla, and others. We have quadrupled our team size in just a year, supported by Series A and seed funding from notable investors like Felicis, CRV, Microsoft M12, and Y Combinator. We are now eager to expand further. Join us: Eventual is just getting started.
We are seeking passionate individuals who are excited to collaborate in a close-knit team environment, working together four days a week in our San Francisco Mission district office.
Your Role:
As a Software Engineer, you will take charge of developing Eventual's core products and architecture. You’ll deliver features that our customers will use immediately and collaborate with a dedicated team that values open communication and cross-functional teamwork. Our fast-paced environment is focused on solving a variety of complex technical and product challenges. While our experienced team is here to provide guidance and mentorship, we appreciate engineers who can independently identify and tackle challenging technical issues.
Key Responsibilities:
Design and develop highly reliable and resilient products and features.
Collaborate closely with cross-functional product and customer-facing teams to understand requirements and deliver thoughtful solutions.
Write high-quality, extensible, and maintainable code.
Create and build scalable applications and components.
Architect and manage Kubernetes clusters optimized for our needs.
At Hover, we empower individuals to conceptualize, enhance, and safeguard the spaces they cherish. Utilizing proprietary AI and over a decade's worth of real property data, we provide answers to pivotal questions such as, 'What will it look like?' and 'What will it cost?' Our platform offers homeowners, contractors, and insurance professionals accurately measured, interactive 3D models of properties, all achievable from a smartphone scan in mere minutes.
Driven by curiosity and purpose, we maintain a strong commitment to our customers, communities, and one another. We believe that diverse perspectives foster the best ideas, and we take pride in nurturing an inclusive, high-performance culture that encourages growth, accountability, and excellence. Supported by premier investors like Google Ventures and Menlo Ventures, and trusted by industry leaders such as Travelers, State Farm, and Nationwide, we are revolutionizing how individuals perceive and interact with their environments.
About the Role
As a Senior Software Engineer specializing in Infrastructure, you will delve into cloud infrastructure challenges unique to a company focused on 3D data, computer vision, and machine learning. Your enthusiasm for building internal tools and your talent for crafting elegant solutions to complex issues will be crucial in this role.
Our Infrastructure team is responsible for everything beyond the application binary, serving as a critical partner to the rest of the engineering department. Through automation, we aim to streamline processes, ensuring that the simplest path is also the fastest and most secure. We manage and optimize all cloud infrastructure components including our Kubernetes environment, databases, networks, storage, and caching systems. Collaborating with engineering peers, we establish consistent solutions to common architectural challenges, particularly those involving rich geospatial and machine learning workloads. We are well-versed in best practices for cloud architecture and CI/CD, leveraging application development as a means to implement these practices.
Your Contributions
You will play a pivotal role in developing straightforward solutions to intriguing problems, thereby enhancing the foundation upon which our engineering teams build. Collaborating closely with engineers across the organization, you will help make their applications faster, easier to manage, and more reliable in production. Your work will span frontend, backend, computer vision, data, security, and machine learning teams to scale new ideas into production effectively. Given the small and highly collaborative nature of our team, you can expect a varied and impactful workload, which may include:
Designing scalable cloud architecture
Enhancing CI/CD pipelines and developer tooling
Full-time|$170K/yr - $220K/yr|On-site|San Francisco, CA
At Siftstack, we are revolutionizing the development, testing, and operation of modern machines. Our innovative platform provides engineers with real-time observability over high-frequency telemetry, effectively removing bottlenecks and accelerating the development process.
Emerging from our groundbreaking work at SpaceX on projects including Dragon, Falcon, Starlink, and Starship, Siftstack was founded by an exceptional team with experience from SpaceX, Google, and Palantir. We are dedicated to building mission-critical systems where precision and scalability are essential.
As a senior engineer at Siftstack, you will not just write code; you will have a significant role in shaping the architecture, guiding the product's evolution, and influencing the culture of a company that tackles real engineering challenges. If you are eager to face complex technical issues and contribute to foundational systems for innovative machines, we want to connect with you.
Join the Space Exploration Journey!
As a Senior Software Engineer specializing in Space Infrastructure, you will play a pivotal role in enhancing our capabilities to manage a diverse fleet of satellites, including dedicated, rideshare, and constellation missions. Your work will involve the integration of automated satellite operations, both ground and flight software, while tackling challenges encountered in orbit.

Our team is dedicated to ensuring the dependable, efficient, and standardized performance of Loft’s space infrastructure. You will oversee the operational stability of Loft satellites, focusing on the satellite bus, the Hub, and Loft's payloads, which serve as platforms for executing customer missions.

Reliability is the cornerstone of Loft's business model and that of our clients. This role offers you the flexibility to engage with various systems, from coding for Cockpit, our mission control system, to writing software that runs onboard our satellites. Additionally, you may have the opportunity to serve as a Flight Director, overseeing the health and safety of our satellite fleet.
Join the Crew at Ivo!

Engineering Excellence
At Ivo, we are pioneers in the realm of technology. Our engineers are the architects of innovation, creating groundbreaking solutions that redefine the industry. Recent triumphs include:
An AI-powered assistant integrated into MS Word that edits documents with remarkable precision.
Revolutionizing embedding models with cutting-edge agentic RAG technology.
Pioneering legal fact extraction with large-scale LLM applications.
Developing an intelligent legal assistant capable of navigating vast contract databases while ensuring accuracy.
Implementing advanced clustering techniques for legal documents based on familial relationships.
Introducing automatic deviation analysis to uncover hidden risks across extensive contract databases.
Merging contracts with amendments to create comprehensive timelines that have left clients in tears of joy.

Your Role
As a Senior Infrastructure Engineer, you will lay the groundwork for Ivo's platform. Your responsibilities will include:
Shaping the future of our infrastructure while enjoying the flexibility to design our systems.
Managing a multitude of customer deployments, each with unique containers, databases, and VPCs.
Instrumenting the system to identify performance bottlenecks and errors.
Creating intuitive dashboards and alerts to aggregate metrics, logs, and health checks.
Leading incident responses related to infrastructure challenges.
Optimizing our CI/CD pipeline to improve deployment times significantly.

We seek someone passionate about LLMs and eager to push the boundaries of DevOps innovation. Join us and be a vital part of our engineering team!
Full-time|$172K/yr - $209K/yr|On-site|San Francisco, CA - US
About Crusoe Energy Systems
Crusoe Energy Systems manages every layer of AI infrastructure, from energy generation to advanced computational resources. The team focuses on making AI infrastructure more efficient and environmentally conscious, addressing the growing global demand for computing power. Based in San Francisco, Crusoe brings together experts in energy, manufacturing, data center construction, and cloud services.

Role Overview: Senior Software Engineer - Cloud Infrastructure
This Senior Software Engineer position centers on designing and building cloud infrastructure management systems for Crusoe Cloud, a vertically integrated, AI-focused platform. The engineer will help deliver complete solutions that support the company’s business goals, including system planning, monitoring, deployment, and operations. The role involves hands-on work developing platforms, tools, and frameworks that emphasize reliability, scalability, operational efficiency, and ease of use. As Crusoe Cloud grows, this engineer will play a key part in streamlining infrastructure planning and management processes.

What You Will Do
Work closely with cross-functional teams to design and implement infrastructure management software and availability platforms for customers using Crusoe’s AI infrastructure.
Help improve the reliability, scalability, and security of systems and platforms.
Develop workflows that support business objectives and performance targets.
Build and maintain high-performing, highly available cloud solutions to meet expanding infrastructure needs.

Who Thrives Here
Engineers who enjoy solving complex problems, move quickly, and want to work alongside a diverse, supportive team will find this role rewarding. Crusoe values collaboration and a shared drive to advance AI infrastructure.

Location
San Francisco, CA - US
Apr 17, 2026