Software Engineer Cloud jobs in San Francisco – Browse 5,638 openings on RoboApply Jobs

Software Engineer Cloud jobs in San Francisco

Open roles matching “Software Engineer Cloud” with location signals for San Francisco. 5,638 active listings on RoboApply Jobs.

1 - 20 of 5,638 Jobs
Sofi
Full-time|Remote

Sofi is hiring a Software Engineer, Cloud to help build and support cloud-based solutions for a diverse and expanding customer base. This position plays a key part in shaping the technology that powers Sofi’s financial products. Role overview: This role centers on designing, developing, and maintaining systems in the cloud. The Software Engineer will work closely with colleagues from different teams to introduce new technologies and refine existing systems. What you will do: Create and maintain cloud-based applications and services. Work with cross-functional teams to implement new features and improve performance. Contribute to projects that enhance the overall user experience. Impact: Work in this role supports Sofi’s mission to make financial tools accessible to everyone. Contributions directly affect how customers interact with and benefit from Sofi’s offerings.

Apr 28, 2026
sfcompute
Full-time|On-site|San Francisco, CA

At sfcompute, we are pioneering a solution to mitigate the risks associated with one of the largest infrastructure build-outs in history. Our focus is on transforming how GPU clusters are financed. By creating a robust market for GPU offtake, we enable companies to secure contracts for leasing clusters even before they are constructed, significantly reducing the financial risks involved. We recognize that financing GPU clusters can be perilous due to slim margins and massive volumes. Lenders are hesitant to take on the risks associated with cluster developers, while developers are equally wary of unsold clusters. Our approach of using fixed-price long-term contracts allows us to transfer this risk to customers effectively. As AI technology advances, only those who can shoulder the financial risk will have access to the required computing power. We aim to democratize access by allowing smaller entities, such as startups, to engage in the GPU market without the burden of extensive long-term contracts. Join us in creating a dynamic and liquid market for GPU offtake that empowers businesses of all sizes. About the Role: We are seeking a proactive Software Engineer to contribute to the development of our compute delivery platform that supports our innovative offtake machine. In this position, you will design and implement advanced systems that seamlessly connect our compute marketplace with the orchestration software for virtual machines operating on state-of-the-art HPC hardware.

Dec 12, 2025
Cloud Software Engineer

Eventual Computing

Full-time|On-site|San Francisco

About Eventual Computing: At Eventual Computing, we understand that cutting-edge AI applications, from foundational models to autonomous vehicles, demand the ability to process vast amounts of images, video, and complex datasets. Unfortunately, current data platforms, such as Databricks and Snowflake, are primarily designed for traditional spreadsheet analytics, making them ill-equipped for the petabytes of multimodal data that drive AI innovations. This inefficiency results in teams spending valuable time on fragile infrastructure instead of advancing their research and developing their core products. Founded in 2022, our mission is to revolutionize data querying, making it as intuitive as working with tables while ensuring it can scale to meet production demands. Our open-source engine, Daft, is tailored for real-world AI systems, adept at coordinating with external APIs, managing GPU clusters, and addressing failures that traditional engines struggle with. Daft is already powering essential workloads for industry leaders like Amazon, Mobileye, Together AI, and CloudKitchens. Our team comprises top-tier talent from renowned organizations including Databricks, AWS, Nvidia, Pinecone, GitHub Copilot, and Tesla. Having quadrupled in size within a year, we are backed by Series A and seed funding from prestigious investors like Felicis, CRV, Microsoft M12, Citi, Essence, Y Combinator, Caffeinated Capital, Array.vc, and prominent angels including co-founders of Databricks and Perplexity. We are excited to expand our team further; this is just the beginning for Eventual Computing. We invite enthusiastic individuals to join our close-knit team, working collaboratively four days a week in our San Francisco Mission District office.

Sep 22, 2025
Sofi
Full-time|On-site|CA - San Francisco; WA - Seattle

Join Sofi as a Senior Software Engineer specializing in Cloud Efficiency, where you will play a crucial role in enhancing the performance and scalability of our cloud-based applications. You will collaborate with cross-functional teams to design and implement innovative solutions that drive operational excellence and customer satisfaction. Your expertise in cloud technologies and software development will be instrumental in optimizing our infrastructure, ensuring reliability, and supporting the growth of our dynamic platform.

Mar 25, 2026
OpenAI
Full-time|On-site|San Francisco

About the Team: At OpenAI, we are pioneering the future of software engineering with Codex, an advanced AI-powered software engineer designed to assist, delegate tasks, and proactively manage future challenges. Our dynamic team blends research, engineering, design, and product expertise to enhance the Codex agent harness and product, ensuring it excels in complex software engineering tasks. The Codex team is dedicated to creating cutting-edge AI systems capable of writing code, reasoning about software, and serving as intelligent assistants for both developers and non-developers. We oversee the complete lifecycle of experimentation, deployment, and refinement of innovative coding capabilities across research, engineering, product, and infrastructure. Codex Cloud is focused on developing cloud-first software engineering products and the essential platform capabilities that enable agentic experiences across OpenAI. This cross-functional team collaborates across the stack to deliver both exceptional product experiences and foundational platform capabilities. About the Role: As a member of Codex Cloud, you will take ownership of cloud-based agentic experiences, such as code reviews and cloud tasks, alongside the essential runtime/orchestration layer for these tasks. This infrastructure transforms Codex from a code generator into a scalable AI software engineer. It provides secure, sandboxed environments where Codex can execute commands, manage files, run tests, and progressively enhance its output across real codebases, just like a human developer. Furthermore, Codex Runtime serves as the distributed systems foundation that enables numerous supervised AI agents to operate within OpenAI's and clients' data centers, addressing increasingly complex objectives over extended periods. By providing agents with a tangible working context and dependable execution environment, Codex Runtime empowers Codex to validate its modifications, debug errors, and produce production-ready results, defining how AI agents interact safely and reliably with the world's software. Your Responsibilities: Influence the evolution of Codex by analyzing actual user interactions with AI-powered software engineering, driving enhancements across product, infrastructure, and model behavior to establish Codex as a trusted collaborator for organizations. Craft comprehensive customer-facing software engineering experiences for both consumer and enterprise segments, automating and elevating the software development lifecycle.

Jan 21, 2026
Zipline
Full-time|$180K/yr|On-site|South San Francisco, California, USA

Senior Software Engineer – Cloud Communications Platform. Location: South San Francisco, California, USA. About Zipline: Are you passionate about making a difference in the world? At Zipline, we are dedicated to revolutionizing the movement of goods globally. Our mission is to tackle the world’s most pressing access challenges by developing the first instant delivery and logistics system that serves all individuals, irrespective of their location. From facilitating Rwanda’s national blood delivery network and distributing COVID-19 vaccines in Ghana to offering on-demand home delivery for major retailers and enabling healthcare providers to deliver care directly to homes in the U.S., we are reshaping logistics for businesses, governments, and consumers alike. While our technology is sophisticated, the concept is straightforward: a teleportation service that delivers what you need, when you need it. By utilizing robotics and autonomy, we are committed to decarbonizing delivery, alleviating road congestion, minimizing fossil fuel usage, and enhancing the resilience of the global supply chain. Join Zipline and contribute to creating an equitable and resilient logistics system that impacts billions of lives. About You and The Role: Zipline operates a large-scale autonomous system that relies on dependable, low-latency communication between vehicles, ground infrastructure, and cloud services. Our Cloud Communications team is responsible for the platform that transfers critical data from embedded systems to the cloud, ensuring data reliability, scalability, and observability. We seek a Senior Software Engineer to enhance and fortify this platform. This role centers on connecting hardware assets to the cloud, hosting and orchestrating new data use cases, and constructing distributed observability across embedded software, cellular networks, and cloud microservices. In this position, you will collaborate closely with the Embedded and Autonomy teams that develop software to extract data from devices. Your primary responsibility will be to guarantee that data is securely ingested into the cloud, deduplicated, stored, processed, monitored, and accessible for both real-time and offline workflows. This is a high-ownership role directly influencing flight reliability, operational visibility, and the scalability of our global network. What You’ll Do: Lead the evolution of services connecting vehicles, charging and loading stations, fulfillment hardware, and other field-deployed infrastructure to the cloud. Design and maintain asset-to-cloud APIs, message schemas, and communication clients in collaboration with embedded teams. Develop and manage ingestion pipelines for new data use cases.

Mar 7, 2026
OpenAI
Full-time|On-site|San Francisco

Join Our Innovative Team: The Applied Engineering team at OpenAI is dedicated to bridging the gap between research, engineering, product, and design, delivering cutting-edge AI technology to consumers and businesses alike. As a pivotal member of our team, you will manage the core infrastructure that underpins products such as ChatGPT and our API. This includes overseeing our Kubernetes clusters, infrastructure deployment, networking stack, cloud abstractions, and more. Our mission is to learn from our deployments and ensure the responsible and safe use of AI technology. We place a higher priority on safety than on unchecked growth. About Your Role: As a vital contributor to the cloud infrastructure team, you'll be responsible for constructing and maintaining infrastructure abstractions that facilitate swift and scalable product delivery. This position is based in our San Francisco, CA office. Your Responsibilities: Architect and develop robust development and production platforms that ensure reliability and security at scale. Optimize our infrastructure for scalability to meet future demands. Foster a diverse, equitable, and inclusive work culture that encourages open communication and challenges conventional thinking. Participate in an on-call rotation to maintain the reliability of the systems we build and respond to critical incidents as necessary. You Will Excel in This Position If You: Possess over 5 years of experience in building core infrastructure. Have extensive experience with orchestration systems such as Kubernetes at scale. Are skilled in creating abstractions over cloud platforms. Take pride in developing and managing scalable, reliable, and secure systems. Thrive in environments characterized by ambiguity and rapid change. This role is exclusively located at our San Francisco headquarters. We offer relocation assistance to qualified candidates.

Aug 4, 2025
Lyft
On-site|San Francisco, CA

At Lyft, we are dedicated to connecting people and creating a community where every team member feels valued and empowered to reach their full potential. As a pivotal player in transforming how our communities move, Lyft's engineering team is rapidly expanding. We are seeking passionate Software Engineers specialized in Security to join our dynamic Security team. Together, we will enhance our ability to deliver secure services at scale. Lyft is entrusted with the sensitive information of both drivers and passengers, and we take the responsibility of safeguarding that data seriously. Our Security team spearheads initiatives across the organization to protect our systems and uphold user trust. Our work encompasses designing and building a robust security architecture, collaborating with various teams during the development and launch of new products, anticipating potential challenges, and managing security incidents effectively. Our impact spans the entire organization, covering all aspects of the technology stack, including infrastructure, web applications, mobile apps, IT, and even autonomous vehicles. We adopt an engineering-focused approach to security, aiming to automate and streamline our processes while ensuring frequent updates. Explore more about our innovations on our blog at https://eng.lyft.com/tagged/security. The Cloud Security team is dedicated to enhancing Lyft's security posture by architecting a comprehensive security model tailored for our cloud infrastructure, protecting both our employees and intellectual property. As a Senior Software Engineer, you will play a crucial role in shaping this team and driving high-impact security initiatives. Your responsibilities will include leading security reviews, implementing detection measures, addressing vulnerabilities, enforcing the principle of least privilege, and establishing secure configurations for our multi-cloud and container environments.

Dec 26, 2025
Anthropic
Full-time|Remote|San Francisco, CA | Seattle, WA

Join our innovative team at Anthropic as a Software Engineer specializing in Cloud Inference Safeguards. In this role, you will play a crucial part in developing and enhancing the systems that ensure the robustness and security of our cloud-based inference services. You will collaborate with cross-functional teams to design, implement, and maintain scalable solutions that meet our high standards for reliability and performance.

Mar 27, 2026
Crusoe
Full-time|$166K/yr - $201K/yr|On-site|San Francisco, CA - US

At Crusoe, we are on a mission to accelerate the availability of energy and intelligence. We are building the foundational technology that empowers individuals to innovate boldly with AI while maintaining speed, scale, and sustainability. Join us in the AI revolution with sustainable technology at Crusoe, where you will lead significant innovations, make a real impact, and collaborate with a team that is pioneering responsible and transformative cloud infrastructure. About the Role: We are seeking a highly proficient engineer with extensive experience in designing and managing observability platforms at scale. You will be responsible for architecting, developing, and operating Crusoe’s next-generation observability stack, which will allow engineers to gain insights into the internal state of distributed systems through metrics, logs, and traces. Your contributions will guarantee reliability, performance, and actionable insights across Crusoe’s global infrastructure and cloud platform. Key Responsibilities: Design and manage scalable observability systems (metrics, logging, tracing) in multi-datacenter Kubernetes environments. Architect comprehensive telemetry pipelines, covering ingestion, storage, querying, and visualization. Enhance monitoring and alerting mechanisms with Prometheus, Alertmanager, Thanos/Cortex, Grafana, and OpenTelemetry. Develop scalable log collection and processing pipelines utilizing Fluent Bit, Vector, Loki, or ELK/OpenSearch stacks. Implement distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrate with service meshes, load balancers, and APIs. Establish and promote the adoption of SLOs, SLIs, and error budgets across various services and teams. Automate the provisioning and scaling of observability infrastructure using Kubernetes, Terraform, and custom tools (Go, Python). Ensure the reliability and cost-effectiveness of telemetry pipelines while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure). Integrate security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls. Collaborate with engineering teams to embed observability into applications, services, and infrastructure. Mentor engineers and influence Crusoe’s observability strategy and technical roadmap.

Oct 1, 2025
Crusoe
Full-time|$215K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to propel the availability of energy and intelligence to new heights. We are developing the innovative engine that enables ambitious AI-driven creation without compromising on scale, speed, or environmental sustainability. Join us in the AI revolution with sustainable technology at Crusoe. You will play a pivotal role in fostering significant innovation, creating a real impact, and collaborating with a team that is leading the charge for responsible and transformative cloud infrastructure. Crusoe aims to be the world’s preferred AI-centric cloud infrastructure provider, crafting vertically integrated, purpose-built solutions trusted by Fortune 500 companies to support their most complex AI applications. We are redefining the landscape of AI cloud infrastructure, aligning the future of computing with climate sustainability. Our platform is recognized as the gold standard in reliability and performance, with data centers optimized for AI workloads powered by clean, renewable energy. About the Role: We are looking for skilled Staff Software Engineers to design, build, and scale the customer-facing platforms and services of Crusoe Cloud. The Cloud Customer Experience (CCX) team is dedicated to delivering a premier user experience for our AI-focused cloud platform. Your mission will be to ensure a seamless and intuitive user flow while maintaining backend reliability and scalability that distinguishes us from our competitors. You will focus on developing highly scalable and reliable services for authentication, user management, billing, usage, and more. Key Responsibilities: Design, develop, and maintain scalable and reliable services that enhance our cloud platform’s user experiences. Work collaboratively with cross-functional teams, including product and design, to assess tools, frameworks, and customer requirements, crafting innovative solutions that set Crusoe Cloud apart. Architect and build backend systems that support our cloud platform, encompassing everything from authentication flows to scalable and reliable access to infrastructure resources. Contribute to architectural decisions that enhance reliability and maintainability across the organization. Mentor fellow engineers, improve hiring practices, and contribute to fostering a robust, inclusive engineering culture.

Sep 17, 2025
Anthropic
On-site|San Francisco, CA | New York City, NY | Seattle, WA

About Anthropic: At Anthropic, our mission is to develop AI systems that are safe, interpretable, and controllable. We believe in harnessing AI for the greater good of our users and society at large. Our dynamic team comprises dedicated researchers, engineers, policy experts, and business leaders who collaborate to create beneficial AI systems. About the Role: The Cloud Inference team is responsible for scaling and optimizing Claude to cater to a vast array of developers and enterprise clients across platforms such as AWS, GCP, Azure, and future cloud service providers (CSPs). We manage the complete lifecycle of Claude on each cloud platform, from API integration and intelligent request routing to inference execution, capacity management, and daily operations. Our engineers wield significant influence, driving multiple key revenue streams while optimizing one of Anthropic's most valuable resources: compute power. As we expand to additional cloud providers, the intricacies of efficiently managing inference across diverse platforms with varying hardware, networking frameworks, and operational models grow substantially. We seek engineers adept at navigating these variances, developing strong abstractions that are effective across providers, and making informed infrastructure choices that keep us cost-effective at scale. Your contributions will enhance the operational scale of our services, expedite our capacity to launch cutting-edge models and innovative features to clients across all platforms, and ensure our large language models (LLMs) adhere to stringent safety, performance, and security standards.

Feb 5, 2026
Prima Mente
Full-time|On-site|San Francisco

About Prima Mente: At Prima Mente, we are pioneering the integration of artificial intelligence with frontier biology. Our mission is to generate proprietary data and develop general-purpose biological foundation models that translate groundbreaking discoveries into tangible research and clinical outcomes. We are focused on understanding the complexities of the human brain, safeguarding it from neurological disorders, and enhancing cognitive health. With a dedicated team of AI researchers, experimentalists, clinicians, and operational experts, we proudly operate from our hubs in London, San Francisco, and Dubai. Position Overview – Senior Software Engineer, Backend & Cloud: In this role, you will be responsible for designing, developing, and scaling SaaS solutions that make our biological foundation models accessible to end users. Your work will predominantly focus on backend (70%), followed by cloud (25%) and a small portion of frontend (5%). Your contributions will support: Managing extensive biological datasets with complex I/O and structured metadata. Tracking experiment lineage, artifacts, and model version histories. Implementing tenant-aware access controls and role-based permissions. Creating reproducible workflows that connect research code to production services. You will transform intricate model workflows into user-friendly, reliable, and observable products in production. Key Responsibilities. Backend & Application Services: You will engage with data models, invariants, and potential failure modes. Design and implement REST or gRPC APIs to support datasets, experiments, and user workflows. Define and adapt service boundaries as the system evolves. Design, migrate, and optimize schemas within an RDBMS, preferably PostgreSQL. Implement authentication and authorization controls at the tenant level. Enhance performance, query efficiency, and data integrity. Add structured logging and metrics for efficient debugging. Cloud & Infrastructure: You will work directly with cloud resources, ensuring efficient deployment and operation. Deploy and manage services on AWS or GCP. Provision and configure computing, storage, and networking services. Set up IAM roles adhering to least-privilege access principles. Utilize Docker for service containerization.

Mar 2, 2026
Crusoe
Full-time|$166K/yr - $201K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to advance the availability of energy and intelligence. We are developing the engine that propels a future where individuals can engage in ambitious AI projects without compromising on scale, speed, or sustainability. Join us in revolutionizing the AI landscape with our sustainable technology. At Crusoe, you'll not only foster meaningful innovation but also make a tangible impact while collaborating with a team that is pioneering responsible and transformative cloud infrastructure solutions. About This Role: As a Senior Software Engineer on our storage team, you will be an integral part of our core engineering unit, tasked with designing, constructing, and optimizing our next-generation cloud storage products. We seek a hands-on engineer with profound expertise in storage system development. Your role will involve creating highly performant, reliable, and scalable distributed storage systems that are vital to both our infrastructure and our clients' AI and HPC workloads. Your Responsibilities Include: Developing Our Multi-Petabyte Cloud Storage Platform: Creating core components of our foundational storage products specifically designed for high-performance AI and ML applications. Enhancing distributed file, block, and object storage solutions, with an emphasis on filesystem-based approaches. System Design & Architecture: Designing and implementing scalable, resilient storage architectures that are highly extensible. Proposing and prototyping innovative strategies to enhance performance and system throughput for our most demanding customer workloads. Developing observability, metrics, and tooling for our services and infrastructure. High-Velocity Problem Solving: Identifying and resolving unique and complex problems in distributed systems at the scale we operate. Providing ongoing support for production systems and customer workloads, which includes troubleshooting, performance tuning, and incident management. Cross-Functional Collaboration: Cultivating strong collaboration with other engineering teams (e.g., Software Infrastructure, Product) and cross-functional departments. Taking ownership and representing the storage team in critical business initiatives.

Aug 26, 2025
Crusoe Energy Systems
Full-time|$172K/yr - $209K/yr|On-site|San Francisco, CA - US

About Crusoe Energy Systems: Crusoe Energy Systems manages every layer of AI infrastructure, from energy generation to advanced computational resources. The team focuses on making AI infrastructure more efficient and environmentally conscious, addressing the growing global demand for computing power. Based in San Francisco, Crusoe brings together experts in energy, manufacturing, data center construction, and cloud services. Role Overview: Senior Software Engineer - Cloud Infrastructure. This Senior Software Engineer position centers on designing and building cloud infrastructure management systems for Crusoe Cloud, a vertically integrated, AI-focused platform. The engineer will help deliver complete solutions that support the company’s business goals, including system planning, monitoring, deployment, and operations. The role involves hands-on work developing platforms, tools, and frameworks that emphasize reliability, scalability, operational efficiency, and ease of use. As Crusoe Cloud grows, this engineer will play a key part in streamlining infrastructure planning and management processes. What You Will Do: Work closely with cross-functional teams to design and implement infrastructure management software and availability platforms for customers using Crusoe’s AI infrastructure. Help improve the reliability, scalability, and security of systems and platforms. Develop workflows that support business objectives and performance targets. Build and maintain high-performing, highly available cloud solutions to meet expanding infrastructure needs. Who Thrives Here: Engineers who enjoy solving complex problems, move quickly, and want to work alongside a diverse, supportive team will find this role rewarding. Crusoe values collaboration and a shared drive to advance AI infrastructure. Location: San Francisco, CA - US

Apr 17, 2026
Britive
Full-time|Remote|San Francisco, California (remote)

As digital transformation accelerates, the need for robust cloud security solutions becomes vital. Britive stands at the forefront of this evolving landscape, offering an innovative privileged access management platform designed to deliver comprehensive Privileged Access Visibility, Dynamic Privilege Management, and Secrets Governance across diverse cloud environments, platforms, and SaaS applications. Our cutting-edge, patent-pending technology is already in use by numerous Fortune 500 companies, solidifying our reputation as one of the most promising startups in the Cloud Security sector. Founded by seasoned veterans in the CyberSecurity field, Britive is supported by leading venture capital firms, ensuring a solid foundation for growth and innovation. About You: You are a dedicated Software Engineer eager to create and enhance our multi-tenant SaaS applications on the AWS platform. With a solid software engineering foundation and an in-depth understanding of AWS tools and services, you are ready to contribute from day one. Your positive, proactive attitude and enthusiasm for delivering technical solutions in a dynamic startup environment will be key to your success. Your Impact. Key Responsibilities: Design and develop a large-scale application stack operating on AWS. Collaborate with cross-functional teams to implement new features and enhancements. Maintain and optimize existing systems for improved performance and scalability. Ensure high-quality code through best practices and thorough testing.

Apr 1, 2025
Crusoe
Full-time|$204K/yr - $247K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to accelerate the abundance of energy and intelligence. We are building the infrastructure that empowers individuals to use AI creatively without compromising on scale, speed, or sustainability. Join us in leading the AI revolution with innovative technology at Crusoe. As part of our team, you'll contribute to significant advancements, effect real change, and be at the forefront of responsible and transformative cloud infrastructure. About This Role: The Crusoe Cloud Software Development team is on the lookout for an enthusiastic and seasoned Senior Staff Software Engineer who specializes in Hypervisor Virtualization and Research. This key position is vital for designing, developing, and optimizing our virtualization technologies, specifically designed for an AI-centric cloud infrastructure. A profound understanding of hypervisor internals, CPU and memory virtualization, I/O virtualization, and performance optimization is crucial for creating reliable, high-performance, and secure virtualized environments that will support our pioneering AI products. This is a full-time opportunity. What You’ll Be Working On: Hypervisor Development & Optimization: Design, develop, and optimize essential hypervisor components (e.g., KVM, QEMU, or bespoke solutions) to maximize performance and efficiency for AI workloads, concentrating on CPU, memory, and I/O virtualization techniques. Virtualization Research & Innovation: Engage in comprehensive research on advanced virtualization technologies, investigating innovative methods for isolating and accelerating AI computing, storage, and networking resources. Identify and prototype new virtualization features and enhancements to improve density, throughput, and latency. Virtual Hardware & Device Emulation: Create and refine virtual hardware components and device emulation, ensuring peak performance and compatibility for specialized AI accelerators (e.g., GPUs, DPUs) within the virtualized ecosystem. Performance Analysis & Tuning: Assess and enhance the performance of the entire virtualization stack, from the hypervisor to the virtualized guest OS, with a focus on optimization for AI/ML workloads, including profiling, bottleneck detection, and implementing low-level enhancements. System-Level Troubleshooting: Identify and resolve intricate system issues within the virtualization layer, collaborating closely with hardware and guest OS teams to debug and tackle integration challenges.

Aug 12, 2025
Apply
company
Full-time|On-site|San Francisco, California

Join our dynamic team at leverdemo-8 as a Software Engineer specializing in Cloud Infrastructure. We are passionate about reimagining the hiring landscape and are looking for talented engineers to enhance our YugaByte DB for enterprise applications. Your expertise will contribute to optimizing orchestration support across major public clouds including AWS, Google Cloud, and Azure, as well as Kubernetes services and private data centers. You'll play a crucial role in the control and manageability plane of YugaByte and collaborate with tools such as Prometheus and Alert Manager to ensure seamless infrastructure management. Please note that this position is part of Lever's testing environment; we kindly ask you not to apply for this role.

Apr 28, 2020
Apply
companyCrusoe logo
Full-time|$209K/yr - $253K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to catalyze the proliferation of energy and intelligence. We are engineering the driving force behind a future where individuals can ambitiously create with AI without compromising on scale, speed, or sustainability. Join us at Crusoe as we lead the charge in the AI revolution through sustainable technology. You will play a pivotal role in fostering meaningful innovation, making a significant impact, and collaborating with a team that is pioneering the development of responsible and transformative cloud infrastructure. Position Overview: We are in search of experienced Staff/Senior Staff Software Engineers who will be tasked with the architecture, design, and development of advanced Cloud Infrastructure management systems and platforms. You will be vital in delivering end-to-end use cases and workflows for our integrated AI-First Crusoe Cloud. Your contributions will be essential in constructing systems and platforms that effectively plan, monitor, deploy, and operate Crusoe Cloud, achieving key business revenue metrics. Your expertise will be crucial in evaluating, implementing, and building platforms, tools, and frameworks that prioritize reliability, scalability, operational efficiency, and user-friendliness. You will enhance our infrastructure planning and management workflows, driving efficiency and improving the overall performance and reliability of our cloud platform as we ambitiously scale our Crusoe Cloud products and services by more than 10X. In this role, you will also develop and refine technical designs and architectures, mentor fellow engineers, and actively contribute to the growth of the team in partnership with engineering managers. Your Key Responsibilities: Engage collaboratively across teams to design, architect, and implement physical infrastructure management software systems and availability platforms that meet end-to-end customer use cases, ensuring an exceptional customer experience. Champion the reliability, scalability, and security of our systems and platforms, acting as the guardian of our infrastructure! Create workflows designed to enhance efficiency and achieve key business objectives and metrics. Design and implement high-performance, highly available cloud architectures, optimizing for both performance and cost-effectiveness. Enhance cloud deployment, configuration management, and operations by developing and maintaining effective platforms, interfaces, and automation tools. Actively participate in the evolution of our platform, working closely with cross-functional teams.

Nov 24, 2025
Apply
companyImprint logo
Full-time|On-site|San Francisco

About Us: At Imprint, we are revolutionizing the world of co-branded credit cards and innovative financial solutions, focusing on smarter, more rewarding, and brand-first experiences. We collaborate with renowned brands such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to establish modern credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our robust platform integrates advanced payment technologies, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the complexities of becoming a bank. Co-branded credit cards represent over $300 billion in U.S. annual spending, yet many are still managed by outdated banking systems. Imprint stands as the modern alternative: flexible, technology-driven, and tailored for today’s consumers. Supported by notable investors like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a world-class team dedicated to reshaping payment methods and driving brand growth. If you thrive in fast-paced environments, enjoy tackling complex challenges, and aspire to make a significant impact, we would be delighted to meet you. Discover more about us on Imprint's Technology Blog. The Team: The Tech Platform Engineering Team at Imprint is pioneering the democratization of access to advanced technologies, empowering teams across our organization to innovate and excel. Our commitment to redefining the Fintech landscape drives us to build secure, highly available infrastructures while equipping our engineers with comprehensive development tools, allowing them to rapidly create world-class products. Your Role: Design, build, and manage cloud and web infrastructure with a strong emphasis on security, reliability, and scalability. Implement and maintain infrastructure components across computing, networking, and data platforms. Adhere to security best practices in cloud infrastructure, ensuring proper access control, network isolation, and secure communication between services. Monitor system health and engage in incident response, root cause analysis, and reliability enhancements. Collaborate with platform, security, and product engineers to deliver safe and efficient infrastructure solutions.

Jan 16, 2026
