Staff Software Engineer, Model LifeCycle at Crusoe | San Francisco, CA

CrusoeSan Francisco, CA - US

On-site Full-time $208.7K/yr - $253K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Qualifications:Strong Engineering Fundamentals:A Bachelor's or Master's degree in Computer Science, Engineering, or a related discipline.8-10+ years of industry experience with a proven track record of leading diverse projects successfully.

About the job

Join us in revolutionizing the AI landscape with sustainable technology. Here, you will spearhead significant innovations, create real-world impact, and collaborate with a team that is defining the future of responsible cloud infrastructure.

Position Overview:

As a Staff Software Engineer on the Model LifeCycle team, you will be instrumental in developing a robust managed platform that oversees the entire application development lifecycle, specifically focusing on the integration of Machine Learning models, including Large Language Models (LLMs).

Your Responsibilities:

Enhance systems for large foundation models through fine-tuning (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and efficient scaling.
Design and sustain comprehensive training pipelines for Large Language Models.
Contribute to the development of distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling).
Create and uphold the infrastructure for agent execution.
Implement features for dataset, model, and experiment management: ensuring versioning, lineage tracking, evaluation, and reproducible fine-tuning at scale.

Collaboration and Impact:

Collaborate closely with Principal Engineers, product teams, and platform teams to implement core abstractions and APIs.
Participate in architectural decisions regarding training runtimes, scheduling, storage, and model lifecycle management.
Engage actively with the open-source LLM community.
This role offers considerable ownership — you will be pivotal in designing and implementing core systems.

About Crusoe

Crusoe is at the forefront of the AI revolution, committed to harnessing sustainable technology to advance energy and intelligence. We are building a transformative cloud infrastructure that empowers individuals to innovate without compromise.

Similar jobs

1 - 20 of 12,146 Jobs

Search for Senior Staff Software Engineer Cape At Crusoe San Francisco

12,146 results

Select all on this page (20)

Apply

Senior Staff Software Engineer - CAPE at Crusoe | San Francisco

Crusoe

Full-time|On-site|San Francisco, CA - US

Role Overview Crusoe is seeking a Senior Staff Software Engineer focused on CAPE for its San Francisco office. This role centers on designing and building software solutions that support Crusoe's mission to advance technology in the energy sector. What You Will Do Design and implement software systems for CAPE projects Collaborate with cross-functional teams to deliver solutions that align with user needs and company goals Tackle complex technical challenges in support of Crusoe's energy initiatives What We Look For Strong technical background in software engineering Experience solving complex problems and delivering reliable software Ability to work effectively with colleagues across multiple disciplines Location San Francisco, CA - US

Apr 15, 2026

Apply

Senior Staff Software Engineer - CAPE

Crusoe Technologies, Inc.

Full-time|On-site|San Francisco, CA - US

About the Senior Staff Software Engineer Role Crusoe Technologies is hiring a Senior Staff Software Engineer for the CAPE project in San Francisco, CA. This role focuses on building and maintaining scalable software that strengthens our infrastructure and streamlines operations. What You Will Do Design and develop software solutions for the CAPE project Maintain and improve existing systems to support infrastructure growth Work closely with teams across disciplines to deliver reliable, efficient software What We Look For Advanced programming skills Deep understanding of software architecture Experience collaborating with cross-functional groups

Apr 15, 2026

Apply

Staff Software Engineer at Crusoe | San Francisco

Crusoe

Full-time|On-site|San Francisco, CA - US

About the Role Crusoe is hiring a Staff Software Engineer in San Francisco, CA. This role focuses on building high-performance software applications that support the company’s technology initiatives. What You Will Do Design and develop software applications with an emphasis on performance and reliability Collaborate with engineering teams to deliver solutions that meet business needs Contribute technical expertise to key projects and code reviews Location This position is based in San Francisco, CA.

Apr 15, 2026

Apply

Staff Software Engineer - Networking at Crusoe | San Francisco

Crusoe

Full-time|On-site|San Francisco, CA - US

Join Crusoe as a Staff Software Engineer specializing in Networking. In this critical role, you will design and implement innovative software solutions that enhance our networking infrastructure. You will collaborate with cross-functional teams to optimize performance and reliability, ensuring that our services run efficiently and securely.

Mar 25, 2026

Apply

Senior Software Engineer, Networking at Crusoe | San Francisco

Crusoe

Full-time|On-site|San Francisco, CA - US

Join our innovative team at Crusoe as a Senior Software Engineer specializing in Networking. In this critical role, you will develop cutting-edge software solutions that enhance our networking capabilities and support our mission of delivering efficient computing resources.Your expertise will contribute to building scalable and reliable network architectures, enabling us to serve our clients better. Collaborate with cross-functional teams and leverage your knowledge in software engineering to push the boundaries of technology.

Mar 25, 2026

Apply

Senior Software Engineer - Streaming at Crusoe | San Francisco

Crusoe

Full-time|Remote|San Francisco, CA - US

Join Crusoe as a Senior Software Engineer in our Streaming division, where you'll be at the forefront of innovative streaming technology solutions. You will collaborate with cross-functional teams to design, develop, and implement high-performance streaming applications that enhance user experience.As a vital member of our engineering team, you will leverage your expertise in software development to contribute to cutting-edge projects that push the boundaries of streaming technology.

Mar 10, 2026

Apply

Senior Staff Software Engineer - Model LifeCycle at Crusoe | San Francisco

Crusoe

Full-time|Remote|San Francisco, CA - US

As a Senior Staff Software Engineer specializing in Model LifeCycle at Crusoe, you will play a vital role in shaping the future of software solutions that optimize and enhance our innovative operations. You will lead complex projects, mentor junior engineers, and collaborate with cross-functional teams to deliver high-impact results.

Mar 10, 2026

Apply

Senior Software Engineer for Observability at Crusoe | San Francisco

Crusoe

Full-time|Remote|San Francisco, CA - US

Join Crusoe as a Senior Software Engineer specializing in Observability, where you will play a pivotal role in enhancing our systems and ensuring robust performance across our platforms. You will collaborate with cross-functional teams to develop innovative solutions that improve the visibility and reliability of our software applications.

Feb 26, 2026

Apply

Staff Software Engineer, Model LifeCycle at Crusoe | San Francisco, CA

Crusoe

Full-time|$208.7K/yr - $253K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to propel the availability of energy and intelligence. We are developing the engine that empowers individuals to pursue ambitious projects with AI, all while upholding standards of scale, speed, and sustainability.Join us in revolutionizing the AI landscape with sustainable technology. Here, you will spearhead significant innovations, create real-world impact, and collaborate with a team that is defining the future of responsible cloud infrastructure.Position Overview:As a Staff Software Engineer on the Model LifeCycle team, you will be instrumental in developing a robust managed platform that oversees the entire application development lifecycle, specifically focusing on the integration of Machine Learning models, including Large Language Models (LLMs).Your Responsibilities:Enhance systems for large foundation models through fine-tuning (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and efficient scaling.Design and sustain comprehensive training pipelines for Large Language Models.Contribute to the development of distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling).Create and uphold the infrastructure for agent execution.Implement features for dataset, model, and experiment management: ensuring versioning, lineage tracking, evaluation, and reproducible fine-tuning at scale.Collaboration and Impact:Collaborate closely with Principal Engineers, product teams, and platform teams to implement core abstractions and APIs.Participate in architectural decisions regarding training runtimes, scheduling, storage, and model lifecycle management.Engage actively with the open-source LLM community.This role offers considerable ownership — you will be pivotal in designing and implementing core systems.

Feb 9, 2026

Apply

Senior Staff Cloud Support Engineer at Crusoe | San Francisco, CA

Crusoe

Full-time|$180K/yr - $220K/yr|On-site|San Francisco, CA - US

At Crusoe, we are on a mission to revolutionize the future by accelerating the abundance of energy and intelligence. We are building the foundational engine that empowers individuals to create bold innovations with AI while ensuring sustainability, speed, and scalability.Join us in the forefront of the AI revolution with cutting-edge sustainable technology. You will play a pivotal role in driving meaningful innovation, making a significant impact, and collaborating with a team that is leading the way in responsible, transformative cloud infrastructure.About the RoleAs a Senior Staff Cloud Support Engineer, you will serve as a technical expert within Crusoe Cloud and significantly enhance the efforts of our Customer Experience, SRE, Networking, Fleet, and Product teams. Your role transcends basic ticket resolution; you will design reliability frameworks, influence architectural decisions, mentor senior engineers, and safeguard revenue by averting large-scale incidents. With profound expertise in Linux systems, Kubernetes, networking, and AI/ML infrastructure, you will apply your knowledge with a strong focus on customer satisfaction. You will be comfortable navigating uncertainty, leading incident responses, and shaping the global scaling of high-performance AI infrastructure.Key ResponsibilitiesAct as the top escalation point for complex P1/P0 incidents.Lead cross-functional investigations into root causes involving compute, networking (IB/RDMA/RoCE), storage, and orchestration layers.Collaborate with SRE and Software teams (Storage, Networking, Compute, K8) to devise systemic solutions rather than temporary fixes.Reliability ArchitectureDesign and enhance node validation, burn-in processes, performance baselining, and release readiness.Influence Kubernetes architecture, workload orchestration (Slurm, Terraform), and AI/ML cluster stability.Minimize MTTR and prevent incident recurrence through structural enhancements.AI/ML Infrastructure ExpertiseTroubleshoot NCCL, IB, GPU driver/firmware issues, and distributed training failures.Support complex AI workloads (training + inference) through performance tuning and observability enhancements.Customer-Facing AuthorityAct as a senior technical advisor during high-stakes customer incidents.

Feb 16, 2026

Apply

Senior Data Engineer at Crusoe | San Francisco, CA

Crusoe

Full-time|On-site|San Francisco, CA - US

Join Crusoe as a Senior Data Engineer, where you will play a critical role in enhancing our data infrastructure and analytics capabilities. You will be responsible for designing, developing, and maintaining robust data pipelines to support our cutting-edge applications.As a key member of our engineering team, you will work closely with data scientists and analysts to ensure that data is accessible, accurate, and actionable.

Mar 25, 2026

Apply

Senior API Integration Engineer at Crusoe | San Francisco, CA

Crusoe

Full-time|$165K/yr - $200K/yr|On-site|San Francisco, CA - US

At Crusoe, we are on a mission to accelerate the abundance of energy and intelligence, creating an environment where innovation thrives. As we build the infrastructure that empowers ambitious AI-driven projects, we prioritize sustainability without compromising on scale or speed.Join us in being part of the AI revolution with cutting-edge technology at Crusoe, where you will spearhead impactful innovations and collaborate with a team committed to transforming cloud infrastructure responsibly.About This RoleWe are looking for a Senior API Integration Engineer who will act as a vital technical partner in our enterprise-wide digital transformation efforts. This role is pivotal in driving intelligent automation and scalable system integrations, particularly within our People Tech ecosystem, with a strong focus on Workday HCM.What You’ll Be Working OnDesigning and developing enterprise-grade integrations utilizing the Workato ONE platform to facilitate intelligent workflow automation.Creating and maintaining robust API integrations across Workday HCM and the wider People Tech landscape, which includes payroll, ATS, LMS, compensation, benefits, performance, and analytics.Employing AI-driven automation within Workato to enhance efficiency, reliability, and process optimization.Establishing reusable integration architecture patterns, frameworks, and governance standards that can scale across both automated and human-led workflows.Collaborating with business and IT stakeholders to gather requirements, lead discovery sessions, assess ROI, and translate complex needs into scalable tech solutions.Overseeing integration initiatives from concept through deployment, including sprint execution, technical reviews, and delivery accountability.Providing senior-level escalation support to ensure the reliability and monitoring of critical integrations.What You’ll Bring to the TeamA minimum of 7 years of experience as an API Developer or Integration Engineer in enterprise-level environments.At least 3 years of hands-on production experience with Workato, especially with Workato ONE.Proven expertise in building and maintaining complex Workato integrations.

Feb 12, 2026

Apply

Senior to Senior Staff Solutions Engineer at Crusoe | San Francisco, CA

Crusoe

Full-time|$175K/yr - $250K/yr|On-site|San Francisco, CA - US

At Crusoe, we are on a mission to drive the proliferation of energy and intelligence in the digital age. We are developing an innovative platform that enables individuals to harness the power of AI for ambitious projects, all while ensuring unparalleled scale, speed, and sustainability.Join us at the forefront of the AI revolution, where sustainable technology meets transformative cloud infrastructure. At Crusoe, you will be part of a team that is committed to meaningful innovation and making a significant impact.About the Role:We are looking for a Senior to Senior Staff level Solutions Engineer to collaborate closely with our key enterprise clients as they deploy AI and machine learning workloads on Crusoe's cutting-edge GPU infrastructure. This role is hands-on and customer-centric, requiring extensive technical knowledge in Kubernetes, MLOps, and cloud infrastructure.You will lead clients through the entire deployment journey, overseeing the proof of concept (PoC) process, optimizing workloads after the sale, and serving as an essential technical liaison between our clients and engineering teams. Successful candidates will possess a strong passion for AI infrastructure, be proficient in containerized environments, and have the ability to effectively translate workloads across various cloud platforms.What You'll Be Working On:Customer Enablement: Spearhead the technical onboarding and deployment of sophisticated AI/ML workloads with strategic enterprise customers—taking ownership of the PoC through to post-sales optimization.Kubernetes + MLOps Focus: Design and implement ML workloads utilizing Kubernetes-based technologies (e.g., Ray, Kubeflow) while ensuring optimal performance, scalability, and efficiency.Infrastructure-Centric Thinking: Engage directly with Crusoe infrastructure to deploy and fine-tune AI/ML workloads, guaranteeing performance at both the container and hardware levels.Cross-Cloud Translation: Assist clients in migrating and adapting workloads across AWS, Azure, and GCP, while clearly articulating the trade-offs between cloud-native and Crusoe-native strategies.Technical Storytelling: Facilitate workshops, live demonstrations, and solution reviews. Contribute to case studies, solution briefs, and blog articles that showcase real-world customer success stories.Voice of the Customer: Provide feedback to internal engineering and product teams to continuously enhance Crusoe’s platform based on practical implementation experiences.What You'll Bring to the Team:Deep Kubernetes Expertise: 7+ years of experience in building and deploying containerized applications.

Nov 13, 2025

Apply

Staff Hardware Systems Engineer at Crusoe | San Francisco, CA

Crusoe

Full-time|$208K/yr - $253K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to drive the evolution of energy and intelligence. We are developing the technology that fuels a future where individuals can ambitiously harness AI capabilities without compromising on scale, speed, or sustainability.Join us in revolutionizing AI with sustainable solutions at Crusoe. In this role, you will be at the forefront of innovation, making a significant impact while collaborating with a team that is shaping the future of responsible and transformative cloud infrastructure.About This Role:We are looking for a dedicated Hardware Production/Sustaining Engineer to enhance Crusoe's Hardware Systems Engineering team. This position is critical for bridging essential skill gaps in debugging, validation, and production support for high-performance computing systems. You will manage the entire hardware lifecycle—from prototype initiation to large-scale production—focusing on automation, deep troubleshooting, and reliability within Crusoe Cloud’s GPU- and CPU-oriented infrastructure.Your collaboration with cross-functional teams will be vital in supporting, debugging, and enhancing hardware platforms on a large scale, specifically targeting PCIe, InfiniBand, and NVMe/storage, which have been highlighted as key areas for expanded expertise. Your contributions will directly influence Crusoe’s capability to deploy and maintain sustainable, AI-driven computing systems that deliver exceptional performance and reliability.Your Responsibilities Will Include:Leading the complete hardware development and sustaining lifecycle, encompassing feasibility studies, bring-up, validation, deployment, and ongoing production support.Creating and sustaining automation frameworks and scripts for hardware testing, diagnostics, and continual reliability enhancements.Executing in-depth troubleshooting and debugging across:PCIe (including link training, topology, and performance issues)InfiniBand (focusing on fabric debugging, throughput, and connectivity challenges)NVMe/storage (addressing performance bottlenecks, firmware interactions, and failure analyses)Performing extensive system validation and characterization for GPU, CPU, and high-performance computing platforms.Assisting in end-to-end integration and solution testing to guarantee that Crusoe Cloud products fulfill performance, reliability, and scalability standards.Collaborating with teams across mechanical, thermal, firmware, software, and manufacturing domains to troubleshoot and enhance system performance.

Feb 19, 2026

Apply

Staff Software Engineer - Cloud Availability Platform Engineering (CAPE)

Crusoe

Full-time|$209K/yr - $253K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to catalyze the proliferation of energy and intelligence. We are engineering the driving force behind a future where individuals can ambitiously create with AI without compromising on scale, speed, or sustainability.Join us at Crusoe as we lead the charge in the AI revolution through sustainable technology. You will play a pivotal role in fostering meaningful innovation, making a significant impact, and collaborating with a team that is pioneering the development of responsible and transformative cloud infrastructure.Position Overview:We are in search of experienced Staff/Senior Staff Software Engineers who will be tasked with the architecture, design, and development of advanced Cloud Infrastructure management systems and platforms. You will be vital in delivering end-to-end use cases and workflows for our integrated AI-First Crusoe Cloud. Your contributions will be essential in constructing systems and platforms that effectively plan, monitor, deploy, and operate Crusoe Cloud, achieving key business revenue metrics.Your expertise will be crucial in evaluating, implementing, and building platforms, tools, and frameworks that prioritize reliability, scalability, operational efficiency, and user-friendliness. You will enhance our infrastructure planning and management workflows, driving efficiency and improving the overall performance and reliability of our cloud platform as we ambitiously scale our Crusoe Cloud products and services by more than 10X.In this role, you will also develop and refine technical designs and architectures, mentor fellow engineers, and actively contribute to the growth of the team in partnership with engineering managers.Your Key Responsibilities:Engage collaboratively across teams to design, architect, and implement physical infrastructure management software systems and availability platforms that meet end-to-end customer use cases, ensuring an exceptional customer experience.Champion the reliability, scalability, and security of our systems and platforms, acting as the guardian of our infrastructure!Create workflows designed to enhance efficiency and achieve key business objectives and metrics.Design and implement high-performance, highly available cloud architectures, optimizing for both performance and cost-effectiveness.Enhance cloud deployment, configuration management, and operations by developing and maintaining effective platforms, interfaces, and automation tools.Actively participate in the evolution of our platform, working closely with cross-functional teams.

Nov 24, 2025

Apply

Staff Software Engineer

Crusoe

Full-time|On-site|San Francisco, CA - US

Join our innovative team at Crusoe as a Staff Software Engineer. In this pivotal role, you will leverage your advanced software engineering skills to design, develop, and optimize cutting-edge solutions that enhance our technology stack. Collaborate with cross-functional teams to drive projects from concept to completion, ensuring high-quality deliverables that meet user needs and business objectives.

Mar 19, 2026

Apply

Senior Brand & Experiential Designer at Crusoe | San Francisco

Crusoe

Full-time|Remote|San Francisco, CA - US

Join Crusoe as a Senior Brand & Experiential Designer and play a pivotal role in shaping our brand identity. You will lead the design of experiential marketing initiatives that resonate with our audience, ensuring a cohesive and engaging brand experience across all touchpoints.

Apr 10, 2026

Apply

Senior Financial Analyst at Crusoe | San Francisco, CA

Crusoe

Full-time|$130K/yr - $165K/yr|On-site|San Francisco, CA - US

At Crusoe, our mission is to accelerate the availability of energy and intelligence. We are on a journey to build a robust platform that empowers individuals to innovate boldly with AI, all while maintaining scale, speed, and sustainability.Join us in the AI revolution through sustainable technology at Crusoe. You will contribute to significant innovations, create a real impact, and be part of a team that is leading the way in responsible and transformative cloud infrastructure.About the Role:Become a vital member of Crusoe’s Finance team, where you will play a crucial role in Financial Planning and Analysis, supporting our rapidly evolving Cloud business. Your responsibilities will include budgeting, forecasting, and conducting financial analysis. Your insights will directly shape strategic financial decisions, enhance operational efficiency, and strengthen our finance functions.We are looking for a results-oriented professional with over 5 years of experience who is eager to take a hands-on approach in defining strategic financial objectives and driving their execution in a fast-paced environment. (#INDFNC)What You Will Be Working On:Budgeting & Forecasting Process: Collaborate with Cloud business leaders to develop precise monthly P&L/capex forecasts and annual budgets, ensuring timely performance tracking and analysis.Cloud Deal Analysis: Perform financial modeling and analysis for complex Cloud deals, assessing ROI, pricing strategies, and contract terms to assist the sales and business development teams.Financial Analysis: Spearhead cost analysis efforts and continuously track financial performance to identify risks and opportunities for profitability enhancements.Process Improvement: Lead initiatives aimed at improving financial reporting accuracy and transparency.Executive Reports: Prepare and assist the Finance team in generating monthly executive reports, board materials, and investor updates.Capital Raise Support: Aid in creating models, presentations, and due diligence materials for debt and equity fundraising events.Ad-Hoc Analysis: Execute special projects and conduct ad-hoc financial analyses as necessary; serve as an internal and external point of contact for finance inquiries.

Jan 15, 2026

Apply

Senior Technical Program Manager at Crusoe | San Francisco

Crusoe

Full-time|On-site|San Francisco, CA - US

Join Crusoe as a Senior Technical Program Manager, where you will lead transformative projects at the intersection of technology and innovation. In this role, you will be responsible for managing complex technical programs, ensuring timely delivery while collaborating with cross-functional teams. Your expertise will help shape our strategic initiatives and drive operational excellence.

Mar 16, 2026

Apply

Enterprise IT Architect at Crusoe | San Francisco

Crusoe

Full-time|On-site|San Francisco, CA - US

Join Crusoe as an Enterprise IT Architect, where you will play a pivotal role in shaping our technology landscape. This position offers the opportunity to design and implement innovative IT architecture solutions that meet our business needs and drive operational excellence.

Apr 6, 2026

Create account — see all 12,146 results