Solutions Architect - Kubernetes
Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Qualifications
About CoreWeave
CoreWeave is The Essential Cloud for AI™, built by pioneers for pioneers. We provide a robust platform that empowers innovators to build and scale AI solutions confidently. Our combination of superior infrastructure performance and deep technical expertise has made us a trusted partner for leading AI labs, startups, and global enterprises. Founded in 2017 and now publicly traded (Nasdaq: CRWV), CoreWeave is at the forefront of the AI revolution. Discover more at www.coreweave.com.
Similar jobs
Search for Solutions Architect Hpc Ai Ml
78 results
CoreWeave
Join CoreWeave as a Solutions Architect, where you will be at the forefront of delivering high-performance cloud solutions tailored for AI, HPC, and ML workloads. In this dynamic role, you will engage directly with clients, establishing Kubernetes environments, developing proofs of concept, and optimizing their cloud infrastructure experience. Your expertise will contribute to shaping the future of AI technology, ensuring that our customers have seamless, reliable access to the tools they need to innovate and succeed.
CoreWeave is The Essential Cloud for AI™. Designed by pioneers for pioneers, CoreWeave offers a robust platform of technology, tools, and expert teams to empower innovators in building and scaling AI solutions with assurance. With a trusted reputation among top AI laboratories, emerging startups, and large enterprises, CoreWeave merges outstanding infrastructure performance with extensive technical knowledge to expedite breakthroughs and transform compute into capability. Established in 2017, CoreWeave became a publicly traded entity (Nasdaq: CRWV) in March 2025. Discover more at www.coreweave.com.What You’ll DoAs a key member of the Field Engineering team at Weights & Biases, you will play an essential role in fostering customer success and promoting our platform's adoption. Collaborating closely with Sales, Support, Product, and Engineering, you will lead technical initiatives post-sales. Engaging with some of the most sophisticated AI teams globally, you will assist them in constructing, optimizing, and scaling their ML and GenAI workflows across various sectors including computer vision, robotics, natural language processing, and large language models (LLMs).About the RoleWe are seeking an AI Solutions Engineer (AISE) focused on Post-Sales Scale Customer Success, dedicated to facilitating the successful implementation and scaling of AI/ML workflows and GenAI applications on Weights & Biases. You will design and deliver comprehensive technical enablement and adoption programs for numerous customers simultaneously, develop reusable resources to enhance self-service success, and leverage product signals and feedback loops to iteratively improve results.Key Responsibilities Facilitate 1-to-many onboarding and enablement programsLead and execute scalable onboarding and adoption strategies (webinars, cohort sessions, group training, and office hours) to ensure customers realize value quickly and consistently.Create reusable technical assets to enhance self-service successDevelop and maintain playbooks, reference architectures, templates, sample code/notebooks, and troubleshooting guides to standardize best practices and minimize repetitive 1:1 support.Manage scaled initiatives using signals and feedback loopsContinuously refine the onboarding and adoption processes based on customer feedback and product usage signals.
CoreWeave
Join CoreWeave as a Solutions Architect specializing in Kubernetes. You will lead customers in optimizing their AI workloads, ensuring high performance and seamless experiences on our cloud platform. Collaborate with engineering teams and leverage your expertise to drive innovation and establish strong technical relationships with clients.
CoreWeave
Join CoreWeave as a Solutions Architect and be at the forefront of the AI revolution! Our Customer Experience (CX) Organization is committed to delivering seamless, high-performance solutions for clients running AI workloads at scale. In this pivotal role, you will engage directly with customers, guiding them through the entire lifecycle of their projects—from establishing Kubernetes environments to optimizing workloads. Your expertise in networking technologies within high-performance compute (HPC) environments will be crucial in fostering strong technical relationships and ensuring customer success. If you are passionate about innovation and ready to make a significant impact, CoreWeave is the perfect place for you!
CoreWeave
Join CoreWeave, the leading cloud provider for AI, as a Solutions Architect specializing in security. In this influential role, you will ensure our clients experience seamless and high-performance AI workloads. Collaborate closely with engineering teams and engage directly with customers throughout their journey, from setting up Kubernetes environments to optimizing workloads. If you thrive on innovation and want to shape the future of AI, this is the opportunity for you!
CoreWeave
At CoreWeave, we redefine cloud computing for AI with our innovative platform designed for pioneers. Our technology, tools, and dedicated teams empower innovators to build and scale AI solutions confidently. Trusted by top AI labs, startups, and global enterprises, we merge exceptional infrastructure performance with profound technical expertise to drive breakthroughs and transform compute into capability. Established in 2017, we proudly became a publicly traded company (Nasdaq: CRWV) in March 2025. Discover more at www.coreweave.com.Your Role:The Customer Experience (CX) Organization at CoreWeave is committed to providing every client running AI workloads at scale with a seamless, reliable, and high-performance experience. This team is vital in supporting the infrastructure that drives the AI revolution—ensuring the integrity of our cloud platform across data centers, hardware systems, and customer workloads. By closely aligning with internal and customer engineering teams, the CX organization offers critical insights from the field and contributes significantly to the CoreWeave product roadmap and development.What You Will Do:As a Solutions Architect specializing in storage at CoreWeave, you will take on a crucial and dynamic role. You’ll have the opportunity to showcase your thought leadership and engage directly throughout our customers' entire lifecycle. From setting up their Kubernetes environments to developing proofs of concept, onboarding, and optimizing workloads, you will spearhead innovation at every stage. If you are passionate about innovation, excited about the potential of specialized compute, and eager to be part of a team shaping the future, CoreWeave is the place for you. Join us on this exciting journey!Key Responsibilities:Act as the primary technical liaison for customers, fostering strong relationships and ensuring their success with CoreWeave's cloud infrastructure, with a specific focus on storage technologies within high-performance compute (HPC) environments.Collaborate...
Sonsoft Inc.
Join our team as a Data Modeler Architect at Sonsoft Inc. in Livingston, New Jersey! As a vital member of our data architecture team, you will design, develop, and maintain robust data models to support our data-driven initiatives. Your expertise will be essential in optimizing data structures and ensuring data integrity across various platforms.
Join CoreWeave as a Network Security Engineering Manager, where you will spearhead the design and implementation of security protocols within our cutting-edge AI hyperscaler infrastructure. In this pivotal role, you will oversee the security measures that fortify our expansive GPU clusters and distributed AI workloads. Collaborating with Networking, Infrastructure, and Site Reliability Engineering (SRE) teams, you will play a crucial role in shaping the next generation of network fabrics and architectures. You will lead a talented team of network security engineers, ensuring that network security evolves in tandem with our rapidly scaling platform while maintaining peak performance and reliability.
CoreWeave
About CoreWeave CoreWeave is The Essential Cloud for AI™. The company provides a platform of technology, tools, and teams that support innovators building and scaling AI solutions. CoreWeave's infrastructure is trusted by leading AI labs, startups, and enterprises for its performance and reliability. CoreWeave is publicly traded on Nasdaq (CRWV) as of March 2025. Learn more at www.coreweave.com. Role Overview: Senior Storage Engineer As part of the Storage Engine Team, the Senior Storage Engineer designs and develops managed storage products that meet the needs of demanding AI workloads. This role involves close collaboration with engineering teams across infrastructure, compute, and platform to ensure storage services are reliable, scalable, and high-performing. What You Will Do Design and build distributed storage solutions that support scaling for data-intensive AI workloads. Develop exabyte-scale, S3-compatible object storage and integrate dedicated storage clusters for a variety of customer environments. Apply technologies such as RDMA, GPU Direct Storage, and distributed filesystem protocols (NFS, FUSE) to improve storage performance and efficiency. Lead projects to strengthen the reliability, durability, security, and observability of the storage stack. Work with operations teams to monitor, troubleshoot, and refine storage systems in production settings. Create metrics and dashboards that track storage performance and health. Analyze telemetry and system data to identify ways to improve throughput, latency, and resilience. Collaborate with platform, product, and infrastructure teams to deliver seamless storage capabilities across the stack. Mentor other engineers and share knowledge on building distributed, high-performance storage systems. Locations Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA
CoreWeave, Inc.
CoreWeave is The Essential Cloud for AI™. Established by visionaries for innovators, CoreWeave offers a technology platform that empowers clients to build and scale AI solutions with confidence. Trusted by leading AI laboratories, startups, and global organizations, we blend top-tier infrastructure performance with extensive technical expertise to facilitate breakthroughs and transform computing into capability. Founded in 2017, CoreWeave became a publicly traded entity (Nasdaq: CRWV) in March 2025. Discover more at www.coreweave.com.What You’ll Do:About the Team:The Supply Chain Strategy and Transformation team is charged with architecting and scaling the operational model for how CoreWeave plans, sources, builds, and delivers data center capacity for our clientele. We collaborate closely with Supply Chain, Product Engineering, Data Center Operations, and Enterprise Engineering to translate strategy into execution and measurable outcomes.About the Role:In the role of Senior Manager, Supply Chain Risk, Resilience & Compliance, you will spearhead the strategy, governance, and roadmap across pivotal programs aimed at enhancing supply chain resilience, improving internal controls, and establishing scalable governance for business continuity and sustainability. This position entails creating frameworks, operational mechanisms, and fostering cross-functional alliances to identify and monitor supply chain risk, bolster compliance readiness, support continuity planning, and standardize end-of-life asset processes across the network.Key Responsibilities:Own the strategy and governance for supply chain risk management and resilience.Develop and enhance the supply chain risk monitoring framework and control tower.Establish risk indicators, escalation paths, reporting schedules, and mitigation governance across various supply chain risk domains.
CoreWeave is the premier cloud solution for AI, designed by innovators for innovators. Our platform empowers visionaries with the technology, tools, and expertise required to confidently develop and scale AI solutions. Trusted by top AI labs, startups, and global enterprises, CoreWeave merges high-performance infrastructure with unmatched technical proficiency to foster breakthroughs and transform compute into capability. Established in 2017, we became a publicly traded company (Nasdaq: CRWV) in March 2025. Discover more at www.coreweave.com.We are on the lookout for a Director of Engineering, Media & Entertainment (M&E) to spearhead the creation of advanced cloud platforms and tools that enhance contemporary content creation workflows. This pivotal role will shape the engineering strategy and implementation for solutions that facilitate visual effects (VFX), animation, rendering, and post-production processes utilized by studios, artists, and creative teams globally.As a key leader in our engineering department, you will establish and lead dynamic engineering teams tasked with designing scalable infrastructure, developer tools, and user-centric systems that empower creative professionals to execute intricate production workloads within the cloud. You will work in close collaboration with product, design, infrastructure, and customer teams to convert real-world production workflows into dependable, high-performance software platforms.This role fuses extensive engineering leadership with specialized knowledge in M&E workflows, ensuring our platform delivers outstanding performance, reliability, and usability tailored for demanding creative workloads.
Join CoreWeave as a Principal Engineer, where you will spearhead the architecture, development, and operations of Managed Databases tailored for AI workloads. In this pivotal role, you will define and advance our technical roadmap, guiding how both clients and internal teams efficiently store and access operational data for AI and machine learning applications. Collaborate closely with engineering stakeholders across CoreWeave’s diverse product offerings to shape the future of AI infrastructure.
CoreWeave
Join CoreWeave as a Data Center Quality Manager!At CoreWeave, we are redefining the cloud experience for AI, providing innovative technology and tools that empower creators to build and scale their AI initiatives seamlessly. As a trusted partner for prominent AI labs and enterprises, we deliver exceptional infrastructure performance coupled with unmatched technical expertise. Established in 2017 and recently going public (Nasdaq: CRWV) in March 2025, CoreWeave is at the forefront of AI cloud solutions. Discover more at www.coreweave.com.Your Role:The Quality Management team is vital in ensuring the integrity of CoreWeave’s rapidly expanding data center infrastructure. You will collaborate with Engineering, Strategic Sourcing, Manufacturing, Construction, and Operations to instill quality throughout the asset lifecycle—from design to long-term operation. This role is essential in managing risks, ensuring performance consistency, and driving continuous improvement across our mission-critical infrastructure.Quality Strategy & GovernanceDevelop and uphold CoreWeave’s quality framework encompassing suppliers, manufacturing partners, and field operations.Set quality standards, acceptance criteria, and inspection protocols in line with engineering and operational specifications.Oversee quality governance models, including escalation procedures, corrective actions, and executive reporting.Standardize key quality processes such as NCR, deviation requests, and lessons learned mechanisms.Supplier & Manufacturing QualityLead the onboarding, qualification, and performance management of critical infrastructure vendors.Conduct supplier audits, factory acceptance tests (FATs), and manufacturing readiness evaluations to ensure compliance prior to delivery.
About CoreWeave CoreWeave builds cloud infrastructure tailored for AI. Founded in 2017, the company supports AI labs, startups, and global enterprises with high-performance platforms and deep technical expertise. CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com. Role Overview The Senior Product Manager - Go-To-Market Strategy joins the IT Business Systems (GTM) team. This group manages the systems behind CoreWeave’s commercial operations, including lead management, contract processing, billing, customer support, and revenue tracking. This role partners closely with Business Operations to guide the GTM systems roadmap. The Senior Product Manager leads requirements gathering and drives delivery for initiatives across Salesforce CRM & CPQ, Service Cloud, billing systems, and their integrations. The work blends strategic product development with the detailed approach of a senior business systems analyst. What You Will Do Co-manage the GTM systems roadmap with Business Operations, handling requirements collection, prioritization, and backlog management for Salesforce (Sales Cloud, Revenue Cloud/CPQ, Service Cloud), billing systems, CLM, marketing automation, and integrations. Serve as the main point of contact between global teams and stakeholders, ensuring alignment and clear communication throughout project lifecycles. Key Initiatives This position leads projects such as M&A integration, billing platform deployment, and improvements to the customer experience. A product-focused mindset, attention to user experience, and accountability for metrics are essential. Locations Roles are available in Livingston, NJ; New York, NY; Sunnyvale, CA; San Francisco, CA; and Bellevue, WA.
CoreWeave builds cloud infrastructure for AI innovators, supporting organizations with secure, high-performance platforms. Trusted by leading AI labs and enterprises, CoreWeave integrates with existing identity and security systems to deliver reliable solutions for sensitive workloads. Since March 2025, CoreWeave has been publicly traded on Nasdaq (CRWV). Learn more at www.coreweave.com. About the Security Products Division The Security Products group develops tools that help enterprise clients extend their identity and encryption frameworks into the CoreWeave Cloud. This team builds products like CoreWeave IAM, encryption lifecycle management, and infrastructure for detailed access control and secure onboarding. Collaboration with Product, Information Security, and core platform teams is central to making CoreWeave a secure and compliant environment for large-scale AI applications. Role Overview: Director of Engineering, Security Products CoreWeave seeks a Director of Engineering to lead teams building identity, authorization, and encryption platforms. This technical and strategic leader will shape the roadmap for security products that support customer trust and enable scalable, regulated AI applications. The Director will work closely with Product and Security partners to design and maintain reliable, auditable, and user-friendly services. Mentoring engineering leaders and individual contributors is a key part of this role, setting high standards for engineering quality in a complex, high-stakes environment. What You Will Do Define and execute the engineering roadmap for CoreWeave security products, including IAM, authorization, encryption lifecycle management, and workload identity. Locations Livingston, NJ / New York, NY / Sunnyvale, CA
The Senior Security Engineer - PKI & Secrets role at CoreWeave sits within the Security Foundations team. This team is responsible for protecting the CoreWeave Cloud, covering everything from data centers and GPU fleets to the platform layers that support customer AI workloads. The PKI & Secrets group manages the cryptographic infrastructure that ensures CoreWeave’s data and systems remain confidential, authentic, and tamper-resistant. Areas of focus include public key infrastructure (PKI), secrets management, hardware security modules (HSMs), key management, and code signing. This position involves close collaboration with teams across the company to deliver cryptographic services that meet high standards for security, reliability, and scalability. What you will do Design, implement, and maintain PKI infrastructure, including certificate authority (CA) hierarchies, issuance policies, certificate lifecycle management, and trust distribution for both Kubernetes clusters and bare-metal hosts. Manage and improve secrets management platforms, focusing on access policies, secret lifecycle governance, and integration using tools such as External Secrets Operator and cert-manager. Operate and scale HSM infrastructure, handling PKCS#11 integration, key ceremony procedures, and high-availability designs to support certificate authorities and signing services. Contribute to the development of key management systems and related security protocols. Locations This position is available in Livingston, NJ; New York, NY; Sunnyvale, CA; Bellevue, WA; and San Francisco, CA.
CoreWeave is The Essential Cloud for AI™. Designed by innovators for innovators, we provide a cutting-edge platform of technology, tools, and dedicated teams that empower our clients to confidently build and scale AI solutions. With trusted partnerships across leading AI labs, startups, and global enterprises, we combine top-tier infrastructure performance with profound technical expertise to drive breakthroughs and transform compute into capability. Since our founding in 2017, CoreWeave has evolved into a publicly traded company (Nasdaq: CRWV) as of March 2025. Discover more at www.coreweave.com.Key Responsibilities:As the Senior Product Marketing Manager dedicated to CoreWeave SUNK (Slurm on Kubernetes), you will spearhead the strategic positioning and market introduction of our advanced research and training cluster designed for the most demanding AI workloads. SUNK stands as a pivotal offering within the CoreWeave ecosystem, tailored to facilitate extensive, enduring training jobs where reliability, predictability, and operational visibility are as essential as performance.In this role, you will craft distinct positioning and messaging that showcases SUNK as the premier research cluster for AI training—a researcher-centric system that upholds the Slurm workflows that teams depend on while simplifying the operation and maintenance of expansive training environments. Your task will be to articulate SUNK's features, such as high-performance and topology-aware scheduling, automated cluster health management via Mission Control, and deep operational visibility, into compelling narratives that resonate with AI research leaders, platform teams, and technical buyers.You will work autonomously within a defined product scope while closely collaborating with product management and engineering to launch new SUNK capabilities. This will involve crafting launch narratives, integrating customer and competitive insights into messaging, and ensuring go-to-market assets accurately reflect the evaluation, deployment, and operational aspects of research clusters. Additionally, you will partner with sales, solutions architects, and marketing teams to promote consistent, outcome-oriented messaging across customer-facing initiatives.Success in this position demands strong technical fluency, sound judgment, and a drive for execution. You will proactively seek insights...
CoreWeave is The Essential Cloud for AI™. Founded by pioneers, CoreWeave provides an innovative platform of technology, tools, and expert teams that empower innovators to confidently build and scale AI solutions. Our services are trusted by top AI labs, startups, and global enterprises, combining exceptional infrastructure performance with extensive technical know-how to drive breakthroughs and enhance computational capabilities. Since our inception in 2017, we have grown to become a publicly traded company (Nasdaq: CRWV) as of March 2025. Discover more at www.coreweave.com.About the Role:The Senior Staff, Energy Operations is a pivotal, development-focused position within CoreWeave’s Data Center Development team. You will be responsible for managing the site-level utilities path necessary for advancing data center projects from initial diligence through development, construction coordination, and final delivery to operations—essentially overseeing the journey from site preparation to operational readiness.This role entails the comprehensive management of site-specific utility workflows, ensuring that each project is equipped with a clear, validated, and risk-mitigated pathway to capacity. You will collaborate directly with utility providers, developers, and internal teams to oversee utility feasibility, delivery schedules, cost estimations, and execution risks.Your duties will involve close coordination on utility execution, project timelines, budgeting, local stakeholder engagement, government relations, and collaboration with internal grid-planning teams responsible for policy, regulatory engagement, and long-term system strategy.Your Responsibilities:Site-Level Utility Ownership (Critical Development Gate)Oversee site-specific power feasibility and validation for all development opportunities.Serve as a critical development gate—no project proceeds without a verified and actionable utilities pathway.Confirm availability, delivery timelines, upgrade needs, costs, and risk profiles.Utility Coordination & ExecutionAct as the primary technical contact at the site level with utility providers.Lead utility execution and manage related timelines and budgets.
CoreWeave is a leader in cloud solutions tailored for AI applications, providing cutting-edge technology, tools, and expert teams that empower innovators to build and scale AI with assurance. Trusted by top AI labs, startups, and multinational corporations, CoreWeave merges exceptional infrastructure performance with extensive technical knowledge to accelerate innovation and transform computational capabilities. Established in 2017, CoreWeave went public on Nasdaq in March 2025. Discover more at www.coreweave.com.Key Responsibilities:As a Sustainable Data Center Design Manager, you will spearhead sustainability-focused design initiatives throughout our data center construction and engineering projects. This integral position within the Data Center Engineering team will work in close partnership with Corporate Sustainability to guarantee alignment with our long-term objectives regarding energy efficiency, carbon reduction, and resource optimization.Role Overview:This role emphasizes the integration of sustainable practices in data center design, ensuring that projects incorporate onsite renewable energy, energy-efficient methodologies, low-carbon materials, water conservation techniques, and alternative refrigerants. The ideal candidate will champion cost assessments, technical evaluations, and the deployment of solar energy systems, battery storage solutions, grid interactivity, high-efficiency mechanical and electrical systems, as well as lifecycle carbon analysis.As the primary technical authority for sustainable design strategies, you will collaborate closely with architects, engineers, sustainability advisors, and construction teams to guarantee the incorporation of sustainability best practices from initial concept through to final construction.Travel to various data center sites and relevant industry conferences will be required as necessary.Your Contributions Will Include:Acting as the liaison between corporate sustainability objectives and practical design execution, ensuring that high-level sustainability goals are realized through technically sound design solutions.Fostering collaboration and providing technical leadership alongside MEP engineers and architects.
CoreWeave
About CoreWeave:CoreWeave is The Essential Cloud for AI™, crafted by pioneers for innovators. We empower leading AI labs, startups, and global enterprises with robust technology, tools, and expertise to confidently build and scale AI solutions. Our superior infrastructure performance accelerates breakthroughs, transforming compute into capability. Since our inception in 2017, we have proudly become a publicly traded company (Nasdaq: CRWV) as of March 2025. Discover more at www.coreweave.com.Role Overview:The Storage Engine Team at CoreWeave is pivotal in delivering top-tier managed storage products. We are committed to developing reliable and scalable storage solutions that lead the industry in performance. Our team collaborates closely with engineering departments across infrastructure, compute, and platform to ensure our storage services effectively support the most demanding AI workloads globally.Your Responsibilities:Design and implement distributed storage solutions to accommodate the scaling of data-intensive AI workloads.Contribute to the development of exabyte-scale, S3-compatible object storage, integrating dedicated storage clusters into varied customer environments.Utilize technologies such as RDMA, GPU Direct Storage, and distributed filesystem protocols like NFS or FUSE to enhance storage performance and efficiency.Lead initiatives to bolster the reliability, durability, security, and observability of our storage stack.Work with operations teams to monitor, troubleshoot, and optimize storage systems in production environments.Establish metrics and dashboards to enhance visibility into storage performance and health.Analyze telemetry and system data to drive improvements in throughput, latency, and resilience.Collaborate cross-functionally with platform, product, and infrastructure teams to provide seamless storage capabilities across the stack.Share your expertise and mentor fellow engineers on best practices in constructing distributed, high-performance storage solutions.
Sign in to browse more jobs
Create account — see all 78 results

