About the job
Cerebras Systems is at the forefront of AI innovation. We manufacture the largest AI chip in the world, 56 times larger than a conventional GPU. Our wafer-scale architecture delivers the computational power of dozens of GPUs on a single chip while presenting the programming model of a single device. This approach lets us offer unmatched training and inference speeds, so machine learning practitioners can run large-scale ML applications without the complexity of orchestrating fleets of GPUs or TPUs.
Our customers include leading model laboratories, major global corporations, and innovative AI-native startups. Notably, OpenAI recently partnered with Cerebras for 750 megawatts of compute capacity, powering critical workloads with ultra-high-speed inference.
Our wafer-scale architecture makes Cerebras Inference the fastest Generative AI inference solution available, outperforming GPU-based hyperscale cloud inference services by more than tenfold. This speed advantage is reshaping the user experience of AI applications, enabling real-time iteration and deeper intelligence through additional agentic computation.
We launched Cerebras Inference in late 2024, and have since rapidly scaled the service to meet rising demand from AI labs, enterprises, and a vibrant developer community.
In October 2025, we closed our Series G funding round, raising $1.1 billion USD to accelerate the growth of our products and services and meet global AI demand.
About the Team
The Cerebras Inference team is dedicated to delivering the most efficient, secure, and reliable enterprise-grade AI service. We design and operate large-scale distributed systems that serve AI inference with unparalleled speed and efficiency. Join us in scaling our inference capabilities to new heights!

