About the job
Cerebras Systems is at the forefront of AI innovation, creating the world’s largest AI chip, a staggering 56 times larger than traditional GPUs. Our revolutionary wafer-scale architecture delivers the computational power of dozens of GPUs within a single chip, paired with the simplicity of a unified programming interface. This unique approach enables us to achieve unparalleled training and inference speeds, empowering machine learning practitioners to execute large-scale ML applications effortlessly, without the complexities associated with hundreds of GPUs or TPUs.
Our customers include leading model labs, global enterprises, and pioneering AI-native startups. Recently, OpenAI announced a multi-year collaboration with Cerebras to leverage 750 megawatts of compute capacity, accelerating key workloads through ultra-high-speed inference.
Thanks to our groundbreaking wafer-scale architecture, Cerebras Inference provides the fastest generative AI inference solution available today, with speeds more than ten times faster than GPU-based hyperscale cloud services. This leap in speed is reshaping the user experience of AI applications, enabling real-time iteration and more capable agentic computation.
About the Role
Join our inference model team, dedicated to advancing state-of-the-art models by numerically validating and accelerating innovative concepts on our wafer-scale hardware. In this role, you will prototype architectural enhancements, construct performance evaluation pipelines, and translate quantitative insights into actionable changes that drive production success.
Key Responsibilities
- Prototype and benchmark new ideas such as novel attention mechanisms, mixture-of-experts (MoE) architectures, speculative decoding, and other emerging techniques.
- Create agent-driven automation tools that design experiments, schedule runs, triage regressions, and prepare pull requests.
- Collaborate closely with compiler, runtime, and silicon teams, gaining a unique perspective on the complete software/hardware innovation stack.
- Stay current with the latest open- and closed-source models; run them on wafer-scale hardware first to identify new optimization opportunities.