About the job
Cerebras Systems is at the forefront of AI technology, creating the world's largest AI chip, which is 56 times larger than traditional GPUs. Our innovative wafer-scale architecture delivers the computational power of dozens of GPUs within a single chip, simplifying programming and enabling users to run expansive machine learning applications seamlessly. This revolutionary approach allows us to achieve unparalleled training and inference speeds, making it easier for machine learning practitioners to manage their workflows without the complexities of numerous GPUs or TPUs.
We are proud to serve a diverse clientele, which includes leading model labs, multinational corporations, and pioneering AI-focused startups. Recently, OpenAI partnered with Cerebras to leverage our technology for ultra-high-speed inference, transforming key workloads with a staggering 750 megawatts of scale.
Cerebras Inference is recognized as the fastest Generative AI inference solution globally, operating over ten times faster than GPU-based hyperscale cloud inference services. This drastic enhancement in speed is revolutionizing the user experience of AI applications, allowing for real-time iterations and augmented intelligence through enhanced computation capabilities.
About the Role
As an Engineering Lead, you will spearhead the development of a premier UI-based large-scale cluster management portal. This portal will serve as a comprehensive platform for all operations and maintenance of Cerebras clusters, encompassing cluster deployment management (from day 0 to 2), job scheduling, health monitoring, and more. Cerebras AI clusters consist of thousands of wafer-scale accelerator systems, high-end servers, and numerous networking ports including switches.
Responsibilities
- Serve as the key engineering representative and owner of the UI, ensuring backend integration follows industry best practices.
- Collaborate closely with product management and end-users to create a world-class tool.
- Provide robust technical leadership throughout the development process.
- Engage actively with various engineering teams to facilitate backend interactions.
- Design and implement a UI experience that is cohesive and intuitive across all operational and maintenance tasks.
- Mentor and guide a small team of engineers dedicated to this tool.

