company

Principal Architect - Performance Analysis and Modeling

d-MatrixSanta Clara
Hybrid Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

BSEE with 10+ years of industry experience or MSEE preferred with 8+ years of industry experience. Solid grasp through academic or industry experience in multiple of the relevant areas – computer architecture, hardware software co-design, and performance analysis.

About the job

At d-Matrix, we are pioneering the potential of generative AI to revolutionize technology. Positioned at the leading edge of software and hardware innovation, we are dedicated to pushing the limits of what is achievable. Our workplace culture is built on mutual respect and collaboration.

We embrace humility and value direct communication. Our team is diverse and inclusive, allowing for a variety of perspectives that lead to superior solutions. We are looking for individuals who are eager to take on challenges and are motivated by results. Are you ready to discover your playground? Together, we can explore the infinite possibilities of AI.

This position requires you to work on-site at our Santa Clara, CA headquarters for three days a week in a Hybrid work model.

Role Overview: Principal Architect - Performance Analysis and Modeling

d-Matrix is searching for exceptional computer architects to enhance AI application performance at the convergence of hardware and software, with a keen focus on cutting-edge hardware technologies (such as DIMC, D2D, 3D-DRAM, etc.) and innovative workloads (like generative inference, etc.). Our acceleration philosophy spans the entire system, encompassing efficient tensor cores, storage, and data movements, along with co-designing dataflow and collective communication techniques.

Key Responsibilities:

  • Analyze the latest machine learning workloads (including multi-modal LLMs, CoT reasoning models, and video/audio generation).

  • Contribute to the Hardware and Software features that empower the next generation of inference accelerators in data centers.

  • Stay current with research in ML Architecture and Algorithms, collaborating with various partner teams including Product, Hardware Design, Compiler, Inference Server, and Kernels.

  • Daily tasks will involve (1) analyzing the properties of emerging machine learning algorithms and workloads to discern functional and performance implications, (2) creating analytical models to forecast performance across current and future generations of d-Matrix hardware, and (3) proposing new hardware/software features to facilitate or accelerate these algorithms.

Qualifications:

Minimum:

  • Bachelor's degree in Electrical Engineering or related field with over 10 years of industry experience, or a Master's degree preferred with at least 8 years of experience.

  • Comprehensive understanding of relevant areas through academic or industry experience, including computer architecture, hardware-software co-design, and performance analysis.

About d-Matrix

At d-Matrix, we harness the transformative power of generative AI to drive technological advancement. Our commitment to innovation and collaboration fosters an inclusive environment where diverse perspectives pave the way for groundbreaking solutions.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.