company

Software Engineering Intern - Machine Learning Kernels

d-MatrixToronto
On-site Intern

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Entry Level

Qualifications

Required Skills:Proficient in implementing high-performance kernels in low-level languages; experience in Assembly/ISA is advantageous. Strong proficiency in Python and/or C++. Solid understanding of machine learning model architecture (e.g., LLMs, CNNs). Familiarity with ML frameworks like PyTorch and ML packages such as NumPy. General knowledge of computer architecture (CPU, GPU, CUDA).

About the job

At d-Matrix, we are pioneering the potential of generative AI to transform the technology landscape. Positioned at the cutting edge of both software and hardware innovation, we continuously strive to expand the horizons of what can be achieved in this dynamic field. Our workplace culture is built on the principles of respect and collaboration.

We emphasize the importance of humility and open communication. Our inclusive team thrives on diverse perspectives, which lead to innovative solutions. We are on the lookout for motivated individuals who are passionate about overcoming challenges and are driven by tangible outcomes. Are you ready to explore your playground? Together, we can unlock the infinite possibilities of AI.

Job Title: Software Engineering Intern - Kernels

Location: Toronto, Canada

Program Duration: 12 weeks: June 1st - August 21st or June 22nd - September 11th

Project Overview:

As a Software Engineering Intern on our Kernels team, you will have a significant impact on the development of high-performance kernels that are crucial for accelerating machine learning models. Your role will encompass creating reference implementations for accuracy verification, defining unit tests for the implemented operators, optimizing performance, conducting scalability analysis across varying problem sizes, and packaging and shipping the final implementations. You will also gather performance metrics and identify bottlenecks to enhance core functionality.

What You Will Do:

  • Implement high-performance kernels using low-level languages (Assembly/ISA experience is a plus).

  • Develop, test, and optimize kernels for machine learning models and performance.

  • Automate the creation of reference implementations and unit tests.

  • Analyze scalability and performance, collect metrics, and troubleshoot bottlenecks.

  • Package and share implementations with partner teams.

About d-Matrix

At d-Matrix, we are committed to driving innovation in the AI sector. Our focus is on harnessing generative AI technology, leading to meaningful advancements in software and hardware solutions. Our team embodies a collaborative spirit and values diverse perspectives, which enrich our problem-solving capabilities.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.