companyEtched logo

Software Engineer - Performance Tools

EtchedSan Jose
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

We are seeking candidates with a strong foundation in software engineering principles, particularly those with experience in performance analysis tools. Ideal candidates will have:Proficiency in programming languages such as C/C++ and Python. Experience with performance analysis and profiling tools. Understanding of hardware architecture and low-level programming. Familiarity with machine learning concepts and their application in performance optimization. A problem-solving mindset and the ability to work collaboratively in a fast-paced environment.

About the job

About Etched

Etched is pioneering the development of the world's first AI inference system explicitly engineered for transformers, providing over 10x greater performance while significantly reducing costs and latency compared to traditional solutions. Our Etched ASICs enable the creation of products that were previously unattainable with GPUs, such as real-time video generation models and deeply parallel chain-of-thought reasoning agents. With substantial backing from leading investors and a team of top-tier engineers, Etched is reshaping the infrastructure for one of the fastest-growing industries in history.

Job Summary

We invite you to join our innovative team as a Software Engineer - Performance Tools. In this pivotal role, you will spearhead the development of an advanced performance analysis tool specifically designed for Sohu. Your expertise will be crucial in creating essential tools that empower our ML engineers and clients to comprehend workload behaviors, pinpoint performance limitations, and fully harness the potential of Sohu in accelerating the most demanding ML applications globally. This unique opportunity allows you to influence the performance analysis landscape for groundbreaking hardware from its inception.

Key Responsibilities

  • Tool Architecture & Design: Lead the architecture and design of a robust performance analysis suite, incorporating data collection mechanisms, processing pipelines, analysis engines, and user interfaces (CLI and/or GUI).

  • Low-Level Data Collection: Create reliable methods to gather performance data directly from our custom ML accelerator hardware (e.g., hardware performance counters, execution unit status, memory access patterns) through driver interfaces or other means.

  • Host & System Tracing: Establish tracing for host-side API interactions (runtime libraries, driver communications) and system-level events (CPU activity, PCIe traffic, memory usage, network contention) associated with Sohu workloads.

  • Data Correlation & Synchronization: Develop and implement methodologies for accurately correlating performance events across host CPUs, device drivers, PCIe buses, multiple accelerators, and various hosts, ensuring precise time synchronization.

  • Performance Analysis Engine: Construct analysis modules to automatically interpret the collected trace and counter data, identifying key performance limits (e.g., compute-bound, memory bandwidth-bound, latency-bound, PCIe-bound).

About Etched

Etched is at the forefront of AI technology, innovating solutions that empower advanced machine learning capabilities and redefine performance benchmarks. Our team, comprised of industry leaders, is committed to pushing the boundaries of what's possible in AI inference systems.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.