companyMoonlake logo

Technical Staff Member - Advanced Machine Learning Optimization

MoonlakeSan Mateo
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Ideal candidates should possess strong expertise in machine learning frameworks, experience with high-performance computing, and proficiency in optimizing algorithms for large-scale data processing. A solid understanding of GPU architectures and programming is crucial.

About the job

Join Moonlake, a pioneering company harnessing AI to develop immersive world simulations.

Role Overview

Enhancing Training Efficiency

  • Implement data loaders, fusion techniques, activation rematerialization, and gradient checkpointing.

  • Optimize training with FSDP/ZeRO/tensor+pipeline parallelism and NCCL tuning.

Improving GPU and Kernel Performance

  • Conduct Nsight profiling, develop Triton/CUDA kernels, and create fused operations.

  • Implement flash-attention style accelerations, sequence packing, and KV-cache optimizations.

Optimizing Inference

  • Focus on low-latency serving, continuous batching, and speculative decoding strategies.

  • Apply quantization methods (GPTQ/AWQ), distillation, and pruning techniques.

Infrastructure and Reliability

  • Manage SLURM/Kubernetes multi-node jobs and ensure checkpoint hygiene.

  • Maintain determinism, environment pinning, and effectively handle GPU failures.

Our dedicated team thrives on collaboration in our San Mateo office.

About Moonlake

Moonlake is at the forefront of AI technology, specializing in creating captivating world simulations that push the boundaries of imagination and interactivity.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.