Tailoring 0 resumes…

We'll move completed jobs to Ready to Apply automatically.

Member of Technical Staff - ML Systems & Inference at Gimlet Labs | San Francisco | RoboApply Jobs

Technical Staff Member - Machine Learning Systems & Inference

Gimlet LabsSan Francisco

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Entry Level

Qualifications

ResponsibilitiesDesign and optimize comprehensive inference pipelines from request ingestion to execution and response delivery.Develop and enhance inference runtimes that effectively manage latency, throughput, and concurrency under realistic load conditions.Analyze batching, queuing, and scheduling trade-offs, including their effects on tail latency and fairness.Oversee KV cache allocation, placement, reuse, and eviction strategies across models and requests.Optimize prefill and decode pathways, focusing on attention mechanisms and memory management.Profile and troubleshoot inference performance issues across model, runtime, and system boundaries.Collaborate closely with teams specializing in compilers, kernels, networking, and distributed systems to achieve end-to-end performance enhancements.QualificationsProficiency in machine learning frameworks and inference optimization techniques.Experience with performance profiling and debugging tools.Strong understanding of system architecture and hardware-software integration.Ability to work collaboratively in a fast-paced, innovative environment.Excellent problem-solving skills and attention to detail.

About the job

At Gimlet Labs, we are pioneering the development of the first heterogeneous neocloud designed specifically for AI workloads. As the demand for AI systems surges, traditional homogeneous infrastructures face critical limits in power, capacity, and cost. Our innovative platform effectively decouples AI workloads from their hardware foundations, intelligently partitioning tasks and orchestrating them to the most suitable hardware for optimal performance and efficiency. This strategy fosters heterogeneous systems that span multiple vendors and generations, including cutting-edge accelerators, enabling significant enhancements in performance and cost-effectiveness at scale.

In addition to this foundational work, Gimlet is establishing a robust neocloud for agentic workloads. Our clients benefit from deploying and managing their workloads via stable, production-ready APIs, without the need to navigate hardware selection or performance optimization intricacies.

We collaborate with foundation labs, hyperscalers, and AI-native companies to drive real production workloads capable of scaling to gigawatt-class AI datacenters.

We are currently seeking a Member of Technical Staff specializing in ML systems and inference. In this pivotal role, you will be responsible for designing and constructing inference systems that facilitate the execution of complete models in real production environments. You will operate at the intersection of model architecture and system performance to ensure that inference processes are swift, predictable, and scalable.

This position is perfect for engineers with a deep understanding of modern model execution and a passion for optimizing latency, throughput, and memory utilization across the entire inference lifecycle.

About Gimlet Labs

Gimlet Labs is at the forefront of AI infrastructure innovation, creating solutions that redefine how AI workloads are managed and executed. Our cutting-edge technologies empower businesses to leverage AI effectively while addressing scalability and efficiency challenges, ensuring they remain competitive in a rapidly evolving landscape.

Technical Staff Member - Machine Learning Systems & Inference

Gimlet LabsSan Francisco

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Entry Level

Qualifications

About the job

We collaborate with foundation labs, hyperscalers, and AI-native companies to drive real production workloads capable of scaling to gigawatt-class AI datacenters.

Technical Staff Member - Machine Learning Systems & Inference

Unlock Your Potential

Experience Level

Qualifications

About the job

About Gimlet Labs

Direct Appointment Setter at Southern National Roofing | Columbia, MD

Project Superintendent

Community Support Lead Care Manager at Pacific Health Group | Remote

Physical Therapist at Performance Optimal Health | New Canaan

Part-Time In-Home Veterinarian

Sales Support Specialist at Golden Lighting | Tallahassee, FL

New Home Sales Consultant at LGI Homes | Lebanon, TN

Medical Director - Licensed Psychiatrist

Recruiting Coordinator - Join Our Innovative Team

Experienced Litigation Paralegal - Remote

Senior Director of Digital Communications

Nutritional Cook for Early Childhood Center

FMS Analyst at ACT1 Federal | Patuxent River, MD

Automotive Technician Opportunity at Citrus Kia

Software Security Analyst at TP-Link Systems Inc. | Irvine, California

Network Intrusion Detection Engineer - Active TS/SCI with CI Poly

Tax Associate - Private Client

Lead Behavior Technician - Full-Time Position

Local Roofing Sales Representative - Roof Restoration Specialist

Senior Director of Inventory and Merchandise Planning

Technical Staff Member - Machine Learning Systems & Inference

Unlock Your Potential

Experience Level

Qualifications

About the job

About Gimlet Labs

Technical Staff Member - Machine Learning Systems & Inference

Unlock Your Potential

Experience Level

Qualifications

About the job

About Gimlet Labs

Technical Staff Member - Machine Learning Systems & Inference

Unlock Your Potential

Experience Level

Qualifications

About the job

About Gimlet Labs