companyTenstorrent logo

Senior Engineer, Server Inference

TenstorrentBelgrade, Serbia
Hybrid Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Adept at designing contemporary APIs and enhancing the deployment of ML models in production environments. Inquisitive about achieving performance optimizations through techniques such as batching, caching, and model parallelism. Committed to clean software architecture and effective abstraction layers. Driven to deliver backend systems that developers can depend on.

About the job

Tenstorrent is at the forefront of groundbreaking AI technology, setting new benchmarks for performance, usability, and cost-effectiveness. As AI reshapes the computing landscape, our solutions are designed to integrate advancements in software models, compilers, platforms, networking, and semiconductors. Our talented team has crafted a high-performance RISC-V CPU from the ground up and is united by a shared enthusiasm for AI and a commitment to creating the premier AI platform. We cherish collaboration, curiosity, and a relentless drive to tackle complex challenges. We are expanding our team and are on the lookout for contributors at all experience levels.

Become a part of our Inference Server Technologies team, where we create software that drives cutting-edge AI inferencing on Tenstorrent’s innovative hardware. Our team focuses on building the layer that operates on top of Tenstorrent's ML libraries—designing APIs, deploying workloads, and benchmarking end-to-end inference speed. You will play a crucial role in shaping how developers engage with and scale model execution on Tenstorrent’s infrastructure.

This role is hybrid based in Belgrade, Serbia.

We encourage candidates of all experience levels to apply. During the interview process, we will evaluate candidates for the appropriate level, and offers will be tailored accordingly.

About Tenstorrent

At Tenstorrent, we lead the charge in AI innovation, developing technologies that redefine performance standards and accessibility. Our diverse team is dedicated to pushing the boundaries of what's possible, creating a collaborative environment that fosters curiosity and problem-solving. Join us as we build the future of AI.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.