companyHudson River Trading (HRT) logo

Software Engineer - GPU Reliability at wehrtyou | New York, NY

Hudson River Trading (HRT)New York, NY, United States
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Key Responsibilities:Design and maintain automation tools and software features for GPU management, monitoring, metrics collection, maintenance, and network configuration. Diagnose and resolve software and hardware issues within a fleet of GPU devices, addressing application, network, operating system, and kernel-related problems. Collaborate with engineering teams to optimize workloads and processes for improved GPU efficiency. Analyze GPU job statistics to uncover trends and identify areas for enhancement. Required Qualifications:Bachelor's or Master's degree in Computer Science or a related field. Minimum of 2 years of relevant experience in Python programming and GPU management. Proficiency in using automation to address challenges and enhance process efficiency. Experience in troubleshooting, tuning, and deploying various GPU hardware. Strong understanding of computer science principles and software design patterns. Solid knowledge of Linux/UNIX operating systems. Familiarity with open-source software. Ability to quickly debug and analyze technical issues. Exceptional multitasking skills with keen attention to detail. Strong team player with the ability to work independently. Eagerness to learn rapidly and apply new skills effectively. Preferred Qualifications:Understanding of Debian operating systems. Familiarity with systems configuration management and monitoring tools.

About the job

Join Hudson River Trading (HRT) as a Software Engineer dedicated to enhancing GPU reliability within our innovative Systems Development team. Our team is responsible for building and maintaining the foundational platform utilized by all Systems teams to provision, monitor, and manage HRT’s expansive server and network infrastructure. In this pivotal role, you will focus on developing Python-based tools to analyze GPU hardware performance while crafting inventive solutions to boost observability, reliability, and efficiency across our GPU fleet. Collaborating closely with various engineering teams, you’ll gain insights into research and trading workflows to ensure optimal utilization of our GPU infrastructure.

About Hudson River Trading (HRT)

Hudson River Trading (HRT) is a pioneering trading firm that leverages cutting-edge technology to deliver innovative trading solutions. Our commitment to excellence and a culture of collaboration drive our mission to optimize financial markets while ensuring a dynamic and engaging workplace for our employees.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.