High Performance Computing (HPC) Support Engineer

Data Systems Analysts, Inc.
Charlottesville, VA
On-site, Full-time

Key Responsibilities:

Deliver user support for computational workloads in both classified and unclassified HPC clusters.
Assist users in creating, submitting, and troubleshooting scheduler job scripts, focusing on resource allocation for CPU, GPU, and distributed compute workloads.
Diagnose slow, hanging, or failing HPC jobs, including MPI-based distributed workloads, GPU jobs, and large-scale parallel applications.
Support users in compiling and executing scientific, modeling, or data processing applications in Linux-based HPC environments.
Provide guidance on best practices for job scheduling, compute resource allocation, and workload performance.
Monitor workload execution patterns and offer strategies for improving cluster throughput and resource utilization.
Develop scripts or tools using Bash or Python to automate routine tasks.
Maintain documentation and knowledge base articles detailing system capabilities, job execution procedures, and troubleshooting protocols.
Engage in performance analysis of compute workloads to identify inefficiencies or configuration challenges.
Coordinate with HPC systems engineers when infrastructure or cluster configuration issues affect workload performance.
Deliver responsive onsite support for users executing HPC workloads.

About the job

All hired employees are expected to have experience with Microsoft Copilot and/or an approved equivalent AI solution.

Description:

Data Systems Analysts, Inc. (DSA) is seeking an HPC Support Engineer with a TS/SCI clearance to assist users in executing computational workloads within secure High Performance Computing (HPC) environments. The role involves working directly with engineers, analysts, and researchers to support job execution, troubleshoot workload failures, and improve the performance and efficiency of compute workloads on HPC clusters.

The HPC Support Engineer will help users develop, submit, and troubleshoot scheduler job scripts for systems like Slurm or PBS, optimize resource allocation for CPU, GPU, and distributed compute workloads, and promote best practices for efficient HPC cluster utilization.

This position requires a strong Linux background, scripting skills, and familiarity with distributed computing environments used for scientific or engineering workloads. The role is based onsite in Charlottesville, VA.

About Data Systems Analysts, Inc.

Data Systems Analysts, Inc. (DSA) is a leading provider of technology solutions and support services, specializing in High Performance Computing and advanced analytics. DSA is committed to fostering innovation and efficiency in computational environments, ensuring our clients achieve their mission objectives with precision and excellence.