companyePlus Inc. logo

Principal Solutions Architect

ePlus Inc.San Ramon, CA
On-site Full-time $170K/yr - $190K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Qualifications:Proven experience as a Solutions Architect or in a similar role, particularly within the AI/ML domain. Deep understanding of NVIDIA technologies and AI software stacks. Strong analytical skills with experience in workload sizing and TCO modeling. Excellent communication and interpersonal skills for effective collaboration with clients and internal teams.

About the job


Overview


We are looking for a highly skilled Principal Solutions Architect to spearhead the comprehensive design, sizing, and deployment of infrastructure aligned with NVIDIA's AI Factory. In this technical and customer-centric role, you will convert intricate AI and machine learning workload specifications into robust, engineered infrastructure solutions that encompass colocation facilities, GPU compute, high-performance networking, parallel storage, and the complete NVIDIA AI software ecosystem.

Your role will be that of a trusted technical advisor to enterprise and hyperscale clients, collaborating closely with sales, product, and engineering teams to secure and execute transformative AI infrastructure initiatives. Your insights will significantly influence how organizations construct and manage production AI Factories capable of training cutting-edge models, managing extensive inference fleets, and accelerating data science workflows on a large scale.


Your Impact


Solution Design & Architecture

  • Facilitate discovery workshops to gather AI/ML workload requirements, including model training scale, inference SLAs, data pipeline throughput, and multi-tenancy needs.
  • Architect comprehensive AI Factory solutions in accordance with NVIDIA reference architectures, integrating colocation, GPU compute, networking, storage, and software components.
  • Create detailed Bills of Materials (BOMs), rack elevation diagrams, network topology diagrams, and power/cooling budgets for client proposals.
  • Define GPU cluster architectures utilizing NVIDIA DGX, HGX, and MGX systems with B200, B300, and GB300 Blackwell SXM and NVLink-Switch configurations.
  • Design RTX PRO 6000 Blackwell Server Edition deployments tailored for inference-optimized and enterprise AI workloads.
  • Conduct workload sizing and TCO/ROI modeling to substantiate infrastructure dimensions for training, fine-tuning, and inference at scale.

Colocation & Facility Planning

  • Outline colocation requirements, including critical power load (MW-scale), UPS and generator configurations, and PUE targets.
  • Design high-density GPU deployments using air-cooled, direct liquid cooling (DLC), and rear-door heat exchanger setups.
  • Specify meet-me room (MMR) and cross-connect requirements; detail carrier-neutral telecom diversity strategies.
  • Engage with colocation providers and data center operators to validate...

About ePlus Inc.

About ePlus Inc.:ePlus Inc. is a leading technology company that specializes in delivering innovative IT solutions and services to enable organizations to achieve their digital transformation goals. Our mission is to empower businesses through advanced technologies and exceptional service.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.