Search Jobs
HPC System Engineer
15800 Northup Way Bellevue, WA 98008 US
Job Description
Position Description: Protingent Staffing has an exciting FTE HPC System Engineer opportunity.
Job Responsibilities:
- Maintain Linux HPC Supercomputer systems availability to the customer, including in Azure Gov and on-prem infrastructure.
- Administer and maintain Linux based system software and firmware revisions, including patches, updates, and OS upgrades.
- Solve Linux system hardware, software, and third-party software issues, and provide detailed and thoughtful analysis of problem and resolution.
- Automate configuration management of infrastructure and applications, software updates, and maintenance and monitoring of system availability using modern DevOps tools (Ansible, GitHub, etc.)
- Installation, configuration, tuning, troubleshooting, and administration of commercial off-the-shelf (COTS), Open Source, and in-house developed applications leveraging HPC resources.
- Packaging, deployment, and management of software leveraging environment modules.
- Coordinate HPC infrastructure solutions and plan for growth.
- Actively connect with management regarding any problems with the equipment and propose resolution.
- Partner with IT Principal Engineering to define and execute roadmaps. Assist with gathering data for new feature, system, and/or advanced computing requirements from key stakeholders. Provide timely estimates for implementation delivery. Anticipate risks
- when planning and defining mitigation options.
- Respond to user queries regarding computing resources.
Job Qualifications:
- BA, BS, or MS in CS, EE, CE or equivalent experience.
- 5+ years of previous experience deploying and administrating production HPC clusters.
- Experience with managing an HPC resource scheduler (Slurm preferred).
- Proven track record to script in Bash or Python.
- Experience with MPI software and high-speed interconnects in HPC supercomputers.
- Experience with containers for HPC (Docker, Singularity, Apptainer).
- Deep understanding of operating systems, computer networks, and high-performance applications.
- Ability to work well with developers & test engineers.
- Proficiency in programming language such as Python, Fortran, C++, or R with the ability to learn from others as required.
- Proficient in using the Linux operating systems.
- Ability to multi-task and work cooperatively with others.
- The successful candidate will possess a high degree of trust and integrity, communicate openly and effectively and display respect with a desire to foster teamwork.
Job Details:
- Job Type; Direct Hire
- Salary Range: $113,605-$170,408
- Location: Bellevue, WA.
- Export control regulations require candidates to be a U.S. Citizen, U.S. Legal Permanent Resident, or of a protected person status
About Protingent: Protingent is a niche provider of top Engineering and IT talent to Software, Electronics, Medical Device, Telecom, and Aerospace companies nationwide. Protingent exists to make a positive impact and contribution to the lives of others as well as our community by providing relevant, rewarding, and exciting work opportunities for our candidates.
Meet Your Recruiter
Share This Job:
Related Jobs:
About Bellevue, WA
Are you sure you want to apply for this job?
Please take a moment to verify your personal information and resume are up-to-date before you apply.