Job Title:
HPC Systems Administrator with (LFS & Slurm)

Company: CBTS

Location: Cincinnati, OH

Created: 2024-05-05

Job Type: Full Time

Job Description:

This is NOT A LINUX Admin Role Candidates MUST LIVE within a 70 Mile radius of Downtown Cincinnati, OH 454202Direct Hire/Fulltime Position - Green Card / US Citizen -Salary $110,000 +Exceptional Benefits & 95% Work From HomeIs this your next job Read the full description below to find out, and do not hesitate to make an application.CBTS is searching for a HPC Systems Administrator with LFS and Slurm Workload management experience. The candidate will be responsible for the administration of HPC scheduler (LSF today moving to Slurm this fall) and resource management systems. This includes experience in building custom software modules in an HPC setting, requiring knowledge of programming language compilers (e.g., gcc) and the typical software build process on Linux systems. The role also involves performing advanced HPC job troubleshooting with end users and providing Linux system build and operations support for RHEL/OEL/CentOS based distributions.Preference is given to candidates with experience in Puppet/Satellite or related configuration management tools and those proficient in configuring LDAP, DNS, networking, storage, services, and logging. This position requires working closely with developers, data scientists, and platform users to assess needs and develop innovative and/or custom technology solutions. An ideal candidate must be willing to understand customer processes and work collaboratively to produce solutions, including system design, job scheduling, and data management.The senior role is expected to lead medium-sized projects with minimal supervision. Responsibilities include coordination with other teams, developing proof-of-concepts, scheduling downtimes, training team members, and documentation.Requirements:Bachelor's degree in a related field or equivalent combination of education and experienceMinimum 2 years High Performance Computing (HPC) cluster administration experienceMust have 2 Years of experience working with at least IBM Spectrum LSF (In Use Today) or Slurm (Moving to this Fall 2024) workload managers - Job Schedulers5-7 years of work experience in a related job disciplineRHEL-based system administrationTCP/IP and networking fundamentals (DNS, DHCP)Scripting/Programming (bash, Python)Docker/Container knowledgeKubernetes (Rancher) exposureBasic C/C++/Java knowledgeUnique Skills:AI or Machine Learning knowledgeExperience with NVIDIA GPUs and related toolsetsCincinnati Bell Technology Solutions provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a protected veteran in accordance with applicable federal, state and local laws.