Job Title:
Lead High Performance Computing Architect

Company: Icahn School of Medicine at Mount Sinai

Location: New York City, NY

Created: 2024-05-09

Job Type: Full Time

Job Description:

Strength Through Diversity

Groundbreaking science. Advancing medicine. Healing made personal.

Roles & Responsibilities:

The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD/PhD-level support for researchers. The group is composed of a high-performance computing team, the research clinical data warehouse team, and a research data services team.

The Lead HPC Architect, High Performance Computational and Data Ecosystem, is responsible for architecting, designing, and leading the technical operations for Scientific Computing's computational and data science ecosystem. This ecosystem includes high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. To meet Sinai's scientific and clinical goals, the Lead brings a strategic, tactical, and customer-focused vision to evolve Sinai's computational and data-rich environment to be continually more resilient, scalable, and productive for basic and translational biomedical research. Developing and executing this vision requires a deep technical understanding of best practices for computational, data, and software development systems, along with a strong focus on customer service for researchers. The Lead is an expert troubleshooter and a productive team member, and is a productive partner for researchers and technologists throughout the organization and beyond. This position reports to the Director for Computational & Data Ecosystem in Scientific Computing.

- Lead the technical operations, including the architecture, design, expansion, monitoring, support, and maintenance of Scientific Computing's computational and data science ecosystem, consistent with best practices.
  Key components include a 50,000+ core, 30+ petabyte (usable) high-performance computing cluster, a clinical data warehouse, and a software development environment.
- Lead the troubleshooting, isolation, and resolution of all technical issues.
- Lead the design, development, implementation, and management of all system administration tasks, including hardware and software configuration, configuration management, system monitoring (including the development and maintenance of regression tests), usage reporting, system performance (file systems, scheduler, interconnect, high availability, etc.), security, networking, and metrics.
- Ensures that the design and operation of the HPC ecosystem is productive for research.
- Collaborates effectively with research and hospital system IT, compliance, HIPAA, security, and other departments to ensure compliance with all regulations and Sinai policies.
- Partners with peers regionally, nationally, and internationally to discover, propose, and deploy a world-class research infrastructure for Mount Sinai.
- Prepares and manages budgets for hardware, software, and maintenance.
  Participates in chargeback/fee recovery analysis and provides suggestions to make operations sustainable.
- Lead the integration of HPC resources with laboratory equipment such as genomic sequencers.
- Researches, deploys, and optimizes resource management and scheduling software and policies, and actively monitors them.
- Designs, tunes, manages, and upgrades parallel file systems, storage, and data-oriented resources.
- Researches, deploys, and manages security infrastructure, including development of policies and procedures.
- Lead and assist the team to resolve user support requests from researchers.
- Assists in developing and writing system designs for research proposals.
- Lead the development of a framework for effective system documentation.
- Works effectively and productively with other team members within the group and across Mount Sinai.
- Provide after-hours support in case of a critical system issue.

Qualifications

- Bachelor's degree in computer science, engineering, or another scientific field. Master's or PhD preferred.
- 8 years of progressive HPC system administration and operations experience (preferably in a Red Hat/CentOS Linux, batch HPC cluster environment).
- Must be an expert troubleshooter, a team player, and customer focused.
- Strong experience with configuration management systems such as xCAT, Puppet, and/or Ansible.
- Strong experience with networking and security.
- Strong experience with InfiniBand and Gigabit Ethernet.
- Experience with LSF and GPFS/Spectrum Scale parallel file systems and storage.
- Experience with providing technical operations leadership.
- Ability to manage a variety of disparate tasks and priorities independently and troubleshoot complex technology problems.
- Attention to detail; time and project management skills.
- Excellent communication skills, analytical ability, strong judgment and management skills, and the ability to work effectively as a liaison between research and technology teams.
- Strong written, oral, and interpersonal communication skills.
- Scripting and programming experience.

Preferred Experience

- Experience with archival storage and tape libraries (TSM) is highly preferred.
- Experience with databases and web services is highly preferred.
- Compliance, HIPAA, GDPR, FISMA.
- Experience with managing web access to HPC resources (such as Open OnDemand).
- Experience in a research environment is highly preferred.
- Experience with financial budgets and providing cost-benefit analysis is preferred.
- Cloud Technology.

Strength Through Diversity

The Mount Sinai Health System believes that diversity, equity, and inclusion are key drivers for excellence. We share a common devotion to delivering exceptional patient care. When you join us, you become a part of Mount Sinai's unrivaled record of achievement, education, and advancement as we revolutionize medicine together. We invite you to participate actively as a part of the Mount Sinai Health System team by:

- Using a lens of equity in all aspects of patient care delivery, education, and research to promote policies and practices to allow opportunities for all to thrive and reach their potential.
- Serving as a role model confronting racist, sexist, or other inappropriate actions by speaking up, challenging exclusionary organizational practices, and standing side-by-side in support of colleagues who experience discrimination.
- Inspiring and fostering an environment of anti-racist behaviors among and between departments and co-workers.

We work hard to acquire and retain the best people and to create an inclusive, welcoming, and nurturing work environment where all feel they are valued, belong, and are able to advance professionally.
We share the belief that all employees, regardless of job title or expertise, contribute to the patient experience and quality of patient care.

Explore more about this opportunity and how you can help us write a new chapter in our history!

About the Mount Sinai Health System:

Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 43,000 employees working across eight hospitals, more than 400 outpatient practices, more than 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time: discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it. Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients' medical and emotional needs at the center of all treatment.

EOE Minorities/Women/Disabled/Veterans

Compensation

The Mount Sinai Health System (MSHS) provides a salary range to comply with the New York City Law on Salary Transparency in Job Advertisements. The salary range for the role is $103,000 - $202,155 annually. Actual salaries depend on a variety of factors, including experience, education, and hospital need. The salary range or contractual rate listed does not include bonuses/incentive, differential pay, or other forms of compensation or benefits.