Full Job Description
Overview: HPC Administrator would be primarily responsible for end-to-end management, administration, upgrade, and enhancements of HPC Systems and System Integrations. This position will also be responsible for installation and support of applications on HPC used by Engineering & Technology teams.
Essential Responsibilities:
Responsible for Administration, Upgrade, Enhancements, and monitoring of HPC SystemsResponsible for supporting, maintaining, benchmarking & optimization HPC applications and servicesProvide end user support on technical issues, queries, data management, etc.Troubleshoot HPC applications operational problems quickly and effectively to meet business needsInstallation of new software and configuration of software packages and patch updates to meet user requirements on HPC parallel computing environmentMaintenance and creation of HPC user accounts and permissionsProvide Level 2 & 3 Technical support to a wide range of HPC & CAE Software ApplicationsEnsure Incidents and problem tickets issued against supported CIs get resolved as per OLAs/SLAsRegular HPC System health check including analysis of performanceTroubleshoot InfiniBand, Ethernet including Cables, Controllers, Drivers, IP address clashes, reassignment etc.Maintenance of Lustre storage and backup policiesEnsure adherence to IT security policies and proceduresAutomate HPC systems administration tasks using different configuration management toolsIdentify opportunities for improvement and implement the solutions by proactively learning new toolsEngage with customer to understand their business use case and the issues that impact them mostly
Eligibility Requirements:
Bachelor’s Degree in Mechanical Engineering or Information Technology3-5 years of relevant experience with HPC Clusters, Systems integrations, and application portingExperience with HPC modeling/simulation codes and optimizationExperience with HPC Schedulers and Resource Managers such as PBS, LSF etc.Must have strong Linux experienceExperience in Linux kernel modules, preferably for NVIDIA GPUsStrong scripting skills in shell and/or python
Strong knowledge on ITIL framework and follow Incident/Change/Problem management practices Desired Characteristics:
Ability to drive results in a dynamic and challenging environmentExcellent analytical and problem-solving skillsStrong interpersonal and Influencing skillsExcellent verbal & written communication skillsHighly self-motivated, disciplined and good team player
Job Information
Job Opening ID
SA-3057-JOB
Industry
IT Services
City
Bangalore North
State/Province
Maharashtra
Country
India
Zip/Postal Code
411001
Job Description: Undertake state of the art research projects into digital security topics of interest in support of the business....
Apply For This JobFull Job Description The ideal candidate will be responsible for recruitment efforts, new hire orientation and onboarding, employee termination, payroll...
Apply For This JobFull Job Description Date: 10-Aug-2022 Location: Mohali Company: Mahindra & Mahindra Limited Responsibilities & Key Deliverables Conduct design reviews basis...
Apply For This JobJob Description In a world of disruption and increasingly complex business challenges, our professionals bring truth into focus with the...
Apply For This JobRequisition Title – Subject Matter Expert Job Number: – IND033377 Description An innovative, flexible and collaborative work environment Join a...
Apply For This JobJob Description: In this role, utilize deep understanding of package solutions and recommend when appropriate. You will be providing key...
Apply For This Job
“`
Search qualified candidates by skills, location, experience, education, and more.
“`