The Linux Systems Administrator position maintains, develops, and provides support for the high-performance computing (HPC) systems. This position participates in the need identification, design and implementation of new and expanded HPC technologies and projects. A significant portion of the support activities are focused on computational biology software in partnership with scientific staff.
Duties and Responsibilities
- Build, customize and implement software solutions to facilitate routine operations of an information technology group, including administration automation, performance monitoring, digital security support and user experience improvements
- Identify, document, plan and execute software solutions for the challenges of the institute and alignment of best practices
- Troubleshoot performance and resource related software problems
- Contribute to the design and development of new services to build on our technology offerings to the institute and supporting scientific computation
- Participate in the resolution of production issues and lead efforts toward solutions
- Administer Linux OS servers and desktops
- Set up and manage a resource scheduler on the shared computational servers
- Support the implementation of major architectural changes
- Maintain local and web application servers to ensure uptime and responsiveness
- Proactive monitoring, troubleshooting, triage and resolution or escalation of issues
- Maintain documentation of systems and processes
Qualifications:
To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Education and Experience
- Bachelor’s degree in computer science or related field
- Minimum of three years of related work experience in computer science
- Proficiency in programming skills: Python/Perl or GO
- One year of Linux Certification (RHCA / RHCE) preferred
- Strong oral, written communication, and presentation skills
- Ability to work independently and as part of a team to meet project goals
- Willing to learn new skills and knowledge quickly in a fast-paced work environment
- HPC Resource management and scheduling systems (SLURM, TORQUE, Apache Spark)
- Three-years of experience with Linux systems administration and/or Linux-oriented Software Development in production environments
- One-year of experience with application containerization, such as Docker or Podman, deploying/managing production apps
- One-year of Ansible or similar configuration management system, experience preferred in a production environment
Skills and Abilities
- Excellent communication and interpersonal skills and experience facilitating teamwork in a multidisciplinary group of scientists
- High level of attention to detail
- Ability to resolve complex problems, evaluate options and implement solutions
- Ability to organize and coordinate work of colleagues
How to Apply:
For immediate consideration, please submit a cover letter and Curriculum vitae at: hr@pnri.org. NO PHONE CALLS PLEASE.