Senior Systems Engineer
Back Apply Share
501483 Work type:
Full Time Location:
Medical Center School/Department: Systems Biology
Grade: Grade 105
The ideal candidate will be effective in the engineering of systems and the managing engineering personnel. The candidate will be reporting to the Director of IT and will assist with technical direction and the research of new technologies. The candidate will require experience in system design and implementation requiring a high level of technical expertise in a variety of technical disciplines such as, high-performance computing, enterprise storage systems, and data center management. He or she will also be responsible for advising desktop personnel on security, storage, and system domain management issues, to help ensure high-quality desk support services for a community of approximately 100 scientific researchers.
The ability to adjust to rapidly changing technical requirements is essential, as is willingness to assume high level responsibilities and take initiative. Ability to multi-task, solve complex problems and work well with various external groups is crucial. Strong interpersonal and communications skills are required. The physical ability to install rack mounted servers is needed. Ability to handle off-hours, on-site support calls. Must possess the following technical skills:
Work closely with the IT Director to research and architect new infrastructure for capital equipment purchases. Propose designs for review and assist the IT Director with proposed design technical details for grant applications.
Carry out technical projects assigned by the IT Director to completion. Projects may include, HPC cluster installation and configuration, HPC CentOS upgrades, working with the storage vendor to install NAS storage version updates and trouble shoot storage hardware alerts. Provide written project status updates to the IT Director and discuss blocking points.
Oversee the technical design, maintenance and support of a ~6500 core high performance compute (HPC) cluster. This responsibility includes HPC networking to ensure inter-node connectivity and user connectivity, respond to ISO system security scan reports and provide remediation to all vulnerabilities reported, research application maintenance (e.g, MatLab, R), Sun Grid Engine management to monitor HPC cluster jobs and coordinate version updates, the use of programming languages (Python, Perl, C/C++) to automate tasks and provide HPC user support.
Provide escalated technical guidance to junior engineering and desktop personnel.
Responsible for the design structure and maintenance of an Isilon & Qumulo 5 PB enterprise network attached storage system (NAS).
The position entails some user and vendor interaction as well as reporting, instruction, system alarm investigation and documentation. The incumbent will operate as a member of the Systems Management team and must be a team player.
Some off-hours coverage will be required.
Additional related responsibilities as assigned by the Director of IT.
Bachelor’s degree or equivalent in education and experience, plus at least four years of related experience. Demonstrated knowledge of system design and implementation related to Active Directory, distributed computing, enterprise storage, system virtualization and networking. Experience in high-performance computing and big data environments. Excellent written and verbal communications skills.
Master’s degree preferred.
Expert knowledge of Active Directory domains and forests, LDAP & Kerberos in heterogeneous environments.
Expert knowledge of Linux (Ubuntu, CentOS), common UNIX services, Shell scripting, LINUX C, Perl and Python.
Expert knowledge of TCP/IP networking, network security, and DNS (BIND, Windows).
Expert knowledge of SAN and NAS services (iSCSI, NFS, CIFS).
Expert knowledge of MS Windows server to provide infrastructure support to desktop personnel and virtual server support to external customers.
Experience with data center management including power & cooling systems, on-call datacenter support, data center CORE network design and support.
Expert knowledge of TCP /IP networking, network security and DNS.
Equal Opportunity Employer / Disability / Veteran
Columbia University is committed to the hiring of qualified local residents.