The University of Texas at Dallas logo

The University of Texas at Dallas

Systems Engineer II

🇺🇸 Hybrid - Richardson, TX

🕑 Full-Time

💰 $70K

💻 Information Technology

🗓️ August 4th, 2025

Ansible Docker Python

Edtech.com's Summary

University of Texas at Dallas is hiring a Systems Engineer II. This role involves supporting and managing the BigTex and UTD HPC clusters, installing and maintaining software for researcher needs, ensuring software security through lifecycle processes, and leveraging automation to manage HPC environments running on Linux.

Highlights
  • Support and manage HPC clusters including BigTex and UTD clusters.
  • Install, configure, and maintain scientific and research software packages.
  • Implement software lifecycle processes to maintain security and patching.
  • Utilize automation tools to streamline HPC system administration.
  • Provide level 2 user support and manage support tickets for faculty and students.
  • Required skills include knowledge of Linux OS, HPC systems (OpenHPC, SLURM), container technologies (Docker, Apptainer), and network interconnects (Ethernet, Infiniband).
  • Preferred qualifications include Master's degree in Computer Science, experience with HPC deployment, troubleshooting, and dev ops tools (GitHub, Ansible).
  • Salary starts at $70,000 depending on qualifications and internal equity.
  • Responsibilities include scripting, packaging software into RPMs, and supporting high performance parallel file storage systems.
  • Must be able to participate in on-call rotations and comply with security requirements for university critical infrastructure.

Systems Engineer II Full Description

Systems Engineer II
Posting Number  | S06531P
Position Title  | Systems Engineer II
Functional Title  | System Engineer II
Department  | Information Technology-Cyber Infrastructure & Research
Salary Range  | Starting at $70,000 dependent on qualifications and internal equity
Pay Basis  | Monthly
Position Status  | Regular full-time
Location  | Richardson
Position End Date (if temporary)  |
Posting Open Date  | 08/01/2025
Posting Close Date  |
Open Until Filled  | Yes
Desired Start Date  | 09/01/2025

Job Summary
Reporting to the Director of HPC Operations. This is a mid-level Linux and HPC systems engineer with a background in a High Performance Computing environment. To collaborate with and support our customers, this engineer must have demonstrated a consultative customer service attitudes in prior roles in similar organizations. Primary responsibilities include: Support and manage the BigTex HPC cluster and UTD HPC clusters, install and manage the vast list of software required to support researcher needs. Keep software secure by implementing a software lifecycle process to keep software patched and supported. Leverage automation to simplify management and administration of our HPC environment. The applicant must have broad industry knowledge of hardware and software services involved in building and operating HPC environments using Linux operating system.

Minimum Education and Experience  | Bachelor’s degree and two (2) years of related experience or an equivalent combination of relevant education and experience may be considered.

Preferred Education and Experience  | Master’s degree in Computer Science or equivalent with two years of experience in corresponding research services, support efforts, products and technologies. Current knowledge of HPC best practice and systems deployment and maintenance. Troubleshooting methodology and awareness of industry standards. Excellent interpersonal, written, and verbal communication skills are a must. Good technical documentation, architecture diagramming, and organizational skills. Experienced in supporting on-premises and code storage platforms, intermitted abilities supporting and administrating operating systems (Multiple Linux Versions) and ability to apply security policies to platforms and integrates new hardware into our HPC framework. Experience in supporting and operating 1Gbps – 100Gbps Ethernet and 56Gbps – 200 Gbps Infiniband HPC network interconnects. Ability to manage support tickets and prioritize considering varied scope, scale, and technical requirements. Familiarity with data center operations fundamentals in networking and power. Familiarity with containers (docker, podman, apptainer). Familiarity with Open OnDemand. Familiarity with Singularity/Apptainer containers for HPC. Familiarity with Lmod or environment modules. Familiarity with Apptainer/Singularity HPCFamiliarity with SLRUM, Warewulf, and OHPC 2.0/3.0 Familiarity with Open MPI

Other Qualifications  | To the extent this position requires the holder to research, work on, or have access to critical infrastructure as defined in Section 117.001(2) of the Texas Business and Commerce Code, the ability to maintain the security or integrity of the critical infrastructure is a minimum qualification to be hired and to continue to be employed in the position.

Essential Duties and Responsibilities  | Be the primary software support engineer for HPC clusters (including BigTex) with support from core HPC team for complex scenarios Respond to user tickets from faculty and students. Level 2 support experience at scale of 1 to 3 with 3 being a senior specialist. Act as a role model in demonstrating integrity and ethical behavior in working with confidential and university information. Use high performance cluster systems such as OpenHPC and SLURM job scheduler to support HPC Operations Assist in development and implementation of internal policies, rules, and operation procedures for Research Computing and Cyber infrastructure to guarantee various assurance models such as NIST 800-53 and NIST 800-171 under which assured research is conducted. Perform annual updates, intermediate level software coding (prefer Python, Linux Shell, etc.) in at least two or more languages. Perform installation, configuration, updating, networking, performance monitoring and troubleshooting of software on HPC Systems on Linux platform Troubleshoot, modify, catalog, document, and update scripts. Package scientific software into RPMs and integrate with Lmod—so users can `module load <software>` Compile, test and install many related open-source scientific software packages as requested by research faculty, staff, and students.

Physical Demands and Working Conditions  | Sitting for extended periods of time. Dexterity of hands and fingers to operate a computer keyboard, mouse, power tools, and to handle other computer components. Lifting and transporting of moderately heavy objects, such as servers, switches, computers, and peripherals.

Physical Activities  |
Working Conditions  |
Additional Information  | KNOWLEDGE, SKILLS & ABILITY: Familiarity with at least one high performance cluster operating systems such as OpenHPC, ROCKS, Bright/Nvidia Cluster Manager Experience with all related dev ops tools such as GitHub, GitLab, Ansible, package management tools for rpm and or deb package building. Familiarity with large scale high performance parallel file storage systems such as WEKA, VAST, GPFS, BeeGFS, CEPH. Experience with installing and supporting: Open source and commercial research related software, Python, R, Matlab, Mathworks, Julia, Ansys, Intel, nVidia cuda and GCC compilers. Experience with dev ops tools such as GitHub, GitLab, Ansible, package management tools for rpm and or deb package building. Experience with SLURM job scheduler. 

ADDITIONAL INFORMATION:
Candidates will be subject to a criminal background check Must have adequate transportation to be able to drive to other work locations as necessary On-call availability for quickly responding to and resolving system emergencies, both during regular and emergency off-hours. Emergency on-call rotation availability for 24×7×365 coverage. Visa sponsorship is not available.

Remote Work Eligibility Statement

Hybrid Remote Work Available for Texas Residents with further discussion and agreement with their supervisor.

Special Instructions Summary  |
Important Message  | 1) All employees serve as a representative of the University and are expected to display respect, civility, professional courtesy, consideration of others and discretion in all interactions with members of the UT Dallas community and the general public.

2) The University of Texas at Dallas is committed to providing an educational, living, and working environment that is welcoming, respectful, and inclusive of all members of the university community. UT Dallas does not discriminate on the basis of race, color, religion, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, national origin, disability, genetic information, or veteran status in its services, programs, activities, employment, and education, including in admission and enrollment. The University is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities. To request reasonable accommodation in the employment application and interview process, contact the ADA Coordinator. For inquiries regarding nondiscrimination policies, contact the Title IX Coordinator.

Supplemental Questions
Required fields are indicated with an asterisk (*).
  1.  What is your experience level with High Performance Computational resources & services? 
    • No Response
    • Beginner 0-2 years
    • Intermediate 3-5 years
    • Advanced 5+ years
  2.  What is your experience with process documentation? 
    • Beginner (0-2 years)
    • Intermediate (2-5 years)
    • Advanced (5+ years)
  3.  Describe your experience working with Research Pl's on computational cyberinfrastructure needs. 
    (Open Ended Question)
Applicant Documents
Required Documents
  1. Resume
  2. Cover Letter/Letter of Application 
Optional Documents
  1. Veteran Employment Preference - Form DD-214
Human Resources, 
 800 West Campbell Road, AD3.418 
 Richardson, 
 TX 75080-3021