Senior Site Reliability Engineer
Job Summary
Responsible for ensuring the reliability and performance of our staff facing Enterprise Platform used to build, test, deploy, observe, and operate Enterprise Applications, Databases, Web Services, Batch Jobs, and MATLAB Server Products. Partner with Business Applications teams to design and build highly scalable and reliable web infrastructure. Partner with Business. Applications teams to orchestrate and automate web application deployments. Build automation and process to enable self-serve production deployments. Code, configure, integrate, manage, and enhance various automation tools. Partner with the observability engineers to build out platform observability Platform Runtime Management. Participate in rotating on on-call support to ensure high availability of our mission critical web applications. Participate in application and infrastructure release and deployment activities.
May telecommute up to 40%.
Responsibilities
Qualifications
Required Qualifications:
Employer will accept a Master’s degree in Computer Science, Engineering [any], Mathematical Sciences, or a related field and two [2] years of experience in a Senior Site Reliability Engineer- related occupation.
Alternatively, employer will accept a PhD degree in Computer Science, Engineering [any], Mathematical Sciences, or related field and no experience or a Bachelor’s degree in Computer Science, Engineering [any], Mathematical Sciences, or related field and [5] five years of experience in a Senior Site Reliability Engineer- related occupation.
Position requires demonstratable experience with the following:
- Designing application infrastructure and automation operations;
- Knowledge of domain and industry tools (CNCF, DevOps, CI/CD, Secrets Management, or Container Registries);
- Deploying highly available applications in on-premises environments or public clouds;
- Experience with scripting languages (Java, Python, or Go) for automation;
- Working with Java Application Server administration and tuning (Glassfish or Jetty);
- Linux experience (Ubuntu, RHEL, or Amazon Linux) with a focus on Web (Apache or Nginx);
- Infrastructure as code tools (CloudFormation, Puppet, Chef, or Ansible);
- Working with build, test, and deployment tools (Maven, Gradle, or Jenkins);
- Working with Source Control Management Systems (GitLab, GitHub, or Perforce); and
- Working with container and container orchestration (Docker or Kubernetes).
Experience may be gained during graduate program. Will accept any suitable combination of education, training, and/or experience. Multiple positions available.