Red Hat
Associate Site Reliability Engineer
This job is now closed
Job Description
- Req#: 100337
About the job
The Red Hat Ansible Engineering (https://www.ansible.com/) team is looking for a highly motivated individual with a self-starter mentality to join our Managed Ansible on Cloud team as a Site Reliability Engineer. In this role, you will work on a team of highly talented software engineers whose mission is to grow the Red Hat cloud offering for the Ansible Automation Platform.
Job Summary
Using your expertise in SRE principles of automation and continuous improvement, you will help create an environment where availability, reliability, and security are incorporated through the entire application lifecycle, not treated as an afterthought. As an SRE, you will build tooling to automate the building, testing, deployment, promotion, monitoring, alerting, and maintenance of the Red Hat Ansible Managed Application on Azure (https://www.redhat.com/en/technologies/management/ansible/azure) as well as other offerings.
You will get an opportunity to collaborate with diverse agile teams around the world to deliver value for our customers and partners in an open source way. This is also a great opportunity to hone your skills while working with a wide range of modern languages, frameworks, and technologies. As an Associate Site Reliability Engineer, you will have the opportunity to grow your skills in a variety of technologies and techniques while bringing solutions to market. You will contribute to a cloud-ready mentality for the Ansible organization. You will also become a part of Red Hat’s culture, which makes us unique in the industry. You will work with communities (e.g. https://www.ansible.com/community) passionate about open source software.
What you will do
- Develop and maintain software to automatically provision, upgrade, monitor, and heal Red Hat Ansible Automation Platform managed applications in Azure.
- Write Ansible automation playbooks to reduce toil.
- Support the operations of Red Hat Ansible Automation Platform by responding to and troubleshooting system alerts.
- Work with the team to perform root cause analysis on outages.
- Provide engineering support to Red Hat's global technical support team to resolve customer issues.
- Participate in a global on-call rotation, which could involve the occasional weekend or holiday.
What you will bring
- Passion for learning new technologies, building elegant software systems, troubleshooting complex technical issues, and automation.
- Software development experience. (Using Python or GoLang would be a plus.)
- Experience with Linux. (Linux administration experience would be a plus.)
- Experience with containerization technology. (Kubernetes would be a plus.)
- Basic understanding of computer networking including DNS.
- Basic knowledge of software development life cycle tools, like GitHub and Jenkins.
- Basic understanding of the software development life cycle (SDLC) and agile or scrum processes.
- Basic understanding of monitoring systems.
- Good written and verbal communication skills in English.
Experience with the following is considered a plus:
- Writing Ansible playbooks and administering the Red Hat Ansible Automation Platform
- Familiarity with data center networking and routing protocols
- Cloud native development/administration experience (AWS, GCP, Azure, etc.)
- Experience working remotely
About the company
Red Hat is the world’s leading provider of enterprise open source solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies.