This job is now closed
Job Description
- Req#: R-00227429
Join us as a Site Reliability Engineer
- Join our Chief Digital Information Office in Commercial & Institutional, where we harness technology innovation, business agility and one-bank collaboration to push the boundaries of what’s possible for our customers
- The wellbeing and growth of our people is fundamental to our shared success, which is why we’re passionate about cultivating an environment that fosters inclusion and champions potential
- Our journey will be challenging and complex, but truly transformative – so if you’re ready to stretch your capability, gain unique experience and shape the future banking experience for generations to come, this is your opportunity
What you'll do
You’re joining a team that’s passionate about the customer vision and delivering a seamless onboarding experience for our customers. Aligned to our ‘Start and Manage my banking relationship’ customer goals, you’ll be working with your colleagues to deliver the technology strategic roadmap, while ensuring the right balance between our business goals and building future technology, at the right cost.
As a Site Reliability Engineer (SRE), you’ll play a crucial role in designing, building, and maintaining our infrastructure and applications to ensure high availability, reliability, and performance. You’ll collaborate with cross-functional teams to drive improvements and enhance the overall efficiency of our systems. You’ll also participate in on-call rotations to respond to incidents and ensure system reliability.
In addition to this, you’ll:
- Collaborate with development teams to influence and improve application architecture for better scalability, reliability, and performance
- Develop and implement monitoring, alerting, and automation solutions to proactively identify and address potential issues
- Conduct performance analysis and capacity planning to ensure optimal resource utilisation
- Implement and advocate for best practices related to reliability, availability, and performance
- Conduct post-mortem analysis of incidents and implement preventive measures
The skills you'll need
We’re looking for someone with strong knowledge of reliability systems thinking and experience of software engineering. You’ll need experience of using a data-driven and scientific approach to fact finding. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes. Exposure to monitoring tools such as Splunk and API monitoring would be advantageous.
We’re also looking for:
- Proven experience in a Site Reliability Engineering role or a similar capacity
- Strong programming and scripting skills using Python, Go, Shell, Spring Boot, ReactJS, Redux, React Hooks, JavaScript, MS, Node.js, Express.js, HTML, CSS and SASS
- Expertise in designing and implementing monitoring and alerting solutions using Grafana or ELK stack
- Experience with continuous integration/continuous deployment (CI/CD) tools such as GitLab, Ansible, DWS or Consul
- Proficiency in infrastructure as code using Terraform or Ansible
Hours
35
Job Posting Closing Date:
13/02/2024
About the company
NatWest Group plc is a majority state-owned British banking and insurance holding company, based in Edinburgh, Scotland.