NatWest Group

Site Reliability Engineer


Pay: Competitive
Location: London/England
Employment type: Full-Time

This job is now closed

  • Job Description

      Req#: R-00227429

      Join us as a Site Reliability Engineer

      • Join our Chief Digital Information Office in Commercial & Institutional, where we harness technology innovation, business agility and one-bank collaboration to push the boundaries of what’s possible for our customers
      • The wellbeing and growth of our people are fundamental to our shared success, which is why we’re passionate about cultivating an environment that fosters inclusion and champions potential
      • Our journey will be challenging and complex, but truly transformative – so if you’re ready to stretch your capability, gain unique experience and shape the future banking experience for generations to come, this is your opportunity

      What you'll do

      You’re joining a team that’s passionate about the customer vision and delivering a seamless onboarding experience for our customers. Aligned to our ‘Start and Manage my banking relationship’ customer goals, you’ll be working with your colleagues to deliver the technology strategic roadmap, while ensuring the right balance between our business goals and building future technology, at the right cost.

      As a Site Reliability Engineer (SRE), you’ll play a crucial role in designing, building, and maintaining our infrastructure and applications to ensure high availability, reliability, and performance. You’ll collaborate with cross-functional teams to drive improvements and enhance the overall efficiency of our systems. You’ll also participate in on-call rotations to respond to incidents and ensure system reliability.

      In addition to this, you’ll:

      • Collaborate with development teams to influence and improve application architecture for better scalability, reliability, and performance
      • Develop and implement monitoring, alerting, and automation solutions to proactively identify and address potential issues
      • Conduct performance analysis and capacity planning to ensure optimal resource utilisation
      • Implement and advocate for best practices related to reliability, availability, and performance
      • Conduct post-mortem analysis of incidents and implement preventive measures

      The skills you'll need

      We’re looking for someone with strong knowledge of reliability systems thinking and experience of software engineering. You’ll need experience of using a data-driven, scientific approach to fact finding. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes. Exposure to monitoring tools such as Splunk and to API monitoring would be advantageous.

      We’re also looking for:

      • Proven experience in a Site Reliability Engineering role or a similar capacity
      • Strong programming and scripting skills using Python, Go, Shell, Spring Boot, ReactJS, Redux, React Hooks, JavaScript, MS, Node.js, Express.js, HTML, CSS and Sass
      • Expertise in designing and implementing monitoring and alerting solutions using Grafana or ELK stack
      • Experience in Continuous Integration/Continuous Deployment (CI/CD) tools such as GitLab, Ansible, DWS or Consul
      • Proficiency in infrastructure as code using Terraform or Ansible

      Hours

      35

      Job Posting Closing Date:

      13/02/2024

  • About the company

      NatWest Group plc is a majority state-owned British banking and insurance holding company based in Edinburgh, Scotland.