Lead Site Reliability Engineer
0000050007 Royal Bank of Canada • Toronto, Canada
Role Description
Job Description
What is the role?
As a Lead SRE, you will play a critical role in ensuring the availability, reliability, scalability, and performance of key applications, balancing production support responsibilities with continuous improvement initiatives. The ideal candidate will have deep expertise in agile application development, operations, technology lifecycle management, infrastructure, and automation to reduce toil, improve observability, resolve complex production incidents, and address underlying root causes.
What will you do?
Perform the application production support role, including off-hours support.
Develop SRE solutions (monitoring and alerting, machine-learning anomaly detection, self-healing, and reliability testing).
Run the production environment by monitoring availability and taking a holistic view of system health.
Build software and systems to man...