RPE - System Reliability Engineering Specialist (Hybrid)
860 Morgan Stanley Svcs Canada Co • Montreal, Canada
Role Description
Systems Reliability Engineering (SRE) is a production‑oriented discipline focused on improving the availability, observability, scalability, performance and reliability of systems and services behind technology products across Morgan Stanley. This role is part of the Reliability & Production Engineering (RPE) organization within the Technology division.
Responsibilities
- Work closely with engineering and development teams to design, build and maintain systems; contribute to decisions on product selection, schema design and query tuning.
- Troubleshoot issues across the entire stack: hardware, software, application and network.
- Identify and drive opportunities to improve automation for our platforms; scope and create automation for deployment, management and visibility of services.
- Proactively identify and address systems reliability risks.
- Represent the RPE organization in design reviews and operational readiness exercises for new and existing services.