Role Description
We’re looking for a Site Reliability Engineer (SRE) to join our Logging & Monitoring squad. You’ll ensure the reliability, scalability, and security of our observability platforms, while building automated solutions that deliver seamless experiences for our customers.
What you’ll do
Take end-to-end ownership of designing, deploying, operating, and continuously improving the performance and fault-tolerance of large-scale, multi-cloud solutions.Ensure system security, data integrity, and high availability across all logging and monitoring platforms.Develop and enhance monitoring, logging, and alerting frameworks to enable proactive issue detection and swift resolution.Collaborate with IT and engineering teams to onboard services onto our platforms and consult on best SRE practices.Stay ahead of technology trends and evaluate new solutions that strengthen our observability ecosystem.Crea...