At Braze, we have found our people. We’re a genuinely approachable, exceptionally kind, and intensely passionate crew. We seek to ignite th…
Senior Site Reliability Engineer - Observability (x/f/m)
Skills & Technologies
Job Description
Your Impact
We are looking for a Senior Site Reliability Engineer to join the Core Reliability & Observability team in Platform Engineering.
Your mission will be to shape Doctolib's observability strategy and ensure our platform remains reliable, debuggable, and scalable at a European scale. You will work in a feature team developing logging, metrics, tracing, and alerting capabilities, contributing directly to supporting 400,000 health professionals and 80 million patients in their daily healthcare journey.
Working in the tech team at Doctolib means building innovative products and features to improve the daily lives of care teams and patients.
What you'll do
Your responsibilities include but are not limited to
Lead the observability strategy across the platform, with an emphasis on building scalable, developer-friendly logging and tracing capabilities
Identify and lead large-scale cross-cutting reliability initiatives, including improvements to our incident detection, response, and postmortem analysis capabilities
Take part in the on-call rotation, and actively contribute to improving our on-call experience by refining alerting, reducing noise, and ensuring actionable telemetry
Who you are
Before you read on: if you don't have the exact profile described below, but you feel this job description matches your skill set, we still encourage you to apply.
You'll be a great fit if you
Have a solid hands-on experience (3y+) on a large-scale production platform
Have proven experience with cloud platforms such as AWS, Azure or Google Cloud
Have solid understanding of containerization and orchestration technologies (Docker and Kubernetes)
Have a strong understanding of Helm for managing Kubernetes manifests and ArgoCD for GitOps workflows
Have deep expertise in observability tooling and architecture, such as
Logging: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector
Tracing: OpenTelemetry or proprietary APMs
Metrics: Prometheus, Thanos,
Company & Role Analysis
JobSeeker+Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.
£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)
Similar roles
See moreSenior Site Reliability Engineer / Bristol, Remote / £75,000 to £95,000 Per Annum D.O.E. TwinStream was formed to bring together their colle…
Senior Site Reliability Engineer / Bristol, Remote / £75,000 to £95,000 Per Annum D.O.E. TwinStream was formed to bring together their colle…
Senior Site Reliability Engineer / Bristol, Remote / 75,000 to 95,000 Per Annum D.O.E. TwinStream was formed to bring together their collect…
Role: Senior Site Reliability Engineer (SRE) – Kubernetes / OpenShift Location: Remote -UK (possible paid occasional travel to TIG Secure si…
Role: Senior Site Reliability Engineer (SRE) – Kubernetes / OpenShift Location: Remote -UK (possible paid occasional travel to TIG Secure si…