KeyStep

Senior Site Reliability Engineer - Observability (x/f/m)

Doctolib
Berlin, Germany
24 days ago
contract

Skills & Technologies

Site ReliabilityGitOpsSOLIDElasticsearchAWSAzureGoogle CloudDockerKubernetesHelmArgoCDPrometheusDatadogOpenTelemetryCloudStrategy

Job Description

Doctolib’s Engineering environment is rich and we are building innovative products and features aiming each day to ease doctors' and patient life. We are looking for a Senior Site Reliability Engineer to keep Doctolib production systems running smoothly. You will also be a key-player to support the exponential growth of Doctolib services.

What you will do

As a Senior Site Reliability Engineer within the Core Reliability & Observability team, you will play a pivotal role in shaping the company’s observability strategy and ensuring our platform remains reliable, debuggable, and scalable. This role sits at the intersection of infrastructure, developer experience, and product engineering, with a particular focus on building and evolving the foundations of logging, metrics, tracing, and alerting across the organization.

Lead the observability strategy across the platform, with an emphasis on building scalable, developer-friendly logging and tracing capabilities.

Identify and lead large-scale cross-cutting reliability initiatives, including improvements to our incident detection, response, and postmortem analysis capabilities.

Take part in the on-call rotation, and actively contribute to improving our on-call experience by refining alerting, reducing noise, and ensuring actionable telemetry.

Who you are

You could be our next team mate if you

Have a solid hands-on experience (3y+) on a large-scale production platform

Have proven experience with cloud platforms such as AWS, Azure or Google Cloud

Have solid understanding of containerization and orchestration technologies (Docker and Kubernetes)

Have a strong understanding of Helm for managing Kubernetes manifests and ArgoCD for GitOps workflows

Deep expertise in observability tooling and architecture, such as

Logging: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector

Tracing: OpenTelemetry or proprietary APMs

Metrics: Prometheus, Thanos, Datadog, or equivalent

Have proficiency in at least o

Company & Role Analysis

JobSeeker+
Likely perks
Private MedicalPension25+ Days HolidayStock OptionsLearning BudgetFlexible Hours
Culture & working style

Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.

Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

Unlock the full analysis for this job
Sign in to unlock →

Similar roles

See more
People First Ltd
London, UK
£54,000 – £76,800
Full-time
4 days ago

Salary: £54,000 - 76,800 per year Requirements: Bachelors degree in Computer Science, Engineering, or equivalent practical experience. Recen…

View Job
Cavendish Professionals
Bristol, UK
£55,676 – £55,676
Full-time
6 days ago

We are recruiting for a Senior Engineer to join a respected UK civil engineering contractor, working on an project in Bristol . This is an e…

View Job
Realm
London, UK
£69,287 – £69,287
Full-time
6 days ago

High-growth infrastructure company focused on delivering large-scale compute, data centre capacity, and power solutions for advanced machine…

View Job
Realm
London, UK
£68,610 – £68,610
Full-time
6 days ago

High-growth infrastructure company focused on delivering large-scale compute, data centre capacity, and power solutions for advanced machine…

View Job
Apply NowApply with CV Improver