KeyStep

Site Reliability Engineer - Observability

N26
Berlin, Germany
about 4 hours ago
full-time

Skills & Technologies

PythonGolangPlatform EngineeringSite ReliabilityMicroservicesPrometheusGrafanaOpenTelemetryCloudComplianceMakeAutomationResilience

Job Description

About the opportunity

We are seeking a Site Reliability Engineer to join the Observability group inside our Platform Engineering domain.

Platform Engineering’s goal is to provide easy to use, self-service platforms to enable other segments to easily build, deploy and monitor their business applications. And Observability’s role in that part of the company is to provide our users with end-to-end observability that’s easy to use.

As one of the first banks completely hosted in the cloud, our security, resilience and productivity standards require a motivated and well balanced team. We are using a modern technology stack to match our principles when it comes to providing the framework for our development team, the company and our customers.

In this role, you will

In Observability you’ll build the tools for monitoring and measuring infrastructure, microservices, and sometimes totally unique workloads. You’ll put the developer experience at the front of your mind and implementations, and you’ll contribute deeply to understanding and preventing incidents through tooling, automation, and people-centered processes. All the while you’ll be thinking like an engineer. That means using automation to speed up repetitive tasks and to contribute to the security and compliance of the tech stack that you run and support.

What you need to be successful

You need to be well-versed in the basics building blocks of observability: metrics, logs, and traces. You should also be familiar with the tools we need to extract (think things like Prometheus, StatsD, OpenTelemetry libraries), transform (tools like Vector, Beats, FluentBit), and load (OpenSearch, Grafana, etc.) this data in ways that your fellow engineers can make sense of it. Then, you should be comfortable with helping your colleagues understand this data so they can measure and monitor their applications and recover quickly for incidents..

Next, you’ll need experience with at least one glue language like GoLang or Python.

Company & Role Analysis

JobSeeker+
Likely perks
Private MedicalPension25+ Days HolidayStock OptionsLearning BudgetFlexible Hours
Culture & working style

Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.

Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

Unlock the full analysis for this job
Sign in to unlock →

Similar roles

See more
PostHog
Remote
Full-time
Remote
1 day ago

ABOUT POSTHOG We're shipping every product that companies need https://posthog.com/handbook/why-does-posthog-exist to run their business fr…

View Job
NSD
London, UK
£60,000 – £60,000
Full-time
2 days ago

Salary: £60,000 - 60,000 per year Requirements: Active SC Clearance and eligibility for DV Clearance Experience in an Azure focused Site Rel…

View Job
People First Ltd
London, UK
£54,000 – £76,800
Full-time
2 days ago

Salary: £54,000 - 76,800 per year Requirements: Bachelors degree in Computer Science, Engineering, or equivalent practical experience. Recen…

View Job
Adecco
London, UK
£55,000 – £70,000
Full-time
2 days ago

Salary: £55,000 - 70,000 per year Requirements: Proven leadership experience in Site Reliability Engineering or senior platform engineering…

View Job
CBSbutler Holdings Limited trading as CBSbutler
London, UK
£60,000 – £63,000
Full-time
2 days ago

Salary: £60,000 - 63,000 per year Requirements: Strong background in software engineering for large-scale distributed systems Proficiency in…

View Job
Apply NowApply with CV Improver