KeyStep

Senior Software Engineer, AI Platform - Evaluation & Annotation

Datadog
Paris, France
about 6 hours ago
full-time

Skills & Technologies

CI/CDDatadogGenerative AIStrategyTrainingAssessmentAI

Job Description

The AI Platform team at Datadog builds the infrastructure that powers the next generation of generative AI features across our products.

As a Senior Software Engineer on the Evaluation and Annotation team, you will design and evolve the systems that define and measure AI quality at scale. This includes building evaluation pipelines, model performance monitoring, and annotation workflows that assess correctness, safety, bias, and reliability across production use cases.

Your work will directly shape how Datadog ships and maintains trustworthy AI capabilities. You will partner closely with product, ML, and infrastructure teams to define quality standards, integrate evaluation systems with our observability platform, and build human-in-the-loop feedback mechanisms that continuously improve model behavior.

At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.

What You’ll Do

Design and scale robust evaluation systems to measure the performance and reliability of LLMs and AI agents across Datadog’s product ecosystem

Lead efforts to build human-in-the-loop and automated annotation pipelines for model assessment, ensuring high-quality training and feedback data

Define and implement continuous evaluation workflows in CI/CD and production environments to monitor model behavior in real time

Analyze model outputs for correctness, bias, safety, and reliability and translate insights into actionable improvements

Collaborate cross-functionally with Applied Scientists, Researchers, product managers, and platform engineers to establish best practices for responsible AI

Mentor team members and contribute to long-term technical strategy focused on AI quality, trust, and safety

Who You Are

You have 6+ years of experience building large-scale distributed sy

Company & Role Analysis

JobSeeker+
Likely perks
Private MedicalPension25+ Days HolidayStock OptionsLearning BudgetFlexible Hours
Culture & working style

Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.

Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

Unlock the full analysis for this job
Sign in to unlock →

Similar roles

See more
Datadog
Bordeaux, France
Full-time
about 6 hours ago

The Data Science team designs and builds algorithmically driven features in the Datadog app. We work across a range of applications, primari…

View Job
Datadog
Madrid, Spain
Full-time
about 6 hours ago

At Datadog, we leverage AI across our observability platform to improve monitoring, speed up incident resolution, and ensure data reliabilit…

View Job
Datadog
Paris, France
Full-time
about 6 hours ago

The AI Platform owns Datadog’s entire AI stack—everything from distributed training infrastructure (for our SOTA models) to the frameworks t…

View Job
Datadog
Madrid, Spain
Full-time
about 12 hours ago

At Datadog, we leverage AI across our observability platform to improve monitoring, speed up incident resolution, and ensure data reliabilit…

View Job
Apply NowApply with CV Improver