
Senior Software Engineer, AI Platform - Evaluation & Annotation

Datadog
Paris, France
6 days ago
full-time

Skills & Technologies

CI/CD, Datadog, Generative AI, Strategy, Training, Assessment, AI

Job Description

The AI Platform team at Datadog builds the infrastructure that powers the next generation of generative AI features across our products.

As a Senior Software Engineer on the Evaluation and Annotation team, you will design and evolve the systems that define and measure AI quality at scale. This includes building evaluation pipelines, model performance monitoring, and annotation workflows that assess correctness, safety, bias, and reliability across production use cases.

Your work will directly shape how Datadog ships and maintains trustworthy AI capabilities. You will partner closely with product, ML, and infrastructure teams to define quality standards, integrate evaluation systems with our observability platform, and build human-in-the-loop feedback mechanisms that continuously improve model behavior.

At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.

What You’ll Do

Design and scale robust evaluation systems to measure the performance and reliability of LLMs and AI agents across Datadog’s product ecosystem

Lead efforts to build human-in-the-loop and automated annotation pipelines for model assessment, ensuring high-quality training and feedback data

Define and implement continuous evaluation workflows in CI/CD and production environments to monitor model behavior in real time

Analyze model outputs for correctness, bias, safety, and reliability, and translate insights into actionable improvements

Collaborate cross-functionally with applied scientists, researchers, product managers, and platform engineers to establish best practices for responsible AI

Mentor team members and contribute to long-term technical strategy focused on AI quality, trust, and safety

Who You Are

You have 6+ years of experience building large-scale distributed sy

Company & Role Analysis

Likely perks
Private Medical, Pension, 25+ Days Holiday, Stock Options, Learning Budget, Flexible Hours
Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

