KeyStep

Senior Software Engineer, AI Platform - Evaluation & Annotation

Datadog
Paris, France
6 days ago
full-time

Skills & Technologies

CI/CDDatadogGenerative AIStrategyTrainingAssessmentAI

Job Description

The AI Platform team at Datadog builds the infrastructure that powers the next generation of generative AI features across our products.

As a Senior Software Engineer on the Evaluation and Annotation team, you will design and evolve the systems that define and measure AI quality at scale. This includes building evaluation pipelines, model performance monitoring, and annotation workflows that assess correctness, safety, bias, and reliability across production use cases.

Your work will directly shape how Datadog ships and maintains trustworthy AI capabilities. You will partner closely with product, ML, and infrastructure teams to define quality standards, integrate evaluation systems with our observability platform, and build human-in-the-loop feedback mechanisms that continuously improve model behavior.

At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.

What You’ll Do

Design and scale robust evaluation systems to measure the performance and reliability of LLMs and AI agents across Datadog’s product ecosystem

Lead efforts to build human-in-the-loop and automated annotation pipelines for model assessment, ensuring high-quality training and feedback data

Define and implement continuous evaluation workflows in CI/CD and production environments to monitor model behavior in real time

Analyze model outputs for correctness, bias, safety, and reliability and translate insights into actionable improvements

Collaborate cross-functionally with Applied Scientists, Researchers, product managers, and platform engineers to establish best practices for responsible AI

Mentor team members and contribute to long-term technical strategy focused on AI quality, trust, and safety

Who You Are

You have 6+ years of experience building large-scale distributed sy

Company & Role Analysis

JobSeeker+
Likely perks
Private MedicalPension25+ Days HolidayStock OptionsLearning BudgetFlexible Hours
Culture & working style

Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.

Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

Unlock the full analysis for this job
Sign in to unlock →

Similar roles

See more
MongoDB
Dublin, Ireland
Full-time
about 4 hours ago

The MongoDB Query Execution Team is hiring software engineers who want to join us in developing a fast and modular distributed query system.…

View Job
SRT Marine Systems PLC
Bristol, UK
£45,000 – £70,000
Full-time
about 8 hours ago

SRT Marine Systems plc (SRT) is a market leader in the domain of international marine surveillance technology and systems. We are a respecte…

View Job
CleanTech Talent
Glasgow, UK
£68,854 – £68,854
Full-time
about 13 hours ago

Senior Spacecraft Software Engineer Glasgow, UK - Hybrid £85,000 - £100,000 Depending on experience & Qualifications We are proud to be supp…

View Job
Avento Immobilien Services GmbH
Munich, Germany
Full-time
Remote
about 17 hours ago

Avento is a property management company in Munich. We manage buildings on behalf of property owners. We're building our own software platfor…

View Job
Government Communications Headquarters
Manchester, UK
£50,354 – £60,036
Full-time
about 21 hours ago

Salary: £50,354 - 60,036 per year Requirements: Proven experience in modern software frameworks and languages such as Golang, Java, JavaScri…

View Job
Apply NowApply with CV Improver