Senior Software Engineer, AI Platform - Evaluation & Annotation
Job Description
The AI Platform team at Datadog builds the infrastructure that powers the next generation of generative AI features across our products.
As a Senior Software Engineer on the Evaluation and Annotation team, you will design and evolve the systems that define and measure AI quality at scale. This includes building evaluation pipelines, model performance monitoring, and annotation workflows that assess correctness, safety, bias, and reliability across production use cases.
Your work will directly shape how Datadog ships and maintains trustworthy AI capabilities. You will partner closely with product, ML, and infrastructure teams to define quality standards, integrate evaluation systems with our observability platform, and build human-in-the-loop feedback mechanisms that continuously improve model behavior.
At Datadog, we value our office culture: the relationships it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace so that our employees can create the work-life harmony that best fits them.
What You’ll Do
Design and scale robust evaluation systems to measure the performance and reliability of LLMs and AI agents across Datadog’s product ecosystem
Lead efforts to build human-in-the-loop and automated annotation pipelines for model assessment, ensuring high-quality training and feedback data
Define and implement continuous evaluation workflows in CI/CD and production environments to monitor model behavior in real time
Analyze model outputs for correctness, bias, safety, and reliability, and translate insights into actionable improvements
Collaborate cross-functionally with applied scientists, researchers, product managers, and platform engineers to establish best practices for responsible AI
Mentor team members and contribute to long-term technical strategy focused on AI quality, trust, and safety
Who You Are
You have 6+ years of experience building large-scale distributed sy
Company & Role Analysis
£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)