At Braze, we have found our people. We’re a genuinely approachable, exceptionally kind, and intensely passionate crew. We seek to ignite th…
Senior Software Engineer, AI Platform - Evaluation & Annotation
Skills & Technologies
Job Description
The AI Platform team at Datadog builds the infrastructure that powers the next generation of generative AI features across our products.
As a Senior Software Engineer on the Evaluation and Annotation team, you will design and evolve the systems that define and measure AI quality at scale. This includes building evaluation pipelines, model performance monitoring, and annotation workflows that assess correctness, safety, bias, and reliability across production use cases.
Your work will directly shape how Datadog ships and maintains trustworthy AI capabilities. You will partner closely with product, ML, and infrastructure teams to define quality standards, integrate evaluation systems with our observability platform, and build human-in-the-loop feedback mechanisms that continuously improve model behavior.
At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.
What You’ll Do
Design and scale robust evaluation systems to measure the performance and reliability of LLMs and AI agents across Datadog’s product ecosystem
Lead efforts to build human-in-the-loop and automated annotation pipelines for model assessment, ensuring high-quality training and feedback data
Define and implement continuous evaluation workflows in CI/CD and production environments to monitor model behavior in real time
Analyze model outputs for correctness, bias, safety, and reliability and translate insights into actionable improvements
Collaborate cross-functionally with Applied Scientists, Researchers, product managers, and platform engineers to establish best practices for responsible AI
Mentor team members and contribute to long-term technical strategy focused on AI quality, trust, and safety
Who You Are
You have 6+ years of experience building large-scale distributed sy
Company & Role Analysis
JobSeeker+Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.
£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)
Similar roles
See moreThe MongoDB Customer Observability Team is a diverse group of contributors working together to help our users manage MongoDB at global scale…
About us At GoCardless we believe bank payments are the best way to pay and get paid. We also believe that bank account data is a powerful…
About Us At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks tha…
Join Apaleo and Shape the Future of Hospitality Tech! Apaleo is the world's most open, API-first property management platform powering the n…
Senior Software Architect (m/f/d)
Introduction We are smartmicro, the leading specialist in high-performance automotive and traffic radar- and radar/camera hybrid sensor tech…