Senior Software Engineer - Linux Kernel / GPU Monitoring
Job Description
The eBPF Platform team owns the shared eBPF infrastructure inside the Datadog Agent and is responsible for its reliability, performance, and evolution across a wide variety of Linux distributions and kernel versions. We build tooling and agent functionality for product teams that use eBPF (Network Performance Monitoring, Universal Service Monitoring, Cloud Workload Security, GPU Monitoring), enable new teams exploring eBPF, and centralize deep kernel expertise across the organization. The team contributes to open source projects such as btfhub and cilium/ebpf.
Datadog is investing heavily in GPU Monitoring to give customers deep visibility into GPU utilization, health, and performance across their infrastructure. The eBPF Platform team builds the agent-side foundation that makes this possible, from eBPF programs that capture GPU activity at the kernel level, to the metrics pipelines and validation infrastructure that ensure data quality at scale.
In this role, you will work at the intersection of eBPF, the Linux kernel, and GPU infrastructure. You'll contribute to GPU Monitoring capabilities within the Datadog Agent while also working across the broader eBPF platform, investigating production incidents, improving reliability, and helping shape the architecture of one of the most widely deployed eBPF solutions in the industry.
At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.
What You’ll Do
Contribute to GPU Monitoring feature development end-to-end, from ideation to implementation within the Datadog Agent
Build and maintain shared eBPF functionality for product teams to use in their eBPF-based products
Investigate and debug complex production issues that span the kernel, eBPF programs, and agent runtime
Research, prototype, develop, and document solutions to ha
Company & Role Analysis
Estimated salary range: £45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)