Senior AI Data Engineer (x/f/m)

Doctolib
Paris, France
about 1 month ago
contract

Skills & Technologies

Data Engineering, Scalability, NoSQL, GCP, Google Cloud, Cloud, Machine Learning, BigQuery, LLM, RAG, Embeddings, SAFe, Deployment, Training, AI, Data Quality, Transformation

Job Description

What you’ll do

At Doctolib, we're on a mission to transform healthcare through the power of AI. As a Senior Data Engineer, you'll play a key role in building and optimizing the data foundations within the AI Team to deliver safe, scalable, and impactful models.

You will join a dedicated team working on data infrastructure for LLM, VLM and RAG-based systems, powering our new AI Medical Companion.

Your work will ensure that our engineers and data scientists can train, evaluate, and deploy AI models efficiently on high-quality, well-structured, and compliant data.

Your responsibilities include, but are not limited to:

Ensure high standards of data quality for AI model inputs.

Design, build, and maintain scalable data pipelines on Google Cloud Platform (GCP) for AI and machine learning use cases.

Implement data ingestion and transformation frameworks that power retrieval systems and training datasets for LLMs and multimodal models.

Architect and manage NoSQL and vector databases to store and retrieve embeddings, documents, and model inputs efficiently.

Collaborate with ML and platform teams to define data schemas, partitioning strategies, and governance rules that ensure privacy, scalability, and reliability.

Integrate unstructured and structured data sources (text, speech, image, documents, metadata) into unified data models ready for AI consumption.

Optimize performance and cost of data pipelines using GCP-native services (BigQuery, Dataflow, Pub/Sub, Cloud Storage, Vertex AI).

Contribute to data quality and lineage frameworks, ensuring AI models are trained on validated, auditable, and compliant datasets.

Continuously evaluate and improve our data stack to accelerate AI experimentation and deployment.

Who you are

You could be our next teammate if you have:

Master’s or Ph.D. degree in Computer Science, Data Engineering, or a related field.

5+ years of experience in Data Engineering, ideally supporting AI or ML workloads.

Strong experience with the GCP
