KeyStep

Senior Data Engineer Python/GCP (x/f/m)

Doctolib
Paris, France
21 days ago
contract

Skills & Technologies

PythonScalabilityNoSQLGCPGoogle CloudCloudMachine LearningBigQueryLLMRAGEmbeddingsSAFeDeploymentTrainingAIData QualityTransformation

Job Description

Your Impact

We are looking for a Senior Data Engineer to join the AI Team working on our AI Medical Companion.

Your mission will be to build and optimize the data foundations that power safe, scalable, and impactful AI models. You will work on data infrastructure for LLM, VLM, and RAG-based systems, ensuring our engineers and data scientists can train, evaluate, and deploy AI models efficiently on high-quality, well-structured, and compliant data. Your work will directly support health professionals in delivering better care while improving their work-life balance, ultimately impacting 80 million patients and 400,000 healthcare professionals across Europe.

Working in the tech team at Doctolib means building innovative products and features to improve the daily lives of care teams and patients.

What you'll do

Your responsibilities include but are not limited to

Design, build, and maintain scalable data pipelines on Google Cloud Platform (GCP) for AI and machine learning use cases

Implement data ingestion and transformation frameworks that power Retrieval systems and training datasets for LLMs and multimodal models

Architect and manage NoSQL and Vector Databases to store and retrieve embeddings, documents, and model inputs efficiently

Collaborate with ML and platform teams to define data schemas, partitioning strategies, and governance rules that ensure privacy, scalability, and reliability

Integrate unstructured and structured data sources (text, speech, image, documents, metadata) into unified data models ready for AI consumption

Optimize performance and cost of data pipelines using GCP native services (BigQuery, Dataflow, Pub/Sub, Cloud Storage, Vertex AI)

Contribute to data quality and lineage frameworks, ensuring AI models are trained on validated, auditable, and compliant datasets

Continuously evaluate and improve our data stack to accelerate AI experimentation and deployment

Who you are

Before you read on: if you don't have the exact profile des

Company & Role Analysis

JobSeeker+
Likely perks
Private MedicalPension25+ Days HolidayStock OptionsLearning BudgetFlexible Hours
Culture & working style

Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.

Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

Unlock the full analysis for this job
Sign in to unlock →

Similar roles

See more
Adyen
Amsterdam, Netherlands
Full-time
about 13 hours ago

This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft…

View Job
Typeform
Germany (Remote) ; Ireland (Remote); Netherlands (Remote) ; Spain (Remote) ; United Kingdom (Remote)
Full-time
Remote
1 day ago

Who we are Typeform is a refreshingly different form builder. We help over 150,000 businesses collect the data they need with forms, survey…

View Job
Onapsis
Biederbach Baden-Wurttemberg, Baden-Württemberg, Germany
Full-time
Remote
1 day ago

About the job The world's most critical--and at risk--business applications have been neglected for far too long. Onapsis eliminates this…

View Job
Skalar
Munich, Germany
Full-time
Remote
1 day ago

Build our data foundation from v1 -> v10. Join the early core team of experienced entrepreneurs (scaled to millions of users, $100m+ exits).…

View Job
Skalar
Munich, Germany
Full-time
Remote
1 day ago

Build our core AI capabilities from v1 -> v10. Join the early core team of experienced entrepreneurs (scaled to millions of users, $100m+ ex…

View Job
Apply NowApply with CV Improver