KeyStep

Senior Data Engineer Python/GCP (x/f/m)

Doctolib
Paris, France
3 days ago
contract

Skills & Technologies

PythonScalabilityNoSQLGCPGoogle CloudCloudMachine LearningBigQueryLLMRAGEmbeddingsSAFeDeploymentTrainingAIData QualityTransformation

Job Description

Your Impact

We are looking for a Senior Data Engineer to join the AI Team working on our AI Medical Companion.

Your mission will be to build and optimize the data foundations that power safe, scalable, and impactful AI models. You will work on data infrastructure for LLM, VLM, and RAG-based systems, ensuring our engineers and data scientists can train, evaluate, and deploy AI models efficiently on high-quality, well-structured, and compliant data. Your work will directly support health professionals in delivering better care while improving their work-life balance, ultimately impacting 80 million patients and 400,000 healthcare professionals across Europe.

Working in the tech team at Doctolib means building innovative products and features to improve the daily lives of care teams and patients.

What you'll do

Your responsibilities include but are not limited to

Design, build, and maintain scalable data pipelines on Google Cloud Platform (GCP) for AI and machine learning use cases

Implement data ingestion and transformation frameworks that power Retrieval systems and training datasets for LLMs and multimodal models

Architect and manage NoSQL and Vector Databases to store and retrieve embeddings, documents, and model inputs efficiently

Collaborate with ML and platform teams to define data schemas, partitioning strategies, and governance rules that ensure privacy, scalability, and reliability

Integrate unstructured and structured data sources (text, speech, image, documents, metadata) into unified data models ready for AI consumption

Optimize performance and cost of data pipelines using GCP native services (BigQuery, Dataflow, Pub/Sub, Cloud Storage, Vertex AI)

Contribute to data quality and lineage frameworks, ensuring AI models are trained on validated, auditable, and compliant datasets

Continuously evaluate and improve our data stack to accelerate AI experimentation and deployment

Who you are

Before you read on: if you don't have the exact profile des

Company & Role Analysis

JobSeeker+
Likely perks
Private MedicalPension25+ Days HolidayStock OptionsLearning BudgetFlexible Hours
Culture & working style

Neutral 2–4 sentence summary of what working at this company is like, drawn from public reviews and press coverage. Tone, collaboration style, pace, benefits highlights.

Market salary range

£45,000 – £60,000 (Glassdoor, Levels.fyi, 2025)

Unlock the full analysis for this job
Sign in to unlock →

Similar roles

See more
Asana
Warsaw, Poland
Full-time
about 20 hours ago

As part of the Enterprise Data & Intelligence (EDI) team, you will play a key role in enabling company-wide data-informed decision making. B…

View Job
Intercom
London, UK
Full-time
about 21 hours ago

Fin is the AI Customer Agent company on a mission to help businesses provide perfect customer experiences. Our AI Agent Fin is the highest-…

View Job
Intercom
London, UK
Full-time
about 21 hours ago

Fin is the AI Customer Agent company on a mission to help businesses provide perfect customer experiences. Our AI Agent Fin is the highest-…

View Job
HelloFresh
Warszawa, Masovian Voivodeship, Poland
Full-time
2 days ago

Work with HelloFresh in Warsaw and its HelloTech organisation, HelloFresh’s global technology backbone with more than 1000 people, building…

View Job
Apply NowApply with CV Improver