Data Engineer – Legal Data Integration
Noxtua · Berlin
Stellenbeschreibung
About the role
As a Data Engineer you will join Noxtua’s Data Expansion Squad, responsible for turning heterogeneous legal data from multiple jurisdictions into a unified, high‑quality foundation. You will work closely with AI, engineering and legal experts to build reliable, scalable pipelines that feed downstream search and AI‑supported workflows.
Key responsibilities
- Design, build and optimise end‑to‑end ETL pipelines for legal data, covering cleaning, transformation, chunking, validation, embedding and ingestion into vector databases.
- Parse, validate, normalise and transform XML‑based legal feeds into internal schemas and unified document formats.
- Develop and maintain data models and storage schemas that support continuously updated, large‑scale datasets.
- Coordinate data handover and integration from internal and external providers, including APIs and web‑scraping pipelines.
- Implement and continuously refine metadata enrichment strategies to maximise searchability.
Required profile
- At least 2 years of professional experience in data engineering with successfully deployed projects.
- Strong Python programming skills and experience designing robust data pipelines.
- Solid understanding of data modelling, quality, filtering, validation and consistency.
- Familiarity with containerisation (Docker), CI/CD pipelines and version control (Git).
- Good grasp of data structures, algorithms, system design principles and software‑engineering best practices.
- English proficiency at C2 level.
Required skills
- Python
- ETL pipeline development
- Data modelling
- XML parsing and transformation
- Docker
- CI/CD
- Git
- Graph databases
- Vector databases
- NLP model deployment (bonus)
Questions fréquentes
Warum melden Sie diesen Job?
In 30 Sekunden bewerben
Geben Sie Ihre E‑Mail ein, um sich zu bewerben. Ein Konto wird automatisch erstellt.
Durch das Fortfahren akzeptieren Sie unsere Nutzungsbedingungen.
Sie haben bereits ein Konto? Anmelden
Veröffentlicht vor 18 Stunden
Läuft ab in 1 Monat
4 Ansichten · 0 Bewerbungen
Steigern Sie Ihre Chancen
Laden Sie Ihren Lebenslauf hoch – wir vermitteln Sie an passende Stellen.
Ihr Lebenslauf wird analysiert...
Noxtua
Berlin
Related job offers
-
Backend Engineer (Remote)
YO IT Consulting Berlin -
Junior IT Systemadministrator (m/w/x)
YOC Berlin -
Quality Assurance Operator (m/w/d)
Bundesdruckerei-Gruppe Berlin -
PhD position in Data Science for Life, Earth & Energy
HDS-LEE | Helmholtz School for Data Science in Life, Earth and Energy Juliers -
Working Student – IT (On‑site Support)
Coriolis Pharma Munich et périphérie