Skip to main content

User account menu

  • Log in
DBS-Logo

Database Group Leipzig

within the department of computer science

ScaDS-Logo Logo of the University of Leipzig

Main navigation

  • Home
  • Study
    • Exams
      • Hinweise zu Klausuren
    • Courses
      • Current
    • Modules
    • LOTS-Training
    • Abschlussarbeiten
    • Masterstudiengang Data Science
    • Oberseminare
    • Problemseminare
    • Top-Studierende
  • Research
    • Projects
      • Benchmark datasets for entity resolution
      • FAMER
      • HyGraph
      • Privacy-Preserving Record Linkage
      • GRADOOP
    • Publications
    • Prototypes
    • Annual reports
    • Cooperations
    • Graduations
    • Colloquia
    • Conferences
  • Team
    • Erhard Rahm
    • Member
    • Former employees
    • Associated members
    • Gallery

ScaDS Research on Scalable Privacy-preserving Record Linkage

Breadcrumb

  • Home
  • Research
  • Publications
  • ScaDS Research on Scalable Privacy-preserving Record Linkage

Franke, M. ; Gladbach, M. ; Sehili, Z. ; Rohde, F. ; Rahm, E.

ScaDS Research on Scalable Privacy-preserving Record Linkage

Datenbank-Spektrum

2019 / 02

Paper

Futher information: https://doi.org/10.1007/s13222-019-00305-y

Abstract

Privacy-preserving record linkage (PPRL) supports the matching and integration of person-related data, e.g., o n patients or customers without compromising privacy. It is based on the encoding of sensitive attribute values needed for matching and often involves trusted parties for linkage. We report on recent research results from the Big Data center ScaDS Dresden/Leipzig to improve the efficien cy, scalability and quality of PPRL, and to apply PPRL in the medical domain. In particular, we present the use of pivot-based filtering techniques and LSH (locality-sensitive hashing)-based blocking to reduce the number of comparisons. Furthermore, we report on parallel linkage implementations based on Apache Flink supporting scalability to millions of records.

Recent publications

  • 2025 / 8: Slice it up: Unmasking User Identities in Smartwatch Health Data
  • 2025 / 6: SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage
  • 2025 / 5: Federated Learning With Individualized Privacy Through Client Sampling
  • 2025 / 3: Assessing the Impact of Image Dataset Features on Privacy-Preserving Machine Learning
  • 2025 / 3: Automated Configuration of Schema Matching Tools: A Reinforcement Learning Approach

Footer menu

  • Directions
  • Contact
  • Impressum