Skip to main content

User account menu

  • Log in
DBS-Logo

Database Group Leipzig

within the department of computer science

ScaDS-Logo Logo of the University of Leipzig

Main navigation

  • Home
  • Study
    • Exams
      • Hinweise zu Klausuren
    • Courses
      • Current
    • Modules
    • LOTS-Training
    • Abschlussarbeiten
    • Masterstudiengang Data Science
    • Oberseminare
    • Problemseminare
    • Top-Studierende
  • Research
    • Projects
      • Benchmark datasets for entity resolution
      • FAMER
      • HyGraph
      • Privacy-Preserving Record Linkage
      • GRADOOP
    • Publications
    • Prototypes
    • Annual reports
    • Cooperations
    • Graduations
    • Colloquia
    • Conferences
  • Team
    • Erhard Rahm
    • Member
    • Former employees
    • Associated members
    • Gallery

Dr. Lars Kolb

Breadcrumb

  • Home
  • Team
  • Dr. Lars Kolb
  • Dr. Lars Kolb
Google Scholar profile

Selected Talks

  • Effiziente MapReduce-Parallelisierung von Entitity Resolution-Workflows,
    PhD Defense, Leipzig, December 2014
  • Don’t Match Twice: Redundancy-free Similarity Computation with MapReduce
    2nd International Workshop on Data Analytics in the Cloud (DanaC), New York, June 2013
  • Dedoop: Efficient Deduplication with Hadoop
    Data Integration Day, Leipzig, September 2012
  • Learning-based Entity Resolution with MapReduce
    3rd International Workshop on Cloud Data Management (CloudDB), Glasgow, October 2011
  • Parallel Sorted Neighborhood Blocking with MapReduce
    German Database Conference “GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web” (BTW), Kaiserslautern, March 2011
  • Data Partitioning for Parallel Entity Matching
    8th International Workshop on Quality in Databases (QDB), Singapore, September 2010

Reviewer

  • BTW 2011, BTW 2013
  • SIGMOD 2012
  • ICDE 2012
  • WebDB 2013
  • IJSWIS 2013
  • DMC 2013, DMC 2014

Teaching

TermExercisePractical courseSeminarLecture
Fall 2009/10Database Systems 1 Cloud Data ManagementParallel Database Systems (guest lecture)
Spring 2010Database Systems 2Database
Data Warehouse
  
Fall 2010/11Database Systems 1 Web Data Integration 
Spring 2011Database Systems 2Database Cloud Data Management (guest lecture)
Fall 2011/12Database Systems 1Data WarehouseNoSQL Databases 
Spring 2012Database Systems 2Database  
Fall 2012/13Database Systems 1Data WarehouseLarge-scale Data Analytics 
Spring 2013Database Systems 2Database  
Fall 2013/14Database Systems 1MapReduceNew Trends in Big DataCloud Data Management
Spring 2014Database Systems 2Database Systems 2
NoSQL Databases
 NoSQL Databases

Bachelor’s & Master’s Thesis Supervision

YearStudentTypeTitle
2012Axel FischerB. Sc.Implementierung eines File Managers für das Hadoop Distributed Filesystem und Realisierung einer MapReduce Workflow Submission-Komponente
2013Sergej SintschilinB. Sc.Wiederverwendung berechneter Matchergebnisse für MapReduce-basiertes Object Matching
2013Ziad SehiliM. Sc.Evaluierung und Erweiterung von MapReduce-Algorithmen zur Berechnung der transitiven Hülle ungerichteter Graphen für Entity Resolution Workflows
2013Dan HäberleinB. Sc.Migration und Extraktion von Datensätzen mittels spaltenorientierter Datenbanken am Beispiel von Apache HBase
2014Hans-Henning KochM. Sc.Evaluation of Backends for the Use in a Horizontally Scalable Version of ipoque’s Net Reporter

Publications (14)

Dateien Cover Beschreibung Jahr
Privacy Preserving Record Linkage with PPJoin
Sehili, Z. ; Kolb, L. ; Borgs, C. ; Schnell, R. ; Rahm, E.
Proc. of 16. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW), 2015
2015 / 3
Effiziente MapReduce-Parallelisierung von Entity Resolution-Workflows
Kolb, L.
Dissertation, Universität Leipzig
2014 / 9
Iterative Computation of Connected Graph Components with MapReduce
Kolb, L. ; Sehili, Z. ; Rahm, E.
Datenbank-Spektrum 14 (2), 2014
2014 / 7
Optimizing Similarity Computations for Ontology Matching - Experiences from GOMMA
Hartung, M. ; Kolb, L. ; Groß, A. ; Rahm, E.
Proc. 9th Intl. Conference on Data Integration in the Life Sciences (DILS), 2013
2013 / 7
Don't Match Twice: Redundancy-free Similarity Computation with MapReduce
Kolb, L. ; Thor, A. ; Rahm, E.
Proc. 2nd Intl. Workshop on Data Analytics in the Cloud (DanaC), 2013
2013 / 6
When to Reach for the Cloud: Using Parallel Hardware for Link Discovery
Kolb, L. ; Heino, N. ; Hartung, M. ; Auer, S. ; Rahm, E.
Proc. 10th Intl. Extended Semantic Web Conference (ESWC), 2013
2013 / 5
Parallel Entity Resolution with Dedoop
Kolb, L. ; Rahm, E.
Datenbank-Spektrum 13 (1), 2013
2013 / 2
Dedoop: Efficient Deduplication with Hadoop
Kolb, L. ; Thor, A. ; Rahm, E.
Proc. 38th Intl. Conference on Very Large Databases (VLDB) / Proc. of the VLDB Endowment 5(12), 2012
2012 / 8
Load Balancing for MapReduce-based Entity Resolution
Kolb, L. ; Thor, A. ; Rahm, E.
Proc. 28th Intl. Conference on Data Engineering (ICDE), 2012
2012 / 4
Multi-pass Sorted Neighborhood Blocking with MapReduce
Kolb, L. ; Thor, A. ; Rahm, E.
Computer Science - Research and Development 27(1), 2012
2012 / 2

Pagination

  • Current page 1
  • Page 2
  • Next page Next ›
  • Last page Last »

Recent publications

  • 2025 / 9: Generating Semantically Enriched Mobility Data from Travel Diaries
  • 2025 / 8: Slice it up: Unmasking User Identities in Smartwatch Health Data
  • 2025 / 7: MPGT: Multimodal Physics-Constrained Graph Transformer Learning for Hybrid Digital Twins
  • 2025 / 6: Leveraging foundation models and goal-dependent annotations for automated cell confluence assessment
  • 2025 / 6: SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage

Footer menu

  • Directions
  • Contact
  • Impressum