
Dr. Lars Kolb

Contact

  • E-Mail:

Research

Publications

Google Scholar profile

Sehili, Z.; Kolb, L.; Borgs, C.; Schnell, R.; Rahm, E.
Privacy Preserving Record Linkage with PPJoin
Proc. 16th GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW), 2015
2015-03

Kolb, L.
Effiziente MapReduce-Parallelisierung von Entity Resolution-Workflows
Dissertation, Universität Leipzig
2014-09

Kolb, L.; Sehili, Z.; Rahm, E.
Iterative Computation of Connected Graph Components with MapReduce
Datenbank-Spektrum 14 (2), 2014
2014-07

Hartung, M.; Kolb, L.; Groß, A.; Rahm, E.
Optimizing Similarity Computations for Ontology Matching - Experiences from GOMMA
Proc. 9th Intl. Conference on Data Integration in the Life Sciences (DILS), 2013
2013-07

Kolb, L.; Thor, A.; Rahm, E.
Don't Match Twice: Redundancy-free Similarity Computation with MapReduce
Proc. 2nd Intl. Workshop on Data Analytics in the Cloud (DanaC), 2013
2013-06

Ngonga Ngomo, A.-C.; Kolb, L.; Heino, N.; Hartung, M.; Auer, S.; Rahm, E.
When to Reach for the Cloud: Using Parallel Hardware for Link Discovery
Proc. 10th Intl. Extended Semantic Web Conference (ESWC), 2013
2013-05

Kolb, L.; Rahm, E.
Parallel Entity Resolution with Dedoop
Datenbank-Spektrum 13 (1), 2013
2013-02-23

Kolb, L.; Thor, A.; Rahm, E.
Dedoop: Efficient Deduplication with Hadoop
Proc. 38th Intl. Conference on Very Large Data Bases (VLDB) / Proc. of the VLDB Endowment 5(12), 2012
2012-08

Kolb, L.; Thor, A.; Rahm, E.
Load Balancing for MapReduce-based Entity Resolution
Proc. 28th Intl. Conference on Data Engineering (ICDE), 2012
2012-04

Kolb, L.; Thor, A.; Rahm, E.
Multi-pass Sorted Neighborhood Blocking with MapReduce
Computer Science - Research and Development 27(1), 2012
2012-02

Kolb, L.; Köpcke, H.; Thor, A.; Rahm, E.
Learning-based Entity Resolution with MapReduce
Proc. 3rd Intl. Workshop on Cloud Data Management (CloudDB), 2011
2011-10

Kolb, L.; Thor, A.; Rahm, E.
Block-based Load Balancing for Entity Resolution with MapReduce
Proc. 20th Intl. Conference on Information and Knowledge Management (CIKM), 2011
2011-10

Kolb, L.; Thor, A.; Rahm, E.
Parallel Sorted Neighborhood Blocking with MapReduce
Proc. 14th GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW), 2011
2011-03

Kirsten, T.; Kolb, L.; Hartung, M.; Groß, A.; Köpcke, H.; Rahm, E.
Data Partitioning for Parallel Entity Matching
Proc. 8th Intl. Workshop on Quality in Databases (QDB), 2010
2010-09

Selected Talks

Reviewer

Teaching

Term  Exercise  Practical course  Seminar  Lecture
Fall 2009/10 Database Systems 1 Cloud Data Management Parallel Database Systems (guest lecture)
Spring 2010 Database Systems 2 Database
Data Warehouse
Fall 2010/11 Database Systems 1 Web Data Integration
Spring 2011 Database Systems 2 Database Cloud Data Management (guest lecture)
Fall 2011/12 Database Systems 1 Data Warehouse NoSQL Databases
Spring 2012 Database Systems 2 Database
Fall 2012/13 Database Systems 1 Data Warehouse Large-scale Data Analytics
Spring 2013 Database Systems 2 Database
Fall 2013/14 Database Systems 1 MapReduce New Trends in Big Data Cloud Data Management
Spring 2014 Database Systems 2 Database Systems 2
NoSQL Databases
NoSQL Databases

Bachelor’s & Master’s Thesis Supervision

Year  Student  Type  Title
2012 Axel Fischer B. Sc. Implementierung eines File Managers für das Hadoop Distributed Filesystem und Realisierung einer MapReduce Workflow Submission-Komponente
2013 Sergej Sintschilin B. Sc. Wiederverwendung berechneter Matchergebnisse für MapReduce-basiertes Object Matching
2013 Ziad Sehili M. Sc. Evaluierung und Erweiterung von MapReduce-Algorithmen zur Berechnung der transitiven Hülle ungerichteter Graphen für Entity Resolution Workflows
2013 Dan Häberlein B. Sc. Migration und Extraktion von Datensätzen mittels spaltenorientierter Datenbanken am Beispiel von Apache HBase
2014 Hans-Henning Koch M. Sc. Evaluation of Backends for the Use in a Horizontally Scalable Version of ipoque’s Net Reporter