Skip to main content

User account menu

  • Log in
DBS-Logo

Database Group Leipzig

within the department of computer science

ScaDS-Logo Logo of the University of Leipzig

Main navigation

  • Home
  • Study
    • Exams
      • Hinweise zu Klausuren
    • Courses
      • Current
    • Modules
    • LOTS-Training
    • Abschlussarbeiten
    • Masterstudiengang Data Science
    • Oberseminare
    • Problemseminare
    • Top-Studierende
  • Research
    • Projects
      • Benchmark datasets for entity resolution
      • FAMER
      • HyGraph
      • Privacy-Preserving Record Linkage
      • GRADOOP
    • Publications
    • Prototypes
    • Annual reports
    • Cooperations
    • Graduations
    • Colloquia
    • Conferences
  • Team
    • Erhard Rahm
    • Member
    • Former employees
    • Associated members
    • Gallery

When to Reach for the Cloud: Using Parallel Hardware for Link Discovery

Breadcrumb

  • Home
  • When to Reach for the Cloud: Using Parallel Hardware for Link Discovery

Kolb, L. ; Heino, N. ; Hartung, M. ; Auer, S. ; Rahm, E.

When to Reach for the Cloud: Using Parallel Hardware for Link Discovery

Proc. 10th Intl. Extended Semantic Web Conference (ESWC), 2013

2013 / 05

Paper

Abstract

<p style="text-align:justify;">
With the ever-growing amount of RDF data available across the Web, the discovery of links between datasets and deduplication of resources within knowledge bases have become tasks of crucial importance. Over the last years, several link discovery approaches have been developed to tackle the runtime and complexity problems that are intrinsic to link discovery. Yet, so far, little attention has been paid to the management of hardware resources for the execution of link discovery tasks. This paper addresses this research gap by investigating the efficient use of hardware resources for link discovery. We implement the HR<sup>3</sup> approach for three different parallel processing paradigms including the use of GPUs and MapReduce platforms. We also perform a thorough performance comparison for these implementations. Our results show that certain tasks that appear to require cloud computing techniques can actually be accomplished using standard parallel hardware. Moreover, our evaluation provides break-even points that can serve as guidelines for deciding on when to use which hardware for link discovery.
</p>

<h2>ESWC 2013 Best Paper Award</h2>
<p style="text-align:justify;">
<a href="/file/eswc_2013_best_paper_award.pdf" style="float:right; margin-left:2em;"><img title="Best Paper Award" src="file/eswc_2013_best_paper_award.png" width="165" height="234" alt="Award" style="border:1px solid grey;"/></a>The research paper &quot;When to Reach for the Cloud: Using Parallel Hardware for Link Discovery&quot; has received the (shared) Best Paper Award of the 10th Extended Semantic Web Conference (ESWC) in Montpellier. The paper is the result of a close cooperation between the database group and the AKSW team.
</p>

<h2>Keywords</h2>
<ul>
<li>Link Discovery</li>
<li>GPU</li>
<li>MapReduce</li>
<li>Data Skew, Load Balancing</li>
</ul>

<h2 id="bibtex_heading">BibTex</h2>
<pre id="bibtex_listing">
@inproceedings{DBLP:conf/esws/NgomoKHHAR13,
author = {Axel-Cyrille Ngonga Ngomo and
Lars Kolb and
Norman Heino and
Michael Hartung and
S{\\"o}ren Auer and
Erhard Rahm},
title = {{When to Reach for the Cloud: Using Parallel Hardware for
Link Discovery}},
booktitle = {ESWC},
year = {2013},
pages = {275-289},
ee = {http://dx.doi.org/10.1007/978-3-642-38288-8_19},
crossref = {DBLP:conf/esws/2013},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
</pre>

Recent publications

  • 2025 / 9: Generating Semantically Enriched Mobility Data from Travel Diaries
  • 2025 / 8: Slice it up: Unmasking User Identities in Smartwatch Health Data
  • 2025 / 6: SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage
  • 2025 / 6: Leveraging foundation models and goal-dependent annotations for automated cell confluence assessment
  • 2025 / 5: Federated Learning With Individualized Privacy Through Client Sampling

Footer menu

  • Directions
  • Contact
  • Impressum