Skip to main content

User account menu

  • Log in
DBS-Logo

Database Group Leipzig

within the department of computer science

ScaDS-Logo Logo of the University of Leipzig

Main navigation

  • Home
  • Study
    • Exams
      • Hinweise zu Klausuren
    • Courses
      • Current
    • Modules
    • LOTS-Training
    • Abschlussarbeiten
    • Masterstudiengang Data Science
    • Oberseminare
    • Problemseminare
    • Top-Studierende
  • Research
    • Projects
      • Benchmark datasets for entity resolution
      • FAMER
      • HyGraph
      • Privacy-Preserving Record Linkage
      • GRADOOP
    • Publications
    • Prototypes
    • Annual reports
    • Cooperations
    • Graduations
    • Colloquia
    • Conferences
  • Team
    • Erhard Rahm
    • Member
    • Former employees
    • Associated members
    • Gallery

Data integration with WETSUIT

Breadcrumb

  • Home
  • Data integration with WETSUIT

Duration

/

Description

WETSUIT (Web EnTity Search and fUsIon Tool):

WETSUIT is a new powerful open source mashup tool to search and integrate web data from diverse sources and domain-specific entity search engines. It supports adaptive search strategies to query sets of relevant entities with a minimum of communication overhead. Mashups can be composed using a set of high-level operators based on the Java-compatible language Scala. The operator implementation supports a high degree of parallel processing, in particular a streaming of entities between all data transformation operations facilitating a fast presentation of intermediate results.

Demonstration Mashups:

  • Online Citation Service lets you determine the citation counts of Google Scholar for any author or venue listed at DBLP. References to be analyzed can also be provided by a csv or bib file.
  • SimPubFinder lets you determine the citing papers for publications listed in a bib or csv input file.

Publikationen (5)

Dateien Cover Beschreibung Jahr
WETSUIT: An Efficient Mashup Tool for Searching and Fusing Web Entities
Endrullis, S. ; Thor, A. ; Rahm, E.
Proc. 38th Intl. Conference on Very Large Databases (VLDB) / Proceedings of the VLDB Endowment 5(12), 2012 (demo)
2012 / 8
Entity Search Strategies for Mashup Applications
Endrullis, S. ; Thor, A. ; Rahm, E.
Proc. 28th Intl. Conference on Data Engineering (ICDE), 2012
2012 / 4
CloudFuice: A flexible Cloud-based Data Integration System
Thor, A. ; Rahm, E.
Proc. of 10th Intl. Conference on Web Engineering (ICWE), 2011
2011 / 6
Evaluation of Query Generators for Entity Search Engines
Endrullis, S. ; Thor, A. ; Rahm, E.
Proc. Intl. Workshop on Using Search Engine Technology for Information Management (USETIM), 2009
2009 / 8
Data Integration Support for Mashups
Thor, A. ; Aumüller, D. ; Rahm, E.
Proc. 6th Intl. Workshop on Information Integration on the Web (IIWeb), 2007
2007 / 7

Recent publications

  • 2025 / 9: Generating Semantically Enriched Mobility Data from Travel Diaries
  • 2025 / 8: Slice it up: Unmasking User Identities in Smartwatch Health Data
  • 2025 / 6: SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage
  • 2025 / 6: Leveraging foundation models and goal-dependent annotations for automated cell confluence assessment
  • 2025 / 5: Federated Learning With Individualized Privacy Through Client Sampling

Footer menu

  • Directions
  • Contact
  • Impressum