Skip to main content

User account menu

  • Log in
DBS-Logo

Database Group Leipzig

within the department of computer science

ScaDS-Logo Logo of the University of Leipzig

Main navigation

  • Home
  • Study
    • Exams
      • Hinweise zu Klausuren
    • Courses
      • Current
    • Modules
    • LOTS-Training
    • Abschlussarbeiten
    • Masterstudiengang Data Science
    • Oberseminare
    • Problemseminare
    • Top-Studierende
  • Research
    • Projects
      • Benchmark datasets for entity resolution
      • FAMER
      • HyGraph
      • Privacy-Preserving Record Linkage
      • GRADOOP
    • Publications
    • Prototypes
    • Annual reports
    • Cooperations
    • Graduations
    • Colloquia
    • Conferences
  • Team
    • Erhard Rahm
    • Member
    • Former employees
    • Associated members
    • Gallery

Exploiting Semantics from Ontologies and Shared Annotations to Partition Linked Data

Breadcrumb

  • Home
  • Research
  • Publications
  • Exploiting Semantics from Ontologies and Shared Annotations to Partition Linked Data

Palma, G. ; Vidal, M. ; Raschid, L. ; Thor, A.

Exploiting Semantics from Ontologies and Shared Annotations to Partition Linked Data

Proc. 10th Intl. Conference on Data Integration in the Life Sciences (DILS), Lisbon, July 2014

2014 / 07

Paper

Abstract

Linked Open Data initiatives have made available a diversity of collections that domain experts have annotated with controlled vocabulary terms from ontologies. The challenge is to explore these rich and complex annotated datasets, together with the domain semantics captured within ontologies, to discover patterns of annotations across multiple concepts that may lead to potential discoveries. We identify annotation signatures of links that associate semantically similar concepts, where similarity is measured in terms of shared annotations and ontological relatedness. Formally, an annotation signature is a partitioning or clustering of the links that represent the relationships between shared annotations. A clustering algorithm named AnnSigClustering is proposed to generate annotation signatures. Evaluation results over drug, disease, and gene datasets demonstrate the effectiveness of using annotation signatures to find patterns between the links that belong to the same part of the signature.

Recent publications

  • 2025 / 9: Generating Semantically Enriched Mobility Data from Travel Diaries
  • 2025 / 8: Slice it up: Unmasking User Identities in Smartwatch Health Data
  • 2025 / 6: SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage
  • 2025 / 6: Leveraging foundation models and goal-dependent annotations for automated cell confluence assessment
  • 2025 / 5: Federated Learning With Individualized Privacy Through Client Sampling

Footer menu

  • Directions
  • Contact
  • Impressum