German English

Benchmark Datasets for Schema and Ontology Matching

Benchmark Datasets for Schema and Ontology Matching

Datasets

The table below comprises a set of selected match tasks that have been solved with COMA++. Each match task consists of at least two source schemas (owl or xdr) to be matched against each other. We also provide the presumably perfect mapping result that can be used for evaluating the results of automatic match approaches. If more than two schemas were matched, the mapping file contains the mappings for all possible schema combinations.

Please refer to the papers to see how we determined the perfect mapping.


DomainType#Elements#Corresp.#SrcInstances?Sources Used in
Purchase Order (Apertum, CIDXPOSCHEMA, Excel, Noris, Paragon)xdr 35 .. 82 36 .. 855NoPurchase Order[1], [2], [3], [4]
Web Directory (Google, Yahoo, web, dmoz)owl418 .. 1132 197 .. 7924YesWebDirectory[3], [4], [5]
Lebensmittel (Google, web)owl53 .. 59 322YesLebensmittel[3]
Freizeit (Google, dmoz)owl67 .. 71 672YesFreizeit[3]

Links to collections of further match tasks

Illinois Semantic Integration Archive
Trento tasks
The Open Biological and Biomedical Ontologies (OBO Foundry)
Ontology Alignment Evaluation Initiative
Test cases for Entity Resolution

Publications

PDF

Google Scholar
publication iconMassmann, Sabine; Raunich, Salvatore; Aumueller, David; Arnold, Patrick; Rahm, Erhard
Evolution of the COMA Match System
OM-2011 (The Sixth International Workshop on Ontology Matching, October 24th, 2011, Bonn, Germany)
2011-10
PDF

Google Scholar
publication iconAlgergawy, A.; Massmann, S.; Rahm, E.
A Clustering-based Approach For Large-scale Ontology Matching
Proc. ADBIS, 2011
2011-09
PDF

Google Scholar
publication iconPeukert, E; Massmann, Sabine; König, Kathleen
Comparing Similarity Combination Methods for Schema Matching
GI-Workshop - Informationsintegration in Service-Architekturen
2010-09-30
PDF

Google Scholar
Massmann, S. ; Rahm, E.
Evaluating Instance-based Matching of Web Directories
11th International Workshop on the Web and Databases (WebDB 2008)
2008-06
PDF

Google Scholar
Engmann, D.; Massmann, S.
Instance Matching with COMA++
BTW 2007 Workshop: Model Management und Metadaten-Verwaltung
2007-03
PDF

Google Scholar
Do, H.-H.; Rahm, E.
Matching Large Schemas: Approaches and Evaluation
Information Systems, Volume 32, Issue 6, September 2007, Pages 857-885
2007
PDF
further information
Google Scholar
publication iconMassmann, S.; Engmann, D.; Rahm, E.
COMA++: Results for the Ontology Alignment Contest OAEI 2006
International Workshop on Ontology Matching, collocated with the 5th ISWC-2006; Athens, Georgia, USA
2006-11

further information
Google Scholar
Do, Hai Hong
Schema Matching and Mapping-based Data Integration
Dissertation. Veröffentlich durch Verlag Dr. Müller (VDM), ISBN 3-86550-997-5,
2006
PDF

Google Scholar
Do, H.H.; Melnik, S.; Rahm, E.
Comparison of Schema Matching Evaluations
Proc. Workshop Web and Databases, LNCS 2593, 2003
2003
PDF
further information
Google Scholar
Do, H.H.; Rahm, E.
COMA - A System for Flexible Combination of Schema Matching Approaches
Proc. 28th Intl. Conference on Very Large Databases (VLDB), Hongkong, Aug. 2002
2002


Contact/Project Members