Life science ontologies evolve frequently to meet new requirements or to better reflect the current domain knowledge. The development and adaptation of large and complex ontologies is typically performed collaboratively by several curators. To effectively manage the evolution of ontologies it is essential to identify the difference (Diff) between ontology versions. Such a Diff supports the synchronization of changes in collaborative curation, the adaptation of dependent data such as annotations, and ontology version management. We propose a novel approach COnto-Diff to determine an expressive and invertible diff evolution mapping between given versions of an ontology. Our approach first matches the ontology versions and determines an initial evolution mapping consisting of basic change operations (insert/update/delete). To semantically enrich the evolution mapping we adopt a rule-based approach to transform the basic change operations into a smaller set of more complex change operations, such as merge, split, or changes of entire subgraphs. The proposed algorithm is customizable in different ways to meet the requirements of diverse ontologies and application scenarios. We evaluate the proposed approach for large life science ontologies including the Gene Ontology and the NCI Thesaurus and compare it with PromptDiff. We further show how the Diff results can be used for version management and annotation migration in collaborative curation.
<h2 id="bibtex_heading">BibTex</h2>
<pre id="bibtex_listing">
@ARTICLE{hartung_jbi2013,
author = {Hartung, M. and Gross, A. and Rahm, E.},
title = {{COnto-Diff: Generation of Complex Evolution Mappings for Life Science Ontologies}},
journal = {{Journal of Biomedical Informatics}},
volume = {46},
issue = {1},
pages = {15--32},
year = {2013}
}
</pre>