Aumüller, D. ; Rahm, E.

PDFMeat: Managing Publications on the Semantic Desktop

CIKM 2011, October 24–28, 2011, Glasgow, Scotland, UK

2011 / 10

Paper

Further information: http://code.google.com/p/pdfmeat/

Abstract

Researchers maintain bibliographies and extensive sets of PDF files of scholarly publications on their desktops. The lack of proper metadata in downloaded PDFs makes this task tedious. With PDFMeat we present a solution that automatically determines publication metadata for scholarly papers within the user’s desktop environment and links the metadata to the files. PDFMeat effectively matches local full texts to an online repository. In an evaluation on more than 2,000 diverse PDF files, it worked highly reliably and achieved an accuracy of up to 98 percent. We demonstrate PDFMeat for different sets of papers, highlighting the semantic integration and use of the retrieved metadata within the file browser of the desktop environment.