Citation Analysis

The impact of scientific publications is often estimated by the number of citations they receive, i.e. how frequently they are referenced by other publications. Since publications have associated authors, originating institutions, and publication venues (e.g. journals, conference proceedings), citations have also been used to compare the scientific impact of these entities. The tremendous scope of new scientific archives like Google Scholar makes it possible to freely access citation data for millions of publications and authors and thus to evaluate the citations for entire conferences and journals.

Data-Warehouse-based Analysis

We performed an offline, data-warehouse-based citation analysis for selected database conferences (VLDB, SIGMOD) and journals (TODS, VLDB Journal, SIGMOD Record) in August 2005 and again in 2007.

The citation/reference counts for the 10 years 1994-2003 were determined by combining data from DBLP, ACM Digital Library, and Google Scholar (GS) as described in our iFuice paper. All reference counts are from Google Scholar and ACM. Reference counts from GS have problems, e.g. they include self-citations and pointers from web pages. We also noticed cases where GS grouped together different versions of a paper, book reviews and books, or had several entries for the same publication. We therefore applied some post-processing to deal with such cases. Still, GS is by far the most current and comprehensive source at this time, and we find the results quite interesting and helpful for uncovering certain trends.
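The post-processing itself is not described here, so the following is only a minimal sketch of one plausible step: merging several entries of the same publication. The matching key (normalized title plus year), the function names, and the citation counts are all assumptions made for illustration, not the method or data used in the analysis.

```python
import re
from collections import defaultdict


def normalize_title(title):
    """Hypothetical matching key: lowercase, drop punctuation, collapse spaces."""
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", title.lower())
    return " ".join(cleaned.split())


def merge_duplicate_entries(entries):
    """Merge entries whose normalized title and year agree, summing their counts.

    `entries` is an iterable of (title, year, citation_count) tuples.
    Summing is an assumption: it double-counts a citing paper that is
    listed under both duplicate entries.
    """
    merged = defaultdict(int)
    for title, year, count in entries:
        merged[(normalize_title(title), year)] += count
    return dict(merged)


# Made-up counts, for illustration only.
entries = [
    ("Citation analysis of database publications", 2005, 10),
    ("Citation Analysis of Database Publications.", 2005, 3),
    ("Data Integration Support for Mashups", 2007, 7),
]
merged = merge_duplicate_entries(entries)
```

Exact matching on a normalized title is deliberately simple. It would not catch entries that differ by more than case and punctuation, and it cannot separate a book from a review of that book when both carry the same title.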

The analysis results were published in SIGMOD Record 2005. In 2007 we reran the analysis for an extended period of 12 years (1994-2005); a short summary of the new results is in an APE08 paper (slides).

In the following subpages we present selected results for the considered venues, in particular the top-5 papers per year and overall ("top" in terms of reference counts). Furthermore, we determine the 100 most referenced authors for the considered venues and present selected results comparing the impact of conferences and journals as well as the increase in citations within two years.
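A ranking such as "top-5 papers per year" can be sketched as follows, assuming each paper is available as a (title, year, reference_count) tuple. The function name, the tuple layout, and the sample counts are hypothetical and serve only to illustrate the ranking.

```python
from collections import defaultdict


def top_k_per_year(papers, k=5):
    """Return, per year, the titles of the k papers with the most references.

    `papers` is an iterable of (title, year, reference_count) tuples.
    Ties keep their input order because Python's sort is stable.
    """
    by_year = defaultdict(list)
    for title, year, count in papers:
        by_year[year].append((title, count))
    return {
        year: [title for title, _ in sorted(items, key=lambda p: -p[1])[:k]]
        for year, items in by_year.items()
    }


# Made-up counts, for illustration only.
papers = [
    ("Paper A", 1994, 120),
    ("Paper B", 1994, 300),
    ("Paper C", 1994, 45),
    ("Paper D", 1995, 80),
]
ranking = top_k_per_year(papers, k=2)
```

The overall top list is the same computation with all papers placed in a single group instead of one group per year.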

Additional analysis results:

Online Analysis

We are currently developing the Online Citation Service (OCS), a new system for online citation analysis of computer science research. For any set of DBLP publications, it retrieves and integrates citation data on demand from four different data sources: Google Scholar, Microsoft Libra, ACM Digital Library, and Citeseer. A set of search query generators is provided to efficiently retrieve relevant citation data and to iteratively refine search results for improved data quality.

Prototypes:

  • OCS 1.0 (restricted to Google Scholar; only publication lists for one author or for one venue can be analyzed)
  • OCS 2.0 (currently only for internal use)
  • Google Scholar H-Index: calculates the single publication h index (and further metrics) based on Google Scholar data
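The single publication h index is commonly defined as the h index of the set of papers that cite a given publication. The sketch below follows that common definition. It is not the code of the Google Scholar H-Index tool, and the function names and sample counts are assumptions.

```python
def h_index(citation_counts):
    """Largest h such that at least h items have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def single_publication_h_index(citing_paper_counts):
    """h index over the citation counts of the papers citing one publication."""
    return h_index(citing_paper_counts)


# Made-up citation counts of six papers that cite the publication of interest.
citing_paper_counts = [12, 7, 3, 3, 1, 0]
sp_h = single_publication_h_index(citing_paper_counts)  # 3
```

Here three citing papers have at least three citations each, but there are not four with at least four, so the value is 3.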

See also Affiliation Analysis

Project Members

Publications

Thor, A.; Bornmann, L.
The calculation of the single publication h index and related performance measures: A Web application based on Google Scholar
Online Information Review 35(2), 2011
2011-01

Bornmann, L.; Marx, W.; Schier, H.; Thor, A.; Daniel, H.-D.
From black box to white box at open access journals: Predictive validity of manuscript reviewing and editorial decisions …
Research Evaluation 19(2), 2010
2010-06

Aumueller, David; Rahm, Erhard
Web-based Affiliation Matching
14th International Conference on Information Quality 2009 (ICIQ’09)
2009-11

Bornmann, L.; Marx, W.; Schier, H.; Rahm, E.; Thor, A.; Daniel, H.-D.
Convergent validity of bibliometric Google Scholar data in the field of chemistry
Journal of Informetrics 3(1), 2009
2009

Rahm, E.
Comparing the Scientific Impact of Conference and Journal Publications in Computer Science
Information Services & Use
2008

Thor, Andreas; Aumueller, David; Rahm, Erhard
Data Integration Support for Mashups
Proc. 6th Intl. Workshop on Information Integration on the Web (IIWeb), 2007
2007-07

Köpcke, H.; Rahm, E.
Analyse von Zitierungshäufigkeiten für die Datenbankkonferenz BTW
Datenbank-Spektrum, Volume 7, Issue 20
2007-02

Rahm, E.; Thor, A.
Citation analysis of database publications
ACM SIGMOD Record 34(4), 2005
2005-12