
XMach-1: A Benchmark for XML Data Management

Benchmarking XML Data Management Systems

The need to store and manage large amounts of XML data efficiently is growing rapidly. XML is increasingly used as an improved web format, as the native data format for a variety of applications, and as a standard interchange format, especially in the e-business domain.

Two main types of systems are promoted to manage XML data, namely

  • native XML data stores, and
  • relational/object-relational DBMS augmented with an extension to store and manipulate XML data.

Native data stores are tailored to XML and thus promise performance benefits and better support for XML-specific requirements (e.g., complex document structure, fast path navigation, text search). Relational and object-relational systems, on the other hand, typically provide good scalability and a large repertoire of performance-improving techniques, e.g. for query processing, that can be exploited for at least some uses of XML data. Furthermore, they may remove the need for separate data management systems for SQL and XML data.
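To make the XML-specific requirements named above more concrete, the following minimal sketch contrasts path navigation with text search on a single small document. It uses only Python's standard xml.etree.ElementTree module. The sample document, its element names, and the two helper functions are assumptions made for illustration; they are not taken from the XMach-1 schema or its reference implementation.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; the element names are illustrative only
# and are not taken from the XMach-1 schema.
SAMPLE = """
<document id="d1">
  <title>Storing XML</title>
  <chapter>
    <section><paragraph>Native stores index paths.</paragraph></section>
    <section><paragraph>Relational systems shred documents into tables.</paragraph></section>
  </chapter>
</document>
"""


def paragraphs_by_path(root):
    """Path navigation: follow a fixed element path starting at the root."""
    return [p.text for p in root.findall("./chapter/section/paragraph")]


def elements_containing(root, phrase):
    """Text search: scan the text of every element for a phrase, ignoring case."""
    needle = phrase.lower()
    return [el.tag for el in root.iter() if el.text and needle in el.text.lower()]


root = ET.fromstring(SAMPLE)
path_hits = paragraphs_by_path(root)
text_hits = elements_containing(root, "relational")
print(path_hits)
print(text_hits)
```

Both helpers here walk an in-memory tree. A data management system would typically answer the same two kinds of request from indexes, which is where the storage approaches described above can be expected to differ.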

We are currently evaluating several XML data management systems using a newly developed benchmark called XMach-1. This is the first version of the benchmark; we will improve it as practical results and new requirements become available. The current version of the paper is a revision that replaces the first draft from September 2000.

Now that the first public draft of the XQuery query language for XML has been published, we have formulated the queries from our specification in this language.
We have since updated the queries to fix some minor bugs and to reflect the changes made in the XQuery specification draft of 7 June 2001.

Implementation

We have now released the complete XMach-1 benchmark reference implementation, consisting of a data generator to populate the XML benchmark database and a query execution framework. Please download it here. The archive contains the sources, class files, and documentation of the data generator and the generic query framework. For further information, see the README file in the doc directory.

Publications

Project Members

Links