XQuery sample code

XIME-P 2006

News:

June 26, 2006:
The XIME-P 2006 Electronic Proceedings are available.

June 20, 2006:
The "Caffeine and Code" demonstration session is complete and will feature 11 novel and innovative XQuery implementations, tools, and applications.

May 17, 2006:
The workshop program has been posted. Registration details are available.
Demo your XQuery system in the "Caffeine and Code" break!

Apr 10, 2006:
Submission is closed now. Authors will be notified by May 15, 2006 whether their submission is accepted for inclusion in the XIME-P 2006 program. Thank you for your contributions!

Mar 29, 2006:
The submission deadline for XIME-P 2006 has been postponed by one week. New submission deadline: Monday, April 10, 2006.

Mar 21, 2006:
Paper submission is open! (Use New User Registration to obtain a login and password for the submission site.)
Note: The submission deadline is April 10, 2006, 5 p.m. (PDT).

Jan 20, 2006:
XIME-P 2006 will be run as an ACM supported workshop, collocated with SIGMOD 2006.

Dec 6, 2005:
Don Chamberlin, a lead of the W3C XQuery Working Group will give the XIME-P 2006 keynote speech.

Sponsors:

BEA

IBM

Oracle

X-Hive

Important Dates:

Contact:

Please address any questions to the workshop co-chairs, Mike Carey and Torsten Grust.

Previous XIME-P workshops:

Upcoming XIME-P workshop:

The XIME-P 2006 Electronic Proceedings

These proceedings are part of the ACM Digital Library. The ISBN for the XIME-P 2006 proceedings is 1-5953-465-0. All research papers in these proceedings are subject to Copyright © 2006 ACM.

Invited Talk

Don Chamberlin (IBM Almaden Research Center, USA)
XQuery: Where Do We Go From Here?

PDF2006 is a big year for XQuery. It's the year when significant user experience with XQuery products began to accumulate, the year when a complete test suite first became available, and with reasonable luck, the year in which XQuery will be adopted as a W3C Recommendation. It's a good time to reflect on what the language designers did well and what they could have done better. It's also a good time to try to assemble the wisdom that users and implementors have gained from experience with XQuery. Based on this wisdom, we can try to look into the future and predict the ways in which XQuery needs to evolve in order to be successful.

Research Papers

Michael Kay
(Saxonica Ltd., UK)
Positional Grouping in XQuery

PDF This paper proposes an extension to the XQuery language to solve the problem of positional grouping: that is, problems in which it is necessary to convert a flat sequence into a hierarchy by recognizing patterns in the sequence of items. Positional grouping is contrasted with value-based grouping, where the allocation of items to groups is based on common values rather than on the positional relationships of the items in the sequence. The approach is based on analyzing a set of use cases, derived from real-world experience.

Don Chamberlin
(IBM Almaden, USA)
Mike Carey
(BEA Systems, USA)
Daniela Florescu
(Oracle Corp., USA)
Donald Kossmann
(ETH Zurich, Switzerland)
Jonathan Robie (DataDirect, USA)
Programming with XQuery

PDF XQuery is a declarative language for querying and updating XML data sources. Interfacing XQuery to a host programming language is difficult because of the type system mismatch, and global optimization is difficult in a mixed-language environment. In this paper, we investigate a small extension called XQueryP that enables XQuery expressions to exchange state information through variables. This extension makes it easier to develop applications in XQuery without relying on a host programming language. We draw an analogy between the proposed extension and similar extensions that have been added over the years to the SQL query language for similar reasons.

Leonidas Fegaras
Ranjan Dash
YingHui Wang
(U Texas at Arlington, USA)
A Fully Pipelined XQuery Processor

PDF We present a high-performance, pull-based streaming processor for XQuery, called XQPull, that can handle many essential features of the language, including general predicates, recursive queries, backward axis steps, and function calls, using a very small amount of caching. Our framework is based on a new type of event streams, called retarded streams, which allow multiple and nested streams to be interleaved in the same physical stream, while postponing the caching of input events until is absolutely necessary, typically at the end of the query evaluation, just before the results are ready to print.

Carl-Christian Kanne
Guido Moerkotte
(U Mannheim, Germany)
Template Folding for XPath

PDF We discuss query evaluation for XML-based server systems where the same query is evaluated on every incoming XML message. In a typical scenario, many of the incoming messages will be highly similar to each other. Current XML query evaluators reevaluate the query from scratch on every message. We call substructures that occur in many input documents template fragments, and introduce a novel template folding method that allows to move the work of evaluating the query on recurring document substructures from the query execution engine into the query compiler. Similar to constant folding, our method avoids run-time evaluation of intermediate results whose value only depends on information that is already available at compile time. For XPath location paths, we propose a representation for such invariant intermediate results, and show how it can be incorporated into query execution plans. Such augmented execution plans improve query performance when evaluating the same query on subsequent input documents.

Christoph Koch
Dan Olteanu
Stefanie Scherzinger
(U Saarland, Germany)
Building a Native XML-DBMS as a Term Project in a Database Systems Course

PDF This is to report on a database systems course the rst author held in the summer semester of 2005 at Saarland University, Saarbrüucken, Germany. This course was an experiment in several respects. For one, we wanted to teach a systems course with a practical part in which students apply the material taught to build the core of a database management system. Such a systems building e ort seems to be quite common in top-tier US universities, but it is rare in Europe. One main reason for this is that European curricula often require students to take many small courses per term. Students then cannot be required to invest the time necessary for such a systems-building e ort into an individual course. In Saarbrücken, this fortunately does not apply and students are expected to take only about two main courses per term. (The database systems course in Saarbrucken is worth 9 points in the European course-credit transfer system ECTS, which corresponds to an estimated workload of 20 hours per week.)

Wouter Alink
Raoul Bhoedjang
(Forensic Institute, NL)
Arjen de Vries
Peter Boncz
(CWI, NL)
Efficient XQuery Support for Stand-Off Annotation

PDF XML annotations are a widely occurring phenomenon in many application fields, and XML databases should be used to store and query such data. To provide intuitive and fast querying of annotations, we make a case for extending XPath with four new axis steps, that correspond with socalled StandOff joins, introduced here. The new steps can be efficiently implemented using a region index and fast looplifted StandOff MergeJoin algorithms. These techniques were added to the open-source XML DBMS MonetDB/XQuery, and we show in our evaluation it thus becomes capable of interactively querying >GB annotation databases.

Ionut E. Iacob
(UNC Wilmington, USA)
Alex Dekhtyar
(U Kentucky, USA)
Multihierarchical XQuery for Document-Centric XML

PDF Text has a non-hierarchical structure. Not suprisingly, searching for information ion the content of a document often yields results that overlap the structure within the document. It is often of great interest to relate such results to the embedded document structure. In this work we present an extension to the XQuery language over multihierarchical document-centric XML documents. We illustrate the benefits of using multihierarchical XQuery for text-and-structure searches in document-centric XML documents. More specifically, multihierarchical XQuery allows representing relationships between text and search results and document structure even for cases where such search results overlap markup boundaries and even in cases when only one markup hierarchy is considered.

Copyright © 2006 ACM

The Association for Computing Machinery, Inc.
1515 Broadway
New York, New York 10036

Copyright © 2006 by the Association for Computing Machinery, Inc (ACM). Permission to make digital or hard copies of portions of this work for personal or classroom use is granted without fee provided that the copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted.

To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permission to republish from: Publications Dept. ACM, Inc. Fax +1-212-869-0481 or E-mail permissions@acm.org.

For other copying of articles that carry a code at the bottom of the first or last page, copying is permitted provided that the per-copy fee indicated in the code is paid through the Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923.

Notice to Past Authors of ACM-Published Articles
ACM intends to create a complete electronic archive of all articles and/or other material previously published by ACM. If you have written a work that was previously published by ACM in any journal or conference proceedings prior to 1978, or any SIG Newsletter at any time, and you do NOT want this work to appear in the ACM Digital Library, please inform permissions@acm.org, stating the title of the work, the author(s), and where and when published.

ACM ISBN 1-5953-465-0
Additional copies may be ordered prepaid from:

ACM Order Department
P.O. Box 11405
Church Street Station
New York, NY 10286-1405
Phone: 1-800-342-6626
(U.S.A. and Canada)
+1-212-626-0500
(All other countries)
Fax: +1-212-944-1318
E-mail: acmhelp@acm.org