The XIME-P 2006 Electronic Proceedings
These proceedings are part of the ACM Digital
Library. The ISBN for
the XIME-P 2006 proceedings is 1-5953-465-0. All research papers in
these proceedings are subject to Copyright
© 2006 ACM.
Invited Talk
Don Chamberlin (IBM Almaden Research Center, USA)
XQuery: Where Do We Go From Here?
2006 is a big year for XQuery. It's the year when
significant user experience with XQuery products began to
accumulate, the year when a complete test suite first became
available, and with reasonable luck, the year in which
XQuery will be adopted as a W3C Recommendation. It's a good
time to reflect on what the language designers did well and
what they could have done better. It's also a good time to
try to assemble the wisdom that users and implementors have
gained from experience with XQuery. Based on this wisdom, we
can try to look into the future and predict the ways in
which XQuery needs to evolve in order to be successful.
Research Papers
Michael Kay
(Saxonica Ltd., UK)
|
Positional Grouping in XQuery
This paper proposes an extension to the XQuery language to
solve the problem of positional grouping: that is, problems
in which it is necessary to convert a flat sequence into a
hierarchy by recognizing patterns in the sequence of
items. Positional grouping is contrasted with value-based
grouping, where the allocation of items to groups is based
on common values rather than on the positional relationships
of the items in the sequence. The approach is based on
analyzing a set of use cases, derived from real-world
experience.
|
Don Chamberlin
(IBM Almaden, USA)
Mike Carey
(BEA Systems, USA)
Daniela Florescu
(Oracle Corp., USA)
Donald Kossmann
(ETH Zurich, Switzerland)
Jonathan Robie
(DataDirect, USA)
|
Programming with XQuery
XQuery is a declarative language for querying and updating
XML data sources. Interfacing XQuery to a host programming
language is difficult because of the type system mismatch,
and global optimization is difficult in a mixed-language
environment. In this paper, we investigate a small extension
called XQueryP that enables XQuery expressions to exchange
state information through variables. This extension makes it
easier to develop applications in XQuery without relying on
a host programming language. We draw an analogy between the
proposed extension and similar extensions that have been
added over the years to the SQL query language for similar
reasons.
|
Leonidas Fegaras
Ranjan Dash
YingHui Wang
(U Texas at Arlington, USA)
|
A Fully Pipelined XQuery Processor
We present a high-performance, pull-based streaming
processor for XQuery, called XQPull, that can handle many
essential features of the language, including general
predicates, recursive queries, backward axis steps, and
function calls, using a very small amount of caching. Our
framework is based on a new type of event streams, called
retarded streams, which allow multiple and nested streams to
be interleaved in the same physical stream, while postponing
the caching of input events until is absolutely necessary,
typically at the end of the query evaluation, just before
the results are ready to print.
|
Carl-Christian Kanne
Guido Moerkotte
(U Mannheim, Germany)
|
Template Folding for XPath
We discuss query evaluation for XML-based server systems
where the same query is evaluated on every incoming XML
message. In a typical scenario, many of the incoming
messages will be highly similar to each other. Current XML
query evaluators reevaluate the query from scratch on every
message. We call substructures that occur in many input
documents template fragments, and introduce a novel template
folding method that allows to move the work of evaluating
the query on recurring document substructures from the query
execution engine into the query compiler. Similar to
constant folding, our method avoids run-time evaluation of
intermediate results whose value only depends on information
that is already available at compile time. For XPath
location paths, we propose a representation for such
invariant intermediate results, and show how it can be
incorporated into query execution plans. Such augmented
execution plans improve query performance when evaluating
the same query on subsequent input documents.
|
Christoph Koch
Dan Olteanu
Stefanie Scherzinger
(U Saarland, Germany)
|
Building a Native XML-DBMS as a Term Project in a Database Systems Course
This is to report on a database systems course the rst
author held in the summer semester of 2005 at Saarland
University, Saarbrüucken, Germany. This course was an
experiment in several respects. For one, we wanted to teach
a systems course with a practical part in which students
apply the material taught to build the core of a database
management system. Such a systems building e ort seems to be
quite common in top-tier US universities, but it is rare in
Europe. One main reason for this is that European curricula
often require students to take many small courses per
term. Students then cannot be required to invest the time
necessary for such a systems-building e ort into an
individual course. In Saarbrücken, this fortunately does
not apply and students are expected to take only about two
main courses per term. (The database systems course in
Saarbrucken is worth 9 points in the European
course-credit transfer system ECTS, which corresponds to an
estimated workload of 20 hours per week.)
|
Wouter Alink
Raoul Bhoedjang
(Forensic Institute, NL)
Arjen de Vries
Peter Boncz
(CWI, NL)
|
Efficient XQuery Support for Stand-Off Annotation
XML annotations are a widely occurring phenomenon in many
application fields, and XML databases should be used to
store and query such data. To provide intuitive and fast
querying of annotations, we make a case for extending XPath
with four new axis steps, that correspond with socalled
StandOff joins, introduced here. The new steps can be
efficiently implemented using a region index and fast
looplifted StandOff MergeJoin algorithms. These techniques
were added to the open-source XML DBMS MonetDB/XQuery, and
we show in our evaluation it thus becomes capable of
interactively querying >GB annotation databases.
|
Ionut E. Iacob
(UNC Wilmington, USA)
Alex Dekhtyar
(U Kentucky, USA)
|
Multihierarchical XQuery for Document-Centric XML
Text has a non-hierarchical structure. Not suprisingly,
searching for information ion the content of a document
often yields results that overlap the structure within the
document. It is often of great interest to relate such
results to the embedded document structure. In this work we
present an extension to the XQuery language over
multihierarchical document-centric XML documents. We
illustrate the benefits of using multihierarchical XQuery
for text-and-structure searches in document-centric XML
documents. More specifically, multihierarchical XQuery
allows representing relationships between text and search
results and document structure even for cases where such
search results overlap markup boundaries and even in cases
when only one markup hierarchy is considered.
|
Copyright © 2006 ACM
The Association for Computing Machinery, Inc.
1515 Broadway
New York, New York 10036
Copyright © 2006 by the Association for Computing
Machinery, Inc (ACM). Permission to make digital or hard copies of
portions of this work for personal or classroom use is granted
without fee provided that the copies are not made or distributed for
profit or commercial advantage and that copies bear this notice and
the full citation on the first page. Copyrights for components of
this work owned by others than ACM must be honored.
Abstracting with
credit is permitted.
To copy otherwise, to republish, to post on servers or to
redistribute to lists, requires prior specific permission and/or a
fee. Request permission to republish from: Publications Dept. ACM,
Inc. Fax +1-212-869-0481 or E-mail permissions@acm.org.
For other copying of articles that carry a code at the bottom of the
first or last page, copying is permitted provided that the per-copy
fee indicated in the code is paid through the Copyright Clearance
Center, 222 Rosewood Drive, Danvers, MA 01923.
Notice to Past Authors of ACM-Published Articles
ACM intends to create a complete electronic archive of all articles
and/or other material previously published by ACM. If you have
written a work that was previously published by ACM in any journal
or conference proceedings prior to 1978, or any SIG Newsletter at
any time, and you do NOT want this work to appear in the ACM Digital
Library, please inform permissions@acm.org, stating
the title of the
work, the author(s), and where and when published.
ACM ISBN 1-5953-465-0
Additional copies may be ordered prepaid from:
ACM Order Department
P.O. Box 11405
Church Street Station
New York, NY 10286-1405
|
Phone: 1-800-342-6626
(U.S.A. and Canada)
+1-212-626-0500
(All other countries)
Fax: +1-212-944-1318
E-mail: acmhelp@acm.org
|