TwigBuffer: Avoiding Useless Intermediate Solutions Completely in Twig Joins

File Size Format
51508_1.pdf 135Kb Adobe PDF View
Title TwigBuffer: Avoiding Useless Intermediate Solutions Completely in Twig Joins
Author Li, Jiang; Wang, Junhu
Journal Name Lecture Notes in Computer Science
Editor G Goos, J Hartmanis
Year Published 2008
Place of publication Germany
Publisher Springer
Abstract Twig pattern matching plays a crucial role in XML data processing. TwigStack [2] is a holistic twig join algorithm that solves the problem in two steps: (1) finding potentially useful intermediate path solutions, (2) merging the intermediate solutions. The algorithm is optimal when the twig pattern has only //-edges, in the sense that no useless partial solutions are generated in the first step (thus expediting the second step and boosting the overall performance). However, when /-edges are present, a large set of useless partial solutions may be produced, which directly downgrades the overall performance. Recently, some improved versions of the algorithm (e.g., TwigStackList and iTwigJoin) have been proposed in an attempt to reduce the number of useless partial solutions when /-edges are involved. However, none of the algorithms can avoid useless partial solutions completely. In this paper, we propose a new algorithm, TwigBuffer, that is guaranteed to completely avoid the useless partial solutions. Our algorithm is based on a novel strategy to buffer and manipulate elements in stacks and lists. Experiments show that TwigBuffer significantly outperforms previous algorithms when arbitrary /-edges are present.
Peer Reviewed Yes
Published Yes
Alternative URI http://dx.doi.org/10.1007/978-3-540-78568-2_46
Copyright Statement Copyright 2008 Springer. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. The original publication is available at www.springerlink.com
Volume 4947
Page from 554
Page to 561
ISSN 0302-9743/1611-3349
Date Accessioned 2008-07-18
Language en_AU
Research Centre Institute for Integrated and Intelligent Systems
Faculty Faculty of Science, Environment, Engineering and Technology
Subject PRE2009-Database Management
URI http://hdl.handle.net/10072/26126
Publication Type Journal Articles (Refereed Article)
Publication Type Code c1

Show simple item record

Griffith University copyright notice