• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Semantic caching for xml queries (2004)

by L Chen
Add To MetaCart

Tools

Sorted by:
Results 1 - 2 of 2

Utility-driven Load Shedding for XML Stream Processing

by Mingzhu Wei, Elke A. Rundensteiner, Murali Mani - In WWW , 2008
"... Because of the high volume and unpredictable arrival rate, stream processing systems may not always be able to keep up with the input data streams — resulting in buffer overflow and uncontrolled loss of data. Load shedding, the prevalent strategy for solving this overflow problem, has so far only be ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
Because of the high volume and unpredictable arrival rate, stream processing systems may not always be able to keep up with the input data streams — resulting in buffer overflow and uncontrolled loss of data. Load shedding, the prevalent strategy for solving this overflow problem, has so far only been considered for relational stream processing, but not for XML. Shedding applied to XML stream processing brings new opportunities and challenges due to complex nested nature of XML structures. In this paper, we tackle this unsolved XML shedding problem using a three-pronged approach. First, we develop an XQuery preference model that enables users to specify the relative importance of preserving different subpatterns in the XML result structure. This transforms shedding into the problem of rewriting the user query into shed queries that return approximate query answers with utility as measured by the given user preference model. Second, we develop a cost model to compare the performance of alternate shed queries. Third, we develop two shedding algorithms, OptShed and FastShed. OptShed guarantees to find an optimal solution however at the cost of exponential complexity. FastShed, as confirmed by our experiments, achieves a close-to-optimal result in a wide range of test cases. Finally we describe the in-automaton shedding mechanism for XQuery stream engines. The experiments show that our proposed utility-driven shedding solutions consistently achieve higher utility results compared to the existing relational shedding techniques.
(Show Context)

Citation Context

...WR expressions; and the “WHERE” clause contains conjunctive selection predicates, each predicate being an operation between a variable and a constant. We assume the queries have been normalized as in =-=[6]-=-. The query pattern tree for query Q1 is given in Figure 2. In Figure 2, each navigation step in an XPath is mapped to a tree node. We use single line edges to denote the parent-children relationship ...

Achieving High Output Quality under Limited Resources through Structure-based Spilling in XML Streams

by Mingzhu Wei, Elke A. Rundensteiner, Murali Mani
"... Because of high volumes and unpredictable arrival rates, stream processing systems are not always able to keep up with input data- resulting in buffer overflow and uncontrolled loss of data. To produce eventually complete results, load spilling, which pushes some fractions of data to disks temporari ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
Because of high volumes and unpredictable arrival rates, stream processing systems are not always able to keep up with input data- resulting in buffer overflow and uncontrolled loss of data. To produce eventually complete results, load spilling, which pushes some fractions of data to disks temporarily, is commonly employed in relational stream engines. In this work, we now introduce “structurebased spilling”, a spilling technique customized for XML streams by considering the partial spillage of possibly complex XML elements. Such structure-based spilling brings new challenges. When a path is spilled, multiple paths may be affected. We analyze possible spilling effects on the query paths and how to execute the “reduced ” query to produce partial results. To select the reduced query that maximizes output quality, we develop three optimization strategies, namely, OptR, OptPrune and ToX. We also examine the clean-up stage to guarantee that an entire result set is eventually generated by producing supplementary results. Our experimental study demonstrates that our proposed solutions consistently achieve higher quality results compared to the state-of-the-art techniques. 1.
(Show Context)

Citation Context

...th “let” clause, or queries with FWR expressions nested within “for” clause, can be found in Appendix A. Algebraic Query Processing. We assume the queries have been normalized using the techniques in =-=[14]-=-. Queries are then translated into a plan. Namely, for each binding variable in the “for” clause, a structural join is conducted between the binding variable and the paths in the “return” clause. Path...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University