MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  The Bulk Index Join: A Generic Approach to Processing Non-Equijoins (1999) [9 citations — 4 self]

Download:
Download as a PDF | Download as a PS
by Bernhard Seeger, Peter Widmayer
ICDE
http://www.mathematik.uni-marburg.de/~seeger/papers/bulkIndexJoin/tr14Revised.ps
Add To MetaCart

Abstract:

Efficient join algorithms have been developed for processing different types of non-equijoins like spatial join, band join, temporal join or similarity join. Each of these previously proposed join algorithms is tailor-cut for a specific type of join, and a generalization of these algorithms to other join types is not obvious. We present an efficient algorithm called bulk index join that can be easily applied to a broad class of non-equijoins. Similar to the well-known hash join algorithms, the bulk index join performs in two phases. In the build-phase, an appropriate index structure is created that serves as a partitioning function on the first relation. In the probing-phase, the records of the second relation are probed against the first relation by using the index structure of the build-phase. In order to support both phases efficiently, we adopt a technique recently proposed for bulk loading index structures. We show that this technique can also be exploited for probing the tuples of the second relation in bulk. Similar to the generic bulk loading approach, only a predefined set of routines of the index structure is used for implementing our join algorithm. This set is generally available in tree-based index structures. The so-called band join serves as an example in this paper. We first discuss in detail how to apply our generic approach to the band join. Thereafter, we present a worst-case analysis and experimental results. Moreover, we show in our experiments that the well-known index nested loops join can benefit from performing queries in bulk as it is proposed for the probing-phase of the bulk index join. 1

Citations

1650 R-trees: A dynamic index structure for spatial searching – Guttman - 1984
184 A:Pfeffer, Generalized Search Trees for Database Systems – Hellerstein - 1995
177 The hB-tree: a multiattribute indexing method with good guaranteed performance – Lomet, Salzberg - 1990
170 Spatial query processing in an object-oriented database system – Orenstein - 1986
150 Partition based spatial-merge join – Patel, DeWitt - 1996
131 An asymptotically optimal multiversion B-tree – Becker, Gschwind, et al. - 1996
102 Spatial joins using seeded trees – Lo, Ravishankar - 1994
93 The lsd tree: Spatial access to multidimensional point and non-point objects – Henrich, Six, et al. - 1989
49 Efficient External-Memory Data Structures and Applications – Arge - 1996
46 A generic approach to bulk loading multidimensional index structures – Bercken, Seeger, et al. - 1997
7 Ravishankar: "Spatial Hash-Joins – Lo, V - 1996
5 The LSD tree: Spatial Access to – Henrich, Six, et al. - 1989
4 Ravishankar: "Spatial Joins Using Seeded Trees – Lo, V - 1994
3 Query Evaluation Techniques for Large Databases", Computing Surveys 25(2 – Graefe - 1993
2 R.Agrawal - "High-Dimensional Similarity Joins – Shim, Srikant - 1997
2 Lomet: Grow and Post Index Trees – B - 1991
1 Schneider: "An Evaluation of Non-Equijoin Algorithms", VLDB – DeWitt, Naughton, et al. - 1991
1 Sevcik: "Size Separation Spatial Join – Koudas, C - 1997
1 Lomet: "Grow and Post Index Trees – B - 1991
1 Eich: "Join Processing in Relational Databases", Computing Surveys 24(1 – Mishra, H - 1992
1 Jensen: "Efficient Evaluation of the Valid-Time Natural Join", ICDE – Soo, Snodgrass, et al. - 1994
1 Schneider: “An Evaluation of Non-Equijoin Algorithms”, VLDB – DeWitt, Naughton, et al. - 1991
1 Query Evaluation Techniques for Large Databases”, Computing Surveys 25(2 – Graefe - 1993