(Enter summary)
Abstract: A large number of index structures for high-dimensional
data have been proposed previously. In order to tune and
compare such index structures, it is vital to have efficient
cost prediction techniques for these structures. Previous
techniques either assume uniformity of the data or are not
applicable to high-dimensional data. We propose the use
of sampling to predict the number of accessed index pages
during a query execution. Sampling is independent of the
dimensionality and preserves clusters ... (Update)
Context of citations to this paper: More
...since either the number of histogram regions becomes too large or these regions contain too much empty space and become inaccurate. In [16], sampling is used to overcome this problem. In contrast to this paper, the sample is used to predict the overall query cost of a given...
...distinct experimental results appear in the diagrams with the shapes from Table 4.1. Furthermore, we use the sampling method proposed in [LS01] for our experiments. In this way, we ran 1000 queries with distinct query points for each measurement reading and computed an average...
Cited by: More
Indexing without the Index: Scalable.. - Riedewald, Agrawal, .. (2002)
(Correct)
Accelerating High-dimensional Nearest Neighbor Queries - Lang, Singh (2002)
(Correct)
Efficient Nearest Neighbor Retrieval by Using a Local.. - Balko, Schmitt (2002)
(Correct)
Similar documents (at the sentence level):
70.1%: Performance Prediction of High-Dimensional Index Structures.. - Lang, Singh (2000)
(Correct)
Active bibliography (related documents): More All
0.6: A Framework for Accelerating High-dimensional NN-queries - Lang, Singh (2001)
(Correct)
0.2: Tracking Join and Self-Join Sizes in Limited Storage - Alon, Gibbons, Matias, Szegedy (2002)
(Correct)
0.1: Efficient k Nearest Neighbor Queries on Remote Spatial.. - Liu, Lim, Ng
(Correct)
Similar documents based on text: More All
0.3: Joining Massive High-Dimensional Datasets - Kahveci, Lang, Singh (2003)
(Correct)
0.3: Stardust: Fast Stream Indexing using Incremental Wavelet.. - Bulut, Singh
(Correct)
0.2: Distributed Data Streams Indexing using Content-Based.. - Bulut, Vitenberg, Singh (2004)
(Correct)
Related documents from co-citation: More All
3: tree: An index structure for high-dimensional data (context) - Berchtold, Keim et al. - 1996
3: A model for the prediction of r-tree performance
- Theodoridis, Sellis - 1996
3: Improving the query performance of highdimensional
- Berchtold, Bohm et al.
BibTeX entry: (Update)
Christian A. Lang and Ambuj K. Singh. Modeling high-dimensional index structures using sampling. In Proc. ACM SIGMOD Int. Conf. on Management of Data, 2001. http://citeseer.ist.psu.edu/lang01modeling.html More
@article{ lang01modeling,
author = "Christian A. Lang and Ambuj K. Singh",
title = "Modeling high-dimensional index structures using sampling",
journal = "SIGMOD Record (ACM Special Interest Group on Management of Data)",
volume = "30",
number = "2",
pages = "389--400",
year = "2001",
url = "citeseer.ist.psu.edu/lang01modeling.html" }
Citations (may not include all citations):
765
trees: A dynamic index structure for spatial searching (context) - Guttman - 1984
516
tree: An efficient and robust access method for points and r.. (context) - Beckmann, Kriegel et al. - 1990
204
tree : An index structure for high-dimensional data (context) - Berchtold, Keim et al. - 1996
165
The SR-tree: An index structure for high-dimensional nearest..
- Katayama, Satoh - 1997
162
Similarity indexing with the SS-tree (context) - White, Jain - 1996
147
A quantitative analysis and performance study for similarity..
- Weber, Schek et al. - 1998
121
A cost model for nearest neighbor search in high-dimensional.. (context) - Berchtold, Bohm et al. - 1997
108
Communications of the ACM (context) - Hoare, Find - 1961
106
tree: An efficient access method for similarity search in me.. (context) - Ciaccia, Patella et al. - 1997
103
Practical selectivity estimation through adaptive sampling
- Lipton, Naughton et al. - 1990
101
Beyond uniformity and independence: Analysis of R-trees usin..
- Faloutsos, Kamel - 1994
97
The TV-tree: An index structure for high-dimensional data
- Lin, Jagadish et al. - 1994
88
The hB-tree: A multiattribute indexing method with good guar.. (context) - Lomet, Salzberg - 1990
86
The pyramid technique: Towards breaking the curse of dimensi.. (context) - Berchtold, Bohm et al. - 1998
70
The BANG file: A new kind of grid file (context) - Freeston - 1987
68
Optimal multi-step k-nearest neighbor search
- Seidl, Kriegel - 1998
65
A model for the prediction of R-tree performance
- Theodoridis, Sellis - 1996
55
Similarity indexing: Algorithms and performance
- White, Jain - 1996
50
The hybrid tree: An index structure for high dimensional fea..
- Chakrabarti, Mehrotra - 1999
49
Analysis of object oriented spatial access methods (context) - Faloutsos, Sellis et al. - 1987
47
Selectivity estimation in spatial databases
- Acharya, Poosala et al. - 1999
47
The kdb-tree: A search structure for large multi-dimensional.. (context) - Robinson - 1981
28
Improving the query performance of high-dimensional index st..
- Berchtold, Bohm et al.
25
Accounting for boundary effects in nearest neighbor searchin..
- Arya, Mount et al. - 1995
20
Performance of nearest neighbor queries in R-trees (context) - Papadopoulos, Manolopoulos - 1997
17
Towards estimation error guarantees for distinct values (context) - Charikar, Chaudhuri et al. - 2000
16
Bulk loading the M-tree
- Ciaccia, Patella - 1998
14
Analyzing range queries on spatial data (context) - Jin, An et al. - 2000
14
An approximation based data structure for similarity search
- Weber, Blott - 1997
13
Range selectivity estimation for continuous attributes
- Korn, Johnson et al. - 1999
7
Fixed-precision estimation of join selectivity (context) - Haas, Naughton et al. - 1993
4
Deflating the dimensionality curse using multiple fractal di.. (context) - Korn, Pagel et al. - 2000
1
Performance prediction of high-dimensional index structures ..
- Lang, Singh - 2000
1
multikey file structure (context) - Nievergelt, Hinterberger et al. - 1984
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC