## Indexing Large Metric Spaces for Similarity Search Queries (1999)

Citations: 92 (0 self)

### Citations

2713 | R-trees: A dynamic index structure for spatial searching - Guttman - 1984 |

1329 | The Design and Analysis of Spatial Data Structures - Samet - 1989 |

Citation context: ...revious work. In section 3.3, vp-tree structure is discussed in more detail. 3.1 Distance Transformations to Euclidean Spaces For low-dimensional Euclidean domains, the conventional index structures ([Sam89]) such as R-trees (and its variations) [Gut84, SRF87, BKSS90] can be used effectively to answer similarity queries. In such cases, a near neighbor search query would ask for all the objects in (or tha...

1243 | The R*-tree: An Efficient and Robust Access Method for Points and Rectangles - Beckmann, Kriegel, et al. - 1990 |

650 | M-tree: An efficient access method for similarity search in metric spaces - Ciaccia, Patella, et al. - 1997 |

Citation context: ...is more expensive than the vp-tree, but its search algorithm makes less distance computations in the experiments for different data sets. More recently, Ciaccia et al. introduced the M-tree structure [CPZ97], which differs from the other distance-based index structures by being able to handle dynamic operations. The M-tree is constructed bottom-up (in contrast to the other structures such as vp-tree, GNA...

585 | The X-tree: An index structure for high-dimensional data - Berchtold, Keim, et al. - 1996 |

Citation context: ...sionality of the Euclidean data sets, or the choice of Euclidean distance L2 metric is not of particular significance here. There are many other indexing techniques such as TV-trees [LJF94], X-trees [BKP96] that are particularly designed for high dimensional Euclidean data. For general metric spaces, we only use the pairwise distances between objects in the data space for both index construction and sea...

585 | Nearest neighbor queries - Roussopoulos, Kelley, et al. - 1995 |

Citation context: ...ere the center is the query object and the radius is the tolerance factor r. There are some special techniques for other forms of similarity queries, such as nearest neighbor queries. For example, in [RKV95], some heuristics are introduced to efficiently search the R-tree structure to answer nearest neighbor queries. However, the conventional spatial structures stop being efficient if the dimensionality ...

526 | Fast subsequence matching in time-series databases - Faloutsos, Ranganathan, et al. - 1994 |

341 | The R+-Tree: A Dynamic Index for MultiDimensional Objects - Sellis, Roussopoulos, et al. - 1987 |

215 | The TV-tree: An index structure for high-dimensional data - Lin, Jagadish, et al. - 1994 |

Citation context: ... Note that dimensionality of the Euclidean data sets, or the choice of Euclidean distance L2 metric is not of particular significance here. There are many other indexing techniques such as TV-trees [LJF94], X-trees [BKP96] that are particularly designed for high dimensional Euclidean data. For general metric spaces, we only use the pairwise distances between objects in the data space for both index con...

214 | Near neighbor search in large metric spaces - Brin - 1995 |

Citation context: ... using a single reference point for all nodes in the same level is an interesting idea. We use a similar technique in the design of mvp-trees. The GNAT (Geometric Near-Neighbor Access Tree) structure [Bri95] is another mechanism for answering near neighbor queries. A k number of split points are chosen at the top level. Each one of the remaining points are associated with one of the k data sets (one for ...

204 | Satisfying general proximity/similarity queries with metric trees. Information Processing Letters - Uhlmann - 1991 |

Citation context: ...tage point tree) as a general solution to the problem of answering similarity based queries efficiently for high-dimensional metric spaces. The mvp-tree is similar to the vp-tree (vantage point tree) [Uhl91] in the sense that both structures use relative distances from a vantage point to partition the domain space. In vp-trees, at every node of the tree, a vantage point is chosen among the data points, a...

202 | A Cost Model For Nearest Neighbor Search - Berchtold, Keim - 1997 |

138 | Some approaches to best-match file searching - Burkhard, Keller - 1973 |

Citation context: ...exing techniques in section 3.2. 3.2 Distance-Based Index Structures There are a number of research results on efficiently answering similarity search queries in different contexts. Burkhard & Keller [BK73] suggested the use of three different techniques for the problem of finding best matching (closest) key words in a file to a given query key. They employ a metric distance function on the key space wh...

136 | Fast Similarity Search - Agrawal, Lin, et al. - 1995 |

128 | Distance-based indexing for high-dimensional metric spaces - Bozkaya, Ozsoyoglu - 1997 |

Citation context: ...ompared on a pixel by pixel basis by calculating the distance between two images as the accumulation of the differences between the intensities of their pixels. 1 A preliminary version of this paper ([BO97]) appeared in ACM-SIGMOD 1997. 2 This research is partially supported by the National Science Foundation grant IRI 92-24660, and the National Science Foundation FAW award IRI90 2 In all the applicatio...

74 | Content-based image indexing - Chiueh - 1994 |

Citation context: ...it is also possible to generalize it to a multi-way tree for larger fanouts. In [Yia93], Yiannilos provided some analytical results on vp-trees, and suggested ways to pick better vantage points. In [Chi94], Chiueh proposed an algorithm for the vp-tree structure to answer nearest neighbor queries. We talk about vp-trees in detail in section 3.3. The gh-tree (generalized hyperplane tree) structure was al...

60 | A cost model for similarity queries in metric spaces - Ciaccia, Patella, et al. - 1998 |

Citation context: ...ing objects in their parent nodes. Experimental results for M-trees are provided in [CPZ97, CP98, CPZ98a, CPZ98b]. An analytical cost model based on distance distribution of the objects is derived in [CPZ98b] for M-trees. Evaluation of complex similarity queries (with multiple similarity predicates) using M-trees are discussed in [CPZ98a]. [CP98] provides an algorithm for creating an M-tree from a given s...

57 | New techniques for best-match retrieval - Shasha, Wang - 1990 |

Citation context: ...rch. Note that keys may appear in more than one clique; so the aim is to select the representative keys to be the ones that appear in as many cliques as possible. In another approach, Shasha and Wang [SW90] suggested using pre-computed distances between data elements to efficiently answer similarity search queries. The aim is to minimize the number of distance computations as much as possible, as they a...

35 | Bulk loading the M-tree - Ciaccia, Patella - 1998 |

Citation context: ...sed on distance distribution of the objects is derived in [CPZ98b] for M-trees. Evaluation of complex similarity queries (with multiple similarity predicates) using M-trees are discussed in [CPZ98a]. [CP98] provides an algorithm for creating an M-tree from a given set of objects via bulkloading. We provide some experimental results with M-trees in Section 8.2. 3.3 Vantage point tree structure Let us bri...

27 | Processing complex similarity queries with distance-based access methods - Ciaccia, Patella, et al. - 1998 |

Citation context: ...t model based on distance distribution of the objects is derived in [CPZ98b] for M-trees. Evaluation of complex similarity queries (with multiple similarity predicates) using M-trees are discussed in [CPZ98a]. [CP98] provides an algorithm for creating an M-tree from a given set of objects via bulkloading. We provide some experimental results with M-trees in Section 8.2. 3.3 Vantage point tree structure Le...

16 | Approximate Matching with High Dimensionality R-trees, M.Sc. scholarly paper - Otterman - 1992 |

Citation context: ...ed to efficiently search the R-tree structure to answer nearest neighbor queries. However, the conventional spatial structures stop being efficient if the dimensionality is high. Experimental results [Ott92] show that R-trees become inefficient for n-dimensional spaces where n is greater than 20. It is possible to make use of conventional spatial index structures for some high-dimensional Euclidean domai...

14 | Efficient and effective querying by image content - Faloutsos, et al. - 1994 |

5 | Data Structures and Algorithms for Nearest Neighbor Search in General Metric Spaces - Yiannilos - 1993 |

Citation context: ...low that node, which are constructed in the same way recursively. Although the vp-tree was introduced as a binary tree, it is also possible to generalize it to a multi-way tree for larger fanouts. In [Yia93], Yiannilos provided some analytical results on vp-trees, and suggested ways to pick better vantage points. In [Chi94], Chiueh proposed an algorithm for the vp-tree structure to answer nearest neigh...

1 | A Cost Model for Similarity Queries in Metric Spaces, to appear - Ciaccia, Patella, et al. - 1998 |