35 citations found. Retrieving documents...
V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. International Conference on Data Engineering, pages 502--511, 1999.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents

On Approximation Algorithms for Data Mining Applications - Afrati (2002)   (Correct)

....of the cluster. 2. Cluster the rest of the points in main memory and if a cluster is very tight, then replace the corresponding set of points by its statistics; this is a compression set. 3. Consider merging compression sets. For arbitrary metric spaces, a single pass algorithm is developed in [44] which uses a data structure like an R tree to store clusters and uses summaries of data in a similar manner as in [16] Regarding the question of nding similarity measures according to which to cluster objects, in [33] methods for determining similar regions in tabular data (given in a matrix) ....

....be missed) are avoided by density biased sampling. The goal is to under sample dense regions and over sample sparse regions of the data. A memory ecient single pass algorithm is proposed that approximates density biased sampling. An excellent detailed exposition of algorithms in [51] 16] and [44] can be found in [84] An excellent survey of the algorithms in [48, 2, 11, 13] is given in [41] 7 Mining the web The challenge in mining the web for useful information is the huge size and unstructured placement of data. Search engines aim to search the web for a speci c topic and give the ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. L. Powell, and J. C. French. Clustering large datasets in arbitrary metric spaces. In ICDE, pages 502-511, 1999.


Managing Large Multidimensional Datasets Inside A Database System - Chakrabarti (2001)   (Correct)

....high dimensional feature spaces (HDFS) it must be used in conjunction with a dimensionality reduction technique in order to exploit the correlations in data and hence achieve further scalability. This approach is commonly used in both multimedia retrieval ( 43, 103, 76, 142] and data mining ([47, 8, 49]) applications. The idea is to first reduce the dimensionality of the data and then index the reduced space using a multidimensional index structure [43] Most of the information in the dataset is condensed to a few dimensions (the first few principal components (PCs) by using principal component ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. Proc. of ICDE, 1999.


A Hierarchical Visual Clustering Method Using Implicit.. - Sprenger, Brunella, Gross (2000)   (Correct)

.... literature describes various interesting data clustering approaches including their efficient and refined implementations [5] 8] 11] 12] 16] 17] 24] Because our main interest lies in visualizing clusters, we focus on the problem of clustering large data sets in coordinate space [7], also referred to as the Euclidian space, in which data objects can be represented as vectors . Unlike data sets in a distance space [7] also referred to as the data domain or the arbitrary metric space, the vector representation gives access to various efficiently implemented vector operations ....

....[11] 12] 16] 17] 24] Because our main interest lies in visualizing clusters, we focus on the problem of clustering large data sets in coordinate space [7] also referred to as the Euclidian space, in which data objects can be represented as vectors . Unlike data sets in a distance space [7], also referred to as the data domain or the arbitrary metric space, the vector representation gives access to various efficiently implemented vector operations (e.g. addition, multiplication, dot product, etc. which enables one to calculate simplified representations of complex data subregions ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. "Clustering Large Datasets in Arbitrary Metric Spaces." Technical report, University of Wisconsin-Madison, 1998.


Distances for Spatio-temporal clustering (Extended Abstract) - Nanni   (Correct)

....can be considered as a constraint. iii) A very general solution to the problem of working in non metric spaces is provided by FastMap [3] where each object is mapped into a point of an Euclidean space trying to preserve mutual distances. Such a technique is applied for instance by the Bubble FM [5] clustering algorithm. The main drawback of this approach is due to the approximations induced by the mapping: in some applications (such as clustering) it can force the coexistence of computations over the original space (to obtain precise results) and over the mapped metric space (because the ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. L. Powell, and J. C. French. Clustering large datasets in arbitrary metric spaces. In Proceedings of ICDE


COFE: A Scalable Method for Feature Extraction from.. - Hristescu, Farach-Colton (2000)   (Correct)

....data, do not perform equally well on high dimensional data. Some of the most popular database indexing structures used for similarity querying are the R # tree [5] X tree [4] SS tree [20] SR tree [12] TV tree [16] the Pyramid Technique [2] PK tree [21] BUBBLE and BUBBLE FM [9] are two recently proposed algorithms for clustering datasets in arbitrary distance spaces, based on BIRCH [22] a scalable clustering algorithm. BUBBLE FM uses FastMap to improve scalability. 3 Embedding Methods FastMap. The approach taken in FastMap [7] for embedding points into k dimensional ....

Ganti, V., Ramakrishnan, R., Gehrke, J., Powell, A., French, J.: Clustering large datasets in arbitrary metric spaces. Proc. 15th ICDE Conf. (1999) 502--511


Local Dimensionality Reduction: A New Approach to Indexing .. - Chakrabarti, Mehrotra (2000)   (35 citations)  (Correct)

.... 20 30 dimensions) a simple sequential scan usually performs better at higher dimensionalities [6, 43] To scale to higher dimensionalities, a commonly used approach is dimensionality reduction [20] This technique has been proposed for both multimedia retrieval [17, 36, 27, 42] and data mining ([18, 4, 21]) applications. The idea is to first reduce the dimensionality of the data and then index the reduced space using a multidimensional index structure [17] Most of the information in the dataset is condensed to a few dimensions (the first few principal components (PCs) by using principal component ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. Proc. of ICDE, 1999.


Local Dimensionality Reduction: A New Approach to Indexing .. - Chakrabarti, Mehrotra (2000)   (35 citations)  (Correct)

.... 20 30 dimensions) a simple sequential scan usually performs better at higher dimensionalities [6, 43] To scale to higher dimensionalities, a commonly used approach is dimensionality reduction [20] This technique has been proposed for both multimedia retrieval [17, 36, 27, 42] and data mining ([18, 4, 21]) applications. The idea is to first reduce the dimensionality of the data and then index the reduced space using a multidimensional index structure [17] Most of the information in the dataset is condensed to a few dimensions (the first few principal components (PCs) by using principal component ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. Proc. of ICDE, 1999.


Using Clustering Strategies for Creating Authority Files - James French Allison (2000)   Self-citation (Powell French)   (Correct)

No context found.

Ganti, V., Ramakrishnan, R., Gehrke, J., Powell, A., and French, J. (1999). Clustering Large Datasets in Arbitrary Metric Spaces. In 15th International Conference on Data Engineering (ICDE'99), pages 502--511, Sydney.


A Framework for Measuring Changes in Data Characteristics - Venkatesh Ganti Johannes (1999)   (23 citations)  Self-citation (Ganti Ramakrishnan Gehrke)   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison Powell, and James French. Clustering large datasets in arbitrary metric spaces. In Proceedings of the IEEE International Conference on Data Engineering, Sydney, March 1999.


Clustering Large Datasets in Arbitrary Metric Spaces - Venkatesh Ganti Raghu (1999)   (27 citations)  Self-citation (Ganti Ramakrishnan Gehrke Powell French)   (Correct)

No context found.

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. Technical report, University of Wisconsin-Madison, 1998.


Extending the Vocabulary Available for Cross-Disciplinary.. - French, Martin, Olsen (2002)   Self-citation (French)   (Correct)

....problem that users encounter. The terms in the vocabulary capture semantic nuances and as a result are often related in subtle ways. The figure shows how we are attempting to help users visualize this aspect of the vocabulary. Using techniques for automating the construction of authority files [6, 7, 8, 10] that we developed in collaboration with astronomers studying publication trends CHEMILUMINESCENT CHEMILUMINESCENCE PHOTOLYSIS CHEMILUMINESCENCE NITRIC OXIDE CHEMILUMINESCENCE LUMINOL CHEMILUMINESCENCE ETHENE CHEMILUMINESCENCE ETHYNE CHEMILUM EDDY CORR ....

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering Large Datasets in Arbitrary Metric Spaces. In 15th International Conference on Data Engineering (ICDE'99), pages 502--511, Sydney, March 1999.


Using Context to Assist in Personal File Retrieval - Soules (2006)   (Correct)

No context found.

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. International Conference on Data Engineering, pages 502--511, 1999.


Copyright C - Society Of Photo-Optical (2002)   (Correct)

No context found.

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French, "Clustering large datasets in arbitrary metric spaces," in Proceedings of the Fifteenth International Conference on Data Engineering, pp. 502--511, 1999.


Trends in Data Mining and Knowledge Discovery - Kurgan (2005)   (1 citation)  (Correct)

No context found.

Ganti, V., Ramakrishnan, R., Gehrke, J., Powell, A.L., French, J.C., Clustering Large Datasets in Arbitrary Metric Spaces. Proceedings of the 15th International Conference on Data Engineering (ICDE), pp.502-511, 1999


Analysis and Comparison of Methods and Algorithms.. - Catarci, Ciaccia.. (2000)   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison Powell, and James French. Clustering large datasets in arbitrary metric spaces. In Proc. 15th IEEE Conf. Data Engineering, ICDE, 23-26 March1999.


Fuzzy Segmentation of X-Ray Fluoroscopy Images - Russakoff, al. (2002)   (Correct)

No context found.

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French, "Clustering large datasets in arbitrary metric spaces," in Proceedings of the Fifteenth International Conference on Data Engineering, pp. 502--511, 1999.


Agent-Based Distributed Data Mining: The KDEC Scheme - Klusch, Lodi, Moro   (Correct)

No context found.

Ganti, V., Ramakrishnan, R., Gehrke, J., Powell, A.L., French, J.C.: Clustering large datasets in arbitrary metric spaces. In: Proceedings of the 15th International Conference on Data Engineering (ICDE 1999.


Methods For Clustering, Meta-Querying, Approximate And.. - Ciaccia, al. (2001)   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison Powell, and James French. Clustering large datasets in arbitrary metric spaces. In Proc. 15th IEEE Conf. Data Engineering, ICDE, 23-26 March 1999.


Distributed Clustering Based on Sampling Local Density.. - Klusch, Lodi, Moro (2003)   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison L. Powell, and James C. French. Clustering large datasets in arbitrary metric spaces. In Proceedings of the 15th International Conference on Data Engineering (ICDE 1999), pages 502--511, Sydney, Austrialia, March 1999.


COFE: A Scalable Method for Feature Extraction from.. - Hristescu, Farach-Colton (2000)   (Correct)

No context found.

Ganti, V., Ramakrishnan, R., Gehrke, J., Powell, A., French, J.: Clustering large datasets in arbitrary metric spaces. Proc. 15th ICDE Conf. (1999) 502--511


Applying Data Mining to Data Security - Christina Chung University   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison Powell, and James French. Clustering large datasets in arbitrary metric spaces. Technical Report 250, University of Wisconsin-Madison, 1998.


Analysis and Comparison of Methods and Algorithms.. - Catarci, Cesari.. (2000)   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison Powell, and James French. Clustering large datasets in arbitrary metric spaces. In Proc. 15th IEEE Conf. Data Engineering, ICDE, 23-26 March 1999.


QROCK: A Quick Version of the ROCK Algorithm for.. - Dutta, Mahanta, Pujari   (Correct)

No context found.

. V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell and J. French. Clustering large datasets in arbitrary metric spaces. In the Proceedings of International Conference on Data Engineering, 1999.


Survey Of Clustering Data Mining Techniques - Berkhin (2002)   (18 citations)  (Correct)

No context found.

Ganti, V., Ramakrishnan, R., Gehrke, J., Powell, A., and French, J. Clustering large datasets in arbitrary metric spaces. in Proceedings of the I. th Int'l Conf. on Data Engineering, 1999, 502-511.


A Subquadratic Algorithm for Cluster and Outlier Detection in.. - Chávez   (Correct)

No context found.

Venkatesh Ganti, Raghu Ramakrishnan, Johannes Gehrke, Allison L. Powell, and James C. French. Clustering large datasets in arbitrary metric spaces. In ICDE, pages 502-511, 1999. 12

First 50 documents

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC