See this document in CiteSeerX!

Clustering Large Datasets in Arbitrary Metric Spaces (1999)  (Make Corrections)  (35 citations)
Venkatesh Ganti Raghu Ramakrishnan Johannes Gehrke Computer Sciences...
ICDE



  Home/Search   Context   Related

 
View or download:
cornell.edu/johann...999clustering.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cornell.edu/johann...publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Clustering partitions a collection of objects into groups called clusters, such that similar objects fall into the same group. Similarity between objects is defined by a distance function satisfying the triangle inequality; this distance function along with the collection of objects describes a distance space. In a distance space, the only operation possible on data objects is the computation of distance between them. All scalable algorithms in the literature assume a special type of distance... (Update)

Cited by:   More
Using Context to Assist in Personal File Retrieval - Soules (2006)   (Correct)
Using Clustering Strategies for Creating Authority Files - James French Allison (2000)   (Correct)
Copyright C - Society Of Photo-Optical (2002)   (Correct)

Active bibliography (related documents):   More   All
2.0:   Clustering Large Datasets in Arbitrary Metric Spaces - Ganti, Ramakrishnan.. (1998)   (Correct)
1.1:   Automated Modeling and Nonlinear Axis Scaling - Leejay Wu (2005)   (Correct)
0.3:   Flexible Clustering by Tendency in High Dimensional Space - Liu, Wang (2003)   (Correct)

Similar documents based on text:   More   All
0.5:   A Framework for Measuring Differences in Data.. - Ganti, Ramakrishnan.. (1999)   (Correct)
0.5:   BOAT - Optimistic Decision Tree Construction - Gehrke, Ganti, Ramakrishnan, Loh (1999)   (Correct)
0.5:   Mining Very Large Databases - Ganti, Gehrke, Ramakrishnan (1999)   (Correct)

Related documents from co-citation:   More   All
13:   CURE: An efficient clustering algorithm for large databases - Guha, Rastogi et al. - 1998
13:   Data-Mining and Visualization of Traditional and Multimedia Datasets (context) - Faloutsos, Lin et al. - 1995
12:   Automatic subspace clustering of high dimensional data for data mining applicati.. - Agrawal, Gehrke et al. - 1998

BibTeX entry:   (Update)

V. Ganti, R. Ramakrishnan, J. Gehrke, A. Powell, and J. French. Clustering large datasets in arbitrary metric spaces. Technical report, University of Wisconsin-Madison, 1998. http://citeseer.ist.psu.edu/ganti99clustering.html   More

@inproceedings{ ganti99clustering,
    author = "Venkatesh Ganti and Raghu Ramakrishnan and Johannes Gehrke and Allison L. Powell and James C. French",
    title = "Clustering Large Datasets in Arbitrary Metric Spaces",
    booktitle = "{ICDE}",
    pages = "502-511",
    year = "1999",
    url = "citeseer.ist.psu.edu/ganti99clustering.html" }
Citations (may not include all citations):
2133   Pattern Classification and Scene analysis (context) - DudaandP - 1973
516   tree: an efficient and robust access method for points and r.. (context) - Beckmann, Kriegel et al. - 1990
475   Automatic subspace clustering of high dimensional data for d.. - Agrawal, Gehrke et al. - 1998
349   Knowledge acquisition via incremental conceptual clustering (context) - Fisher - 1987
302   datamining and visualization of traditional and multimedia d.. (context) - Faloutsos, Lin et al. - 1995
242   Efficient and effective clustering methods for spatial data .. - Ng, Han - 1994
210   A densitybased algorithm for discovering clusters in large s.. - Ester, Kriegel et al. - 1995
133   Cure: An efficient clustering algorithm for large databases - Guha, Rastogi et al. - 1998
111   Scaling clustering algorithms to large databases - Bradley, Fayyad et al. - 1998
106   tree: An efficient access method for similarity search in me.. (context) - Ciaccia, Patella et al. - 1997
98   Multidimensional scaling (context) - Kruskal, Wish - 1978
70   The analysis of proximities: Multidimensional scaling with a.. (context) - Shepard - 1962
47   Iterative optimization and simplification of hierarchical cl.. - Fisher - 1995
46   A database interface for clustering in large spatial databas.. (context) - Ester, Kriegel et al. - 1995
39   Multidimensional scaling: history (context) - Young - 1987
35   Clustering large datasets in arbitrary metric spaces - Ganti, Ramakrishnan et al. - 1998
34   theory and method (context) - Torgerson - 1952
33   Clustering methodologies in exploratory data analysis (context) - Dubes, Jain - 1980
14   Birch: An efficient data clustering method for large databas.. (context) - Zhang, Ramakrishnan et al. - 1996
10   Automating the Construction of Authority Files in Digital Li.. - French, Powell et al. - 1997
8   Authority Control: An Eighty-Year Review (context) - Auld - 1982
8   Finding Groups in Data - An Introduction to Cluster Analysis (context) - Kaufmann, Rousseuw - 1990
8   Applications of Approximate Word Matching in Information Ret.. - French, Powell et al. - 1997
3   A survey of recent hierarchical clustering algorithms (context) - Murtagh - 1983
2   Focussing techniques for efficient class indentification (context) - Ester, Kriegel et al. - 1995
2   A hybrid clustering method for identifying highdensity clust.. (context) - Wong - 1982



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.cornell.edu/johannes/publications.html):   More
A Framework for Measuring Changes in Data Characteristics - Ganti, Gehrke.. (1998)   (Correct)
Fast Scheduling of Periodic Tasks on Multiple Resources - Baruah, Gehrke, Plaxton   (Correct)
RainForest - A Framework for Fast Decision Tree.. - Gehrke, Ramakrishnan.. (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC