(Enter summary)
Abstract: Clustering, in data mining, is useful for discovering groups and identifying interesting distributions in the underlying data. Traditional clustering algorithms either favor clusters with spherical shapes and similar sizes, or are very fragile in the presence of outliers. We propose a new clustering algorithm called CURE that is more robust to outliers, and identifies clusters having non-spherical shapes and wide variances in size. CURE achieves this by representing each cluster by a certain... (Update)
Similar documents based on text: More All
0.6: CURE: An Efficient Clustering Algorithm for Large Databases - Guha, Rastogi, Shim (1998)
(Correct)
0.5: ROCK: A Robust Clustering Algorithm for Categorical Attributes - Guha, Rastogi, Shim (2000)
(Correct)
0.4: REHIST: Relative Error Histogram Construction Algorithms - Guha, Shim, Woo (2004)
(Correct)
BibTeX entry: (Update)
S. Guha, R. Rastogi, and K. Shim. CURE: An efficient clustering algorithm for large databases. In Proceedings of ACM SIGMOD International Conference on Management of Data, pages 73--84, New York, 1998. ACM. http://citeseer.ist.psu.edu/article/guha98cure.html More
@inproceedings{ guha98cure,
author = "Sudipto Guha and Rajeev Rastogi and Kyuseok Shim",
title = "{CURE}: an efficient clustering algorithm for large databases",
pages = "73--84",
month = {June},
booktitle = {ACM SIGMOD International Conference on Management of Data},
year = "1998",
url = {citeseer.ist.psu.edu/article/guha98cure.html} }
Citations (may not include all citations):
3972
Introduction to Algorithms (context) - Cormen, Leiserson et al. - 1990
837
Cambridge University Press (context) - Motwani, Raghavan - 1995
805
Algorithms for Clustering Data (context) - Jain, Dubes - 1988
666
The Design and Analysis of Spatial Data Structures (context) - Samet - 1989
666
The Design and Analysis of Spatial Data Structures (context) - Samet - 1990
516
tree: an efficient and robust access method for points and r.. (context) - Beckmann, Kriegel et al. - 1990
247
An algorithm for finding best matches in logarithmic expecte.. (context) - Friedman, Bentley et al. - 1977
242
Efficient and effective clustering methods for spatial data ..
- Ng, Han - 1994
233
tree: a dynamic index for multidimensional objects
- Sellis, Roussopoulos et al. - 1987
222
Birch: An efficient data clustering method for very large da..
- Zhang, Ramakrishnan et al. - 1996
210
A density-based algorithm for discovering clusters in large ..
- Ester, Kriegel et al. - 1996
189
Sampling large databases for association rules
- Toivonen - 1996
46
A database interface for clustering in large spatial databas.. (context) - Ester, Kriegel et al. - 1995
38
Parallel algorithms for hierarchical clustering
- Olson - 1993
17
ACM Transactions on Mathematical Software (context) - Vitter, with et al. - 1985
[Article contains additional citations not shown here]
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.bell-labs.com/projects/serendip/): More
PUBLIC: A Decision Tree Classifier that Integrates Building and.. - Rastogi (1998)
(Correct)
Mining Optimized Association Rules with Categorical and.. - Rastogi, Shim (1998)
(Correct)
Mining Optimized Support Rules for Numeric Attributes - Rastogi, Shim (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC