BIRCH: an efficient data clustering method for very large databases
Abstract: Finding useful patterns in large datasets has attracted considerable interest recently, and one of the most widely studied problems in this area is the identification of clusters, or densely populated regions, in a multi-dimensional dataset. Prior work does not, adequately address the problem of large datasets and mininization of I/O costs. This paper presents a data clustering method named BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies), and demonstrates that it is... (Update)
Cited by: More
Algorithms for Clustering High Dimensional and - Tao
(Correct)
PENS: An Algorithm for Density-Based Clustering in.. - Mei Li Guanling
(Correct)
Enhancing Data Analysis with Noise Removal - Hui Xiong Member
(Correct)
Similar documents (at the sentence level):
48.4%: BIRCH: An Efficient Data Clustering Method for Very Large .. - Zhang, Ramakrishnan.. (1996)
(Correct)
13.4%: An Experimental Comparison of Several Clustering and.. - Melia (1998)
(Correct)
Active bibliography (related documents): More All
0.0: Fast Algorithms for Projected Clustering - Aggarwal, Procopiuc, Wolf, Yu, al. (1999)
(Correct)
0.0: A Framework for Finding Projected Clusters in High.. - Aggarwal.. (1999)
(Correct)
0.0: Collective, Hierarchical Clustering from Distributed.. - Johnson, Kargupta (1999)
(Correct)
Similar documents based on text: More All
2.0: Interactive Classification of Very Large Datasets with BIRCH - Zhang, Ramakrishnan, Livny
(Correct)
0.4: WaveCluster: A Multi-Resolution Clustering Approach.. - Sheikholeslami.. (1998)
(Correct)
0.4: Clustering Large Datasets in Arbitrary Metric Spaces - Ganti, Ramakrishnan.. (1998)
(Correct)
Related documents from co-citation: More All
73: Efficient and Effective Clustering Methods for Spatial Data Mining
- Ng, Han - 1994
48: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases wi..
- Ester, Kriegel et al. - 1996
29: Finding Groups in Data: an Introduction to Cluster Analysis (context) - Kaufman, Rousseeuw - 1990
BibTeX entry: (Update)
Tian Zhang, Raghu Ramakrishnan, and Miron Livny. BIRCH: An Efficient Data Clustering Method for Very Large Databases. In Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, pages 103--114, Montreal, Canada, 1996. http://citeseer.ist.psu.edu/zhang96birch.html More
@inproceedings{ zhang96birch,
author = "Tian Zhang and Raghu Ramakrishnan and Miron Livny",
title = "{BIRCH}: an efficient data clustering method for very large databases",
pages = "103--114",
year = "1996",
url = "citeseer.ist.psu.edu/zhang96birch.html" }
Citations (may not include all citations):
2133
Pattern Classification and Scene Analysis (context) - Duda, Peter et al. - 1973
350
Kluwer Academic Publishers (context) - Gersho, ray et al. - 1992
349
Knowledge Acquisition via Incremental Conceptual Clustering (context) - Fisher - 1987
222
BIRCH: An Efficient Data Clustering Method for Very Large Da..
- Zhang, Ramakrishnan et al. - 1995
153
AutoClass : A Bayesian Classification System (context) - Cheeseman, Kelly et al. - 1988
52
Knowledge Discovery in Large Spatial Databases: Focusing Tec..
- Ester, Kriegel et al. - 1995
38
Parallel Algorithms for Hierarchical Clustering
- Olson - 1993
33
Clustering Methodologie Exploratory Data Analysis Advances i.. (context) - Dubes, Jain - 1980
1
A Survey of Recent Advances in Hierarchi- cal (context) - Murtagh
1
for Spatial Data Mining (context) - Ng, Hah et al. - 1994
1
Iterative Optimizatio and Simplifica- tion of Hierarchical C.. (context) - Fisher
1
Data - An Introduction to Cluster Analysis (context) - Kaufman, Rousseeuw et al. - 1990
1
Eaperiment with Incremental (context) - Lebowitz, with et al. - 1987
1
Advances in hfforlnation Systems Science (context) - Lee, aalysis et al. - 1981
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.lans.ece.utexas.edu/course/ee380l/1999fall/papers/list2/index.shtml): More
The Variable Selection Problem - George (1999)
(Correct)
Detecting Change in Categorical Data: Mining Contrast Sets - Bay, Pazzani (1999)
(Correct)
Interactive Data Analysis: The Control Project - Ata Analysis Is (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC