MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  SETS USING P-TREES

Download:
Download as a PDF
unknown authors
http://www.cs.ndsu.nodak.edu/%7Eadenton/thesis/chapter7_Final.pdf
Add To MetaCart

Abstract:

Hierarchical clustering methods have attracted much attention by giving the user a maximum amount of flexibility. Rather than requiring parameter choices to be predetermined, the result represents all possible levels of granularity. In this paper, a hierarchical method is introduced that is fundamentally related to partitioning methods, such as k-medoids and k-means, as well as to a density based method, center-defined DENCLUE. It is superior to both k-means and k-medoids in its reduction of outlier influence. Nevertheless, it avoids both the time complexity of some partition-based algorithms and the storage requirements of density-based ones. An implementation that is particularly suited to spatial, stream, and multimedia data using P-trees for efficient data storage and access is presented. Many clustering algorithms require choosing parameters that will determine the granularity of the result. Partitioning methods such as the k-means and k-medoids [1] algorithms require that the number of clusters, k, be specified. Density-based methods, e.g., DENCLUE [2] and DBScan [3], use input parameters that relate directly to cluster size

Citations

702 Finding Groups in Data: an Introduction to Cluster Analysis – Kaufman, Rousseuw - 1990
187 Scaling clustering algorithms to large databases – Bradley, Fayyad, et al. - 1998
128 An efficient approach to clustering in large multimedia data sets with noise – Hinneburg, Keim - 1998
123 An efficient data clustering method for very large databases – Zhang, Ramakrishnan, et al. - 1996
81 Knowledge discovery in large spatial databases: Focusing techniques for efficient class identification – Ester, Kriegel, et al. - 1995
23 K-Nearest Neighbor Classification of Spatial Data Streams using – Khan, Ding, et al. - 2002
14 The P-tree Algebra – Ding, Khan, et al. - 2002
10 Efficient and effective clustering methods for spatial data mining – Ng, Han - 1994
8 Association Rule Mining on Remotely Sensed Images Using P-trees – Ding, Ding, et al. - 2002
8 On Mining Satellite and Other Remotely Sensed – Ding, Perrizo, et al. - 2001
4 J.Sander, "Spatial Data Mining: A Database Approach – Ester, Kriegel - 1997
4 1+1>2: Merging Distance and Density Based Clustering – Dash, Liu, et al. - 2001
3 Implementation of Peano Count Tree and Fast P-tree Algebra – Roy - 2001