MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Data Mining Tools

Download:
Download as a PDF | Download as a PS
by Irfan Altas, Sergey Bakin, Markus Hegland, Steve Roberts, Berwin A. Turlach
http://www.stats.adelaide.edu.au/people/bturlach/cv/../psfiles/dm.tools.ps.gz
Add To MetaCart

Abstract:

In this paper, we discuss several scalable and parallel discovery and predictive data mining tools. They successfully address many of the computational challenges associated with the analysis of data sets with millions to billions of records and tens to hundreds of attributes. In particular, we describe parallel and scalable methods for additive models, approximate thin plate splines and adaptive regression splines. In addition we describe a method to discover symbolic descriptions of key areas of very large data sets.

Citations

643 Generalized Additive Models – Hastie, Tibshirani - 1990
221 Adaptive Control Processes: A Guided Tour – Bellman - 1961
132 Subset Selection in Regression – Miller - 2002
15 Multivariate adaptive regression splines." The Annals of Statistics – Friedman - 1991
10 Finite element thin plate splines for surface fitting – Hegland, Roberts, et al. - 1997
10 Evolutionary hot spots data mining - an architecture for exploring for interesting discoveries – Williams - 1999
7 Mining the knowledge mine: The hot spots methodology for mining large real world databases – Williams, Huang - 1997
1 Generalised Additive Models – McIntosh - 1998
1 Practical Spline Approximation", in Topics in Numerical Analysis – Cox - 1982
1 Analysis of Motor Vehicle Claims Data using Statistical Data Mining, CSIRO – Chan, Milne, et al. - 1997