Data Mining Tools
by Irfan Altas, Sergey Bakin, Markus Hegland, Steve Roberts, Berwin A. Turlach
http://www.stats.adelaide.edu.au/people/bturlach/cv/../psfiles/dm.tools.ps.gz
Add To MetaCart
Abstract:
In this paper, we discuss several scalable and parallel discovery and predictive data mining tools. They successfully address many of the computational challenges associated with the analysis of data sets with millions to billions of records and tens to hundreds of attributes. In particular, we describe parallel and scalable methods for additive models, approximate thin plate splines and adaptive regression splines. In addition we describe a method to discover symbolic descriptions of key areas of very large data sets.
Citations
| 643 | Generalized Additive Models – Hastie, Tibshirani - 1990 |
| 221 | Adaptive Control Processes: A Guided Tour – Bellman - 1961 |
| 132 | Subset Selection in Regression – Miller - 2002 |
| 15 | Multivariate adaptive regression splines." The Annals of Statistics – Friedman - 1991 |
| 10 | Finite element thin plate splines for surface fitting – Hegland, Roberts, et al. - 1997 |
| 10 | Evolutionary hot spots data mining - an architecture for exploring for interesting discoveries – Williams - 1999 |
| 7 | Mining the knowledge mine: The hot spots methodology for mining large real world databases – Williams, Huang - 1997 |
| 1 | Generalised Additive Models – McIntosh - 1998 |
| 1 | Practical Spline Approximation", in Topics in Numerical Analysis – Cox - 1982 |
| 1 | Analysis of Motor Vehicle Claims Data using Statistical Data Mining, CSIRO – Chan, Milne, et al. - 1997 |

