MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  1

Download:
Download as a PDF | Download as a PS
by A. Ragel, J. L. Bosson
http://users.info.unicaen.fr/~bruno/papers/sci99_2c.ps
Add To MetaCart

Abstract:

Many analysis tasks have to deal with missing values and some of them have developed specific and internal treatments to guess them. In this paper we present the use of a new method, called MVC (Missing Values Completion), for this question: MVC is based on data preprocessing which gives prominence to understandable associations and gives the user a central part. Such qualities will allow to use it for the data cleaning step of the Knowledge Discovery in Databases process. The efficiency of this method rests on the Robust Association Rules algorithm that we have proposed. This algorithm extends the concept of association rules for databases with multiple missing values. We give some examples of the use of MVC in a real world data set (in medicine), highlighting typical use of this method.

Citations

2552 Induction of decision trees – Quinlan - 1986
1505 Mining Association rules between sets of items in large databases – Agrawal, Imielinski, et al. - 1993
524 C4.5: Programs for – Quinlan - 1993
468 Statistical analysis with missing data – Little, Rubin - 2002
459 From Data Mining to Knowledge Discovery: An Overview – Fayyad, Piatetsky-Shapiro, et al. - 1996
285 Sampling large databases for association rules – Toivonen - 1996
263 Mining quantitative association rules in large relational tables – Srikant, Agrawal - 1996
217 Verkamo. Fast discovery of association rules – Agrawal, Mannila, et al. - 1996
131 Data mining using MLC++: a machine learning library in – Kohavi, Sommerfield, et al. - 1997
114 Overfitting avoidance as bias – Schaffer - 1993
95 Unknown attribute values in induction – Quinlan - 1989
62 An information theoretic approach to rule induction from databases – Smyth, Goodman - 1992
45 R.: Partial classification using association rules – Ali, Manganaris, et al. - 1997
38 Bayesian classification – Cheeseman, Self, et al. - 1988
19 MVC: A preprocessing method to deal with missing values – Ragel, Cremilleux - 1999
18 Imputation of missing data using machine learning techniques – Lakshminarayan, Harp, et al. - 1996
15 Techniques for dealing with missing values in classification. Second Intl. Symp. Intelligent Data Analysis – Liu, White, et al. - 1997
12 Treatment of missing values for association rules – Ragel, Cremilleux - 1998
11 A theoretical framework for decision trees in uncertain domains: Application to medical data sets – Cremilleux, Robert - 1997
7 A pruning method for decision trees in uncertain domains: Applications in medicine – Cr'emilleux, Robert - 1996
5 Classificationand Regression Trees. Wadsworth International Group – Breiman, Friedman, et al. - 1984
1 Missing Data Analysis 1.0 The Solution for Missing Values – for