Download:
|
by A. Ragel, J. L. Bosson
http://users.info.unicaen.fr/~bruno/papers/sci99_2c.ps
Add To MetaCart
Abstract:
Many analysis tasks have to deal with missing values and some of them have developed specific and internal treatments to guess them. In this paper we present the use of a new method, called MVC (Missing Values Completion), for this question: MVC is based on data preprocessing which gives prominence to understandable associations and gives the user a central part. Such qualities will allow to use it for the data cleaning step of the Knowledge Discovery in Databases process. The efficiency of this method rests on the Robust Association Rules algorithm that we have proposed. This algorithm extends the concept of association rules for databases with multiple missing values. We give some examples of the use of MVC in a real world data set (in medicine), highlighting typical use of this method.
Citations
|
2552
|
Induction of decision trees
– Quinlan
- 1986
|
|
1505
|
Mining Association rules between sets of items in large databases
– Agrawal, Imielinski, et al.
- 1993
|
|
524
|
C4.5: Programs for
– Quinlan
- 1993
|
|
468
|
Statistical analysis with missing data
– Little, Rubin
- 2002
|
|
459
|
From Data Mining to Knowledge Discovery: An Overview
– Fayyad, Piatetsky-Shapiro, et al.
- 1996
|
|
285
|
Sampling large databases for association rules
– Toivonen
- 1996
|
|
263
|
Mining quantitative association rules in large relational tables
– Srikant, Agrawal
- 1996
|
|
217
|
Verkamo. Fast discovery of association rules
– Agrawal, Mannila, et al.
- 1996
|
|
131
|
Data mining using MLC++: a machine learning library in
– Kohavi, Sommerfield, et al.
- 1997
|
|
114
|
Overfitting avoidance as bias
– Schaffer
- 1993
|
|
95
|
Unknown attribute values in induction
– Quinlan
- 1989
|
|
62
|
An information theoretic approach to rule induction from databases
– Smyth, Goodman
- 1992
|
|
45
|
R.: Partial classification using association rules
– Ali, Manganaris, et al.
- 1997
|
|
38
|
Bayesian classification
– Cheeseman, Self, et al.
- 1988
|
|
19
|
MVC: A preprocessing method to deal with missing values
– Ragel, Cremilleux
- 1999
|
|
18
|
Imputation of missing data using machine learning techniques
– Lakshminarayan, Harp, et al.
- 1996
|
|
15
|
Techniques for dealing with missing values in classification. Second Intl. Symp. Intelligent Data Analysis
– Liu, White, et al.
- 1997
|
|
12
|
Treatment of missing values for association rules
– Ragel, Cremilleux
- 1998
|
|
11
|
A theoretical framework for decision trees in uncertain domains: Application to medical data sets
– Cremilleux, Robert
- 1997
|
|
7
|
A pruning method for decision trees in uncertain domains: Applications in medicine
– Cr'emilleux, Robert
- 1996
|
|
5
|
Classificationand Regression Trees. Wadsworth International Group
– Breiman, Friedman, et al.
- 1984
|
|
1
|
Missing Data Analysis 1.0 The Solution for Missing Values
– for
|