MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Mining a commercial banking data set: The SaintEtiQ approach [1 citations — 1 self]

Download:
pdf
by R. Saint-paul, G. Raschia, N. Mouaddib
In Proc. of the IEEE Int. Conf. on Systems, Man & Cybernetics (SMC’2002), Hammamet
http://www.simulation.fr/seq/publis/2002-smc.pdf
Add To MetaCart

Abstract:

Abstract—In this paper, an original approach to database summarization is applied to a massive data set provided by a bank marketing department. The overall summarization process is concerned with the knowledge discovery paradigm, even if purposes of the approach are quite different from those of KDD. The summarization process is intended to find general representations of data overall the database, whereas KDD processes deal with knowledge nugget extraction from data, without prioritizing the cover property. The summarization process is based on an incremental and hierarchical conceptual clustering algorithm, building a summary hierarchy from database records. Levels of the hierarchy provides some views with different granularities over the entire database. Each summary describes part of the data set. Furthermore, the fuzzy set-based representation of summaries allows the system to ensure a strong robustness and accuracy regarding the wellknown threshold effect of the crisp clustering methods. The summarization process is also supported by some background knowledge, providing a user-friendly vocabulary to describe summaries with a high-level semantics. Even though our method is not immediately concerned with computational performance, its low time and memory requirements makes it appropriate for large real-life databases. The scalability of the process is demonstrated through the application on a banking data set. The produced summary hierarchy not only provides human-friendly views over the all database, but can also be queried in a knowledge discovery perspective.

Citations

1486 Fuzzy sets – Zadeh - 1965
527 Knowledge acquisition via incremental conceptual clustering – Fisher - 1987
382 The Concept of a Linguistic Variable and its Application to Approximate Reasoning – Zadeh - 1975
207 Learning from observation: Conceptual Clustering – Michalski, Stepp - 1983
73 A new approach to clustering – Ruspini - 1969
11 Fuzzy sets in data summaries - Outline of a new approach – Dubois, Prade - 2000
8 A new approach to the summarization of data – Yager - 1982
8 Data summarization in relational databases through fuzzy dependencies – Cubero, Medina, et al. - 1999
6 On data summaries based on gradual rules – Bosc, Pivert, et al. - 1999
4 Fuzzy query language for hypothesis evaluation – Rasmussen, Yager - 1997
4 Fuzzy logic for linguistic summarization of databases – Kacprzyk - 1999
3 Extended functional dependencies as a basis for linguistic summaries – Bosc, Lietard, et al. - 1998
3 A fuzzy-based conceptual KDD approach: the SaintEtiQ system – Raschia, Mouaddib
3 Fuzzy set-based representation of domain knowledge and concepts for database summarization – Raschia, Mouaddib
3 A Fuzzy-Based Heuristic Measure Evaluating Quality of a Concept Partition: Application to SaintEtiQ, a Database Summarization System – RASCHIA, MOUADDIB
2 Using fuzzy labels as background knowledge for linguistic summarization of databases – Raschia, Mouaddib - 2001
1 The three semantics of fuzzy sets,” Int – Dubois, Prade - 1997