Download:
|
by Shiby Thomas, Sharma Chakravarthy
In Proc. DaWaK
ftp://ftp.cise.ufl.edu/cis/tech-reports/tr98/tr98-017.ps.gz
Add To MetaCart
Abstract:
The rapid growth in data warehousing technology and the tremendous drop in storage prices has enabled collection of large volumes of data in an ever increasing number of business organizations. One of the greatest challenges today is how to turn these rapidly expanding data stores into nuggets of actionable knowledge. The state-of-the-art data mining tools integrate loosely with data stored in DBMSs, typically through a cursor interface. In this paper, we consider several formulations of association rule mining using SQL-92 queries primarily from a performance perspective. We study the performance of dierent join orders and join methods for executing the various queries. Based on the performance study we identify certain optimizations and develop the Set-oriented Apriori approach. We analyze the cost of the dierent execution plans which provides a basis to incorporate the semantics of association rule mining into future query optimizers. The cost expressions we develop are in terms of the relational operators and hence can be used as \addons " to the current optimizers. This work is an initial step towards developing \SQL-aware" mining algorithms and exploring the enhancements to current relational DBMSs to make them \mining-aware " thereby bridging the gap between the two. 1
Citations
|
1638
|
Fast algorithms for mining association rules
– Agrawal, Srikant
- 1994
|
|
1472
|
Mining Association Rules Between Sets of Items in Large Databases
– Agrawal, Imielinski, et al.
- 1993
|
|
182
|
Scaling Clustering Algorithms to Large Databases
– Bradley, Fayyad, et al.
- 1998
|
|
174
|
Mannila H.: A Database Perspective on Knowledge Discovery
– Imielinski
- 1996
|
|
123
|
Psaila G.: A New SQL-like Operator for Mining Association Rules
– Ceri, Meo
- 1996
|
|
97
|
Integrating association rule mining with relational database systems: Alternatives and implications
– Sarawagi, Thomas, et al.
|
|
86
|
DMQL: A data mining query language for relational database
– Han
|
|
63
|
Set-oriented mining of association rules
– Houtsma, Swami
- 1993
|
|
60
|
Scalable Techniques for Mining Causal Structures
– Silverstein, Brin, et al.
|
|
51
|
The Quest data mining system
– Agrawal, Mehta, et al.
- 1996
|
|
49
|
Developing tightly-coupled Data Mining Applications on a Relational Database System
– Agrawal, Shim
- 1995
|
|
31
|
Data mining and database systems: Where is the intersection
– Chaudhuri
- 1998
|
|
18
|
An Eective Hash Based Algorithm for Mining Association Rules
– Park, Chen, et al.
- 1995
|
|
14
|
Query Flocks: A generalization of association rule mining
– Tsur, Ullman, et al.
- 1998
|
|
9
|
Abdulghani A., Discovery board application programming interface and query lan- guage for database mining
– Imielinski, Virmani
- 1996
|
|
7
|
Houtsma and Arun Swami. Set-oriented mining of association rules
– Maurice
- 1995
|