MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Performance evaluation and optimization of join queries for association rule mining (1999) [11 citations — 1 self]

Download:
Download as a PDF | Download as a PS
by Shiby Thomas, Sharma Chakravarthy
In Proc. DaWaK
ftp://ftp.cise.ufl.edu/cis/tech-reports/tr98/tr98-017.ps.gz
Add To MetaCart

Abstract:

The rapid growth in data warehousing technology and the tremendous drop in storage prices has enabled collection of large volumes of data in an ever increasing number of business organizations. One of the greatest challenges today is how to turn these rapidly expanding data stores into nuggets of actionable knowledge. The state-of-the-art data mining tools integrate loosely with data stored in DBMSs, typically through a cursor interface. In this paper, we consider several formulations of association rule mining using SQL-92 queries primarily from a performance perspective. We study the performance of dierent join orders and join methods for executing the various queries. Based on the performance study we identify certain optimizations and develop the Set-oriented Apriori approach. We analyze the cost of the dierent execution plans which provides a basis to incorporate the semantics of association rule mining into future query optimizers. The cost expressions we develop are in terms of the relational operators and hence can be used as \addons " to the current optimizers. This work is an initial step towards developing \SQL-aware" mining algorithms and exploring the enhancements to current relational DBMSs to make them \mining-aware " thereby bridging the gap between the two. 1

Citations

1638 Fast algorithms for mining association rules – Agrawal, Srikant - 1994
1472 Mining Association Rules Between Sets of Items in Large Databases – Agrawal, Imielinski, et al. - 1993
182 Scaling Clustering Algorithms to Large Databases – Bradley, Fayyad, et al. - 1998
174 Mannila H.: A Database Perspective on Knowledge Discovery – Imielinski - 1996
123 Psaila G.: A New SQL-like Operator for Mining Association Rules – Ceri, Meo - 1996
97 Integrating association rule mining with relational database systems: Alternatives and implications – Sarawagi, Thomas, et al.
86 DMQL: A data mining query language for relational database – Han
63 Set-oriented mining of association rules – Houtsma, Swami - 1993
60 Scalable Techniques for Mining Causal Structures – Silverstein, Brin, et al.
51 The Quest data mining system – Agrawal, Mehta, et al. - 1996
49 Developing tightly-coupled Data Mining Applications on a Relational Database System – Agrawal, Shim - 1995
31 Data mining and database systems: Where is the intersection – Chaudhuri - 1998
18 An Eective Hash Based Algorithm for Mining Association Rules – Park, Chen, et al. - 1995
14 Query Flocks: A generalization of association rule mining – Tsur, Ullman, et al. - 1998
9 Abdulghani A., Discovery board application programming interface and query lan- guage for database mining – Imielinski, Virmani - 1996
7 Houtsma and Arun Swami. Set-oriented mining of association rules – Maurice - 1995