Applies the diffset technique to speed-up vertical mining which uses fast set intersection operations.
Abstract: A number of vertical mining algorithms have been proposed recently for association mining, which have shown to be very effective and usually outperform horizontal approaches. The main advantage of the vertical format is support for fast frequency counting via intersection operations on transaction ids (tids) and automatic pruning of irrelevant data. The main problem with these approaches is when intermediate results of vertical tid lists become too large for memory, thus affecting the algorithm ... (Update)
Cited by: More
Frequent Subtree Mining - An Overview - Chi, Nijssen, al. (2001)
(Correct)
Data Mining and Knowledge Discovery, 11, 1--20, 2005 c - Springer Science Business
(Correct)
On the Efficiency of Association-rule Mining Algorithms - Pudi, Haritsa (2002)
(Correct)
Similar documents (at the sentence level): More
56.4%: Fast Vertical Mining Using Diffsets - Zaki, Gouda (2001)
(Correct)
9.6%: CHARM: An Efficient Algorithm for Closed Itemset Mining - Zaki, Hsiao (2002)
(Correct)
8.5%: Efficient Algorithms for Mining Closed - Itemsets And Their
(Correct)
Active bibliography (related documents): More All
0.3: Advances in Frequent Itemset Mining Implementations: Report.. - Goethals, Zaki (2003)
(Correct)
0.2: A Scalable Multi-Strategy Algorithm for Counting.. - Orlando, Palmerini.. (2002)
(Correct)
0.2: Adaptive and Resource-Aware Mining of Frequent Sets - Orlando Palmerini Perego (2002)
(Correct)
Similar documents based on text: More All
0.6: Efficiently Mining Maximal Frequent Itemsets - Gouda, Zaki (2001)
(Correct)
0.6: Turbo-charging Vertical Mining of Large Databases - Shenoy, Haritsa, Sudarshan, .. (2000)
(Correct)
0.5: Generating Non-Redundant Association Rules - Zaki (2000)
(Correct)
Related documents from co-citation: More All
20: Mining frequent patterns without candidate generation
- Han, Pei et al. - 1999
17: Fast Algorithms for Mining Association Rules
- Agrawal, Srikant - 1994
8: Turbo-charging vertical mining of large databases
- Shenoy, Haritsa et al. - 2000
BibTeX entry: (Update)
M. J. Zaki and K. Gouda. Fast Vertical Mining Using Diffsets. RPI Technical Report 01-1, 2001 11 http://citeseer.ist.psu.edu/zaki01fast.html More
@inproceedings{ zaki03fast,
author = {Mohammed J. Zaki and Karam Gouda},
title = {Fast vertical mining using diffsets},
booktitle = {KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining},
year = {2003},
isbn = {1-58113-737-0},
pages = {326--335},
location = {Washington, D.C.},
doi = {http://doi.acm.org/10.1145/956750.956788},
publisher = {ACM Press},
address = {New York, NY, USA},
url = {citeseer.ist.psu.edu/zaki01fast.html} }
Citations (may not include all citations):
400
Fast discovery of association rules (context) - Agrawal - 1996
249
Mining frequent patterns without candidate generation
- Han, Pei et al. - 2000
242
Dynamic itemset counting and implication rules for market ba..
- Brin, Motwani et al. - 1997
145
Sprint: A scalable parallel classifier for data mining
- Shafer, Agrawal et al. - 1996
109
New algorithms for fast discovery of association rules
- Zaki, Parthasarathy et al. - 1997
85
Discovering frequent closed itemsets for association rules
- Pasquier, Bastide et al. - 1999
60
ective hash based algorithm for mining association rules (context) - Park, Chen et al. - 1995
56
Generating non-redundant association rules
- Zaki - 2000
54
MAFIA: a maximal frequent itemset algorithm for transactiona.. (context) - Burdick, Calimlim et al. - 2001
54
Pincer-search: A new algorithm for discovering the maximum f..
- Lin, Kedem - 1998
53
ciently mining long patterns from databases (context) - Bayardo - 1998
40
Turbo-charging vertical mining of large databases
- Shenoy - 2000
39
IEEE Transactions on Knowledge and Data Engineering (context) - Zaki, for - 2000
37
Discovering all the most specific sentences by randomized al..
- Gunopulos, Mannila et al. - 1997
36
cient algorithm for mining association rules in large databa.. (context) - Savasere, Omiecinski et al. - 1995
33
cient algorithm for closed itemset mining (context) - Zaki, Hsiao et al. - 2002
26
Mining association rules: Anti-skew algorithms
- Lin, Dunham - 1998
21
cient algorithm for mining frequent closed itemsets (context) - Pei, Han et al. - 2000
20
ciently mining maximal frequent itemsets (context) - Gouda, Zaki - 2001
20
Depth First Generation of Long Patterns
- Agrawal, Aggarwal et al. - 2000
16
Integrating association rule mining with databases: alternat.. (context) - Sarawagi, Thomas et al. - 1998
10
Data organization and access for e#cient data mining (context) - Dunkel, Soparkar - 1999
7
Sequential pattern mining using bitmaps (context) - Ayres, Gehrke et al. - 2002
4
Data Mining: Concepts and Techniuqes (context) - Han, Kamber - 2001
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.rpi.edu/~zaki/papers.html): More
Parallel Classification for Data Mining on Shared-Memory.. - Zaki (1998)
(Correct)
Efficient Enumeration of Frequent Sequences - Zaki (1998)
(Correct)
PlanMine: Predicting Plan Failures using Sequence Mining - Zaki, Lesh, Ogihara (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC