See this document in CiteSeerX!

Fast Vertical Mining Using Diffsets (2001)  (Make Corrections)  (24 citations)
Mohammed J. Zaki, Karam Gouda
KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining



  Home/Search   Context   Related

 
View or download:
rpi.edu/~zaki/./PS...KDD03diffsets.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  rpi.edu/~zaki/papers (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
Applies the diffset technique to speed-up vertical mining which uses fast set intersection operations.

Abstract: A number of vertical mining algorithms have been proposed recently for association mining, which have shown to be very effective and usually outperform horizontal approaches. The main advantage of the vertical format is support for fast frequency counting via intersection operations on transaction ids (tids) and automatic pruning of irrelevant data. The main problem with these approaches is when intermediate results of vertical tid lists become too large for memory, thus affecting the algorithm ... (Update)

Cited by:   More
Frequent Subtree Mining - An Overview - Chi, Nijssen, al. (2001)   (Correct)
Data Mining and Knowledge Discovery, 11, 1--20, 2005 c - Springer Science Business   (Correct)
On the Efficiency of Association-rule Mining Algorithms - Pudi, Haritsa (2002)   (Correct)

Similar documents (at the sentence level):   More
56.4%:   Fast Vertical Mining Using Diffsets - Zaki, Gouda (2001)   (Correct)
9.6%:   CHARM: An Efficient Algorithm for Closed Itemset Mining - Zaki, Hsiao (2002)   (Correct)
8.5%:   Efficient Algorithms for Mining Closed - Itemsets And Their   (Correct)

Active bibliography (related documents):   More   All
0.3:   Advances in Frequent Itemset Mining Implementations: Report.. - Goethals, Zaki (2003)   (Correct)
0.2:   A Scalable Multi-Strategy Algorithm for Counting.. - Orlando, Palmerini.. (2002)   (Correct)
0.2:   Adaptive and Resource-Aware Mining of Frequent Sets - Orlando Palmerini Perego (2002)   (Correct)

Similar documents based on text:   More   All
0.6:   Efficiently Mining Maximal Frequent Itemsets - Gouda, Zaki (2001)   (Correct)
0.6:   Turbo-charging Vertical Mining of Large Databases - Shenoy, Haritsa, Sudarshan, .. (2000)   (Correct)
0.5:   Generating Non-Redundant Association Rules - Zaki (2000)   (Correct)

Related documents from co-citation:   More   All
20:   Mining frequent patterns without candidate generation - Han, Pei et al. - 1999
17:   Fast Algorithms for Mining Association Rules - Agrawal, Srikant - 1994
8:   Turbo-charging vertical mining of large databases - Shenoy, Haritsa et al. - 2000

BibTeX entry:   (Update)

M. J. Zaki and K. Gouda. Fast Vertical Mining Using Diffsets. RPI Technical Report 01-1, 2001 11 http://citeseer.ist.psu.edu/zaki01fast.html   More

@inproceedings{ zaki03fast,
 author = {Mohammed J. Zaki and Karam Gouda},
 title = {Fast vertical mining using diffsets},
 booktitle = {KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining},
 year = {2003},
 isbn = {1-58113-737-0},
 pages = {326--335},
 location = {Washington, D.C.},
 doi = {http://doi.acm.org/10.1145/956750.956788},
 publisher = {ACM Press},
 address = {New York, NY, USA},
 url = {citeseer.ist.psu.edu/zaki01fast.html} }
Citations (may not include all citations):
400   Fast discovery of association rules (context) - Agrawal - 1996
249   Mining frequent patterns without candidate generation - Han, Pei et al. - 2000
242   Dynamic itemset counting and implication rules for market ba.. - Brin, Motwani et al. - 1997
145   Sprint: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
109   New algorithms for fast discovery of association rules - Zaki, Parthasarathy et al. - 1997
85   Discovering frequent closed itemsets for association rules - Pasquier, Bastide et al. - 1999
60   ective hash based algorithm for mining association rules (context) - Park, Chen et al. - 1995
56   Generating non-redundant association rules - Zaki - 2000
54   MAFIA: a maximal frequent itemset algorithm for transactiona.. (context) - Burdick, Calimlim et al. - 2001
54   Pincer-search: A new algorithm for discovering the maximum f.. - Lin, Kedem - 1998
53   ciently mining long patterns from databases (context) - Bayardo - 1998
40   Turbo-charging vertical mining of large databases - Shenoy - 2000
39   IEEE Transactions on Knowledge and Data Engineering (context) - Zaki, for - 2000
37   Discovering all the most specific sentences by randomized al.. - Gunopulos, Mannila et al. - 1997
36   cient algorithm for mining association rules in large databa.. (context) - Savasere, Omiecinski et al. - 1995
33   cient algorithm for closed itemset mining (context) - Zaki, Hsiao et al. - 2002
26   Mining association rules: Anti-skew algorithms - Lin, Dunham - 1998
21   cient algorithm for mining frequent closed itemsets (context) - Pei, Han et al. - 2000
20   ciently mining maximal frequent itemsets (context) - Gouda, Zaki - 2001
20   Depth First Generation of Long Patterns - Agrawal, Aggarwal et al. - 2000
16   Integrating association rule mining with databases: alternat.. (context) - Sarawagi, Thomas et al. - 1998
10   Data organization and access for e#cient data mining (context) - Dunkel, Soparkar - 1999
7   Sequential pattern mining using bitmaps (context) - Ayres, Gehrke et al. - 2002
4   Data Mining: Concepts and Techniuqes (context) - Han, Kamber - 2001



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.rpi.edu/~zaki/papers.html):   More
Parallel Classification for Data Mining on Shared-Memory.. - Zaki (1998)   (Correct)
Efficient Enumeration of Frequent Sequences - Zaki (1998)   (Correct)
PlanMine: Predicting Plan Failures using Sequence Mining - Zaki, Lesh, Ogihara (1999)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC