See this document in CiteSeerX!

Shared Memory Parallelization of Data Mining Algorithms: Techniques, Programming Interface, and Performance (2004)  (Make Corrections)  (4 citations)
Ruoming Jin, Ge Yang, Gagan Agrawal



  Home/Search   Context   Related

 
View or download:
ohiostate.edu/~jinr/Paper...tkde04.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  ohiostate.edu/~jinr/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: With recent technological advances, shared memory parallel machines have become more scalable, and offer large main memories and high bus bandwidths. They are emerging as good platforms for data warehousing and data mining. In this paper, we focus on shared memory parallelization of data mining algorithms. We have developed a series of techniques for parallelization of data mining algorithms, including full replication, full locking, fixed locking, optimized full locking, and cache-sensitive ... (Update)

Context of citations to this paper:   More

...is of significant interest. In our previous work, we have developed several techniques for parallelizing random write reductions [10, 11]. One of the tech niques involves creating a copy of the reduction object for each thread and is referred to as full replication. The...

Cited by:   More
Compiler and Runtime Support for Shared Memory.. - Li, Jin, Agrawal (2002)   (Correct)
Communication and Memory Efficient Parallel Decision Tree.. - Jin, Agrawal (2003)   (Correct)
Distributed Data Mining Bibliography - Hillol   (Correct)

Similar documents (at the sentence level):
15.8%:   Shared Memory Parallelization of Data Mining Algorithms.. - Jin, Agrawal (2002)   (Correct)
5.7%:   Performance Prediction for Random Write Reductions: A Case.. - Jin, Agrawal (2002)   (Correct)
5.7%:   A Middleware for Developing Parallel Data Mining Applications - Jin, Agrawal (2001)   (Correct)

Active bibliography (related documents):   More   All
0.8:   Compiler and Middleware Support for Scalable Data Mining - Agrawal, Jin, Li   (Correct)
0.6:   An Efficient Association Mining Implementation on Cluster of SMPs - Jin, Agrawal (2001)   (Correct)
0.5:   Thesis Proposal - Ruoming Jin Department   (Correct)

Similar documents based on text:   More   All
0.2:   Compiling Data Intensive Applications with Spatial Coordinates - Ferreira, Agrawal, Jin (2000)   (Correct)
0.2:   Communication and Memory Optimal Parallel Data Cube.. - Jin, Yang, Agrawal   (Correct)
0.2:   High Level Programming Methodologies for Data Intensive.. - Renato (2000)   (Correct)

Related documents from co-citation:   More   All
3:   Parallel and distributed association mining: A survey - Zaki - 1999
3:   IEEE Transactions on Knowledge and Data Engineering (context) - Agrawal, Shafer et al. - 1996
3:   A middleware for developing parallel data mining implementations (context) - Jin, Agrawal - 2001

BibTeX entry:   (Update)

Ruoming Jin and Gagan Agrawal. Shared Memory Parallelization of Data Mining Algorithms: Techniques, Programming Interface, and Performance. In Proceedings of the second SIAM conference on Data Mining, April 2002. http://citeseer.ist.psu.edu/jin04shared.html   More

@misc{ jin02shared,
  author = "R. Jin and G. Agrawal",
  title = "Shared Memory Parallelization of Data Mining Algorithms: Techniques",
  text = "Ruoming Jin and Gagan Agrawal. Shared Memory Parallelization of Data Mining
    Algorithms: Techniques, Programming Interface, and Performance. In Proceedings
    of the second SIAM conference on Data Mining, April 2002.",
  year = "2002",
  url = "citeseer.ist.psu.edu/jin04shared.html" }
Citations (may not include all citations):
2177   Programs for Machine Learning (context) - Quinlan - 1993
1575   Computer Architecture: A Quantitative Approach (context) - Hennessy, Patterson - 1996
910   Fast Algorithms for Mining Association Rules - Agrawal, Srikant - 1994
805   Algorithms for Clustering Data (context) - Jain, Dubes - 1988
249   Mining Frequent Patterns without Candidate Generation - Han, Pei et al. - 2000
242   Dynamic Itemset Counting and Implication Rules for Market Ba.. - Brin, Motwani et al. - 1997
230   Cilk: An Efficient Multithreaded Runtime System - Blumofe, Joerg - 1995
225   Data Mining: Concepts and Techniques (context) - Han, Kamber - 2000
197   Maximizing Multiprocessor Performance with the SUIF Compiler - Hall, Amarsinghe et al. - 1996
164   An Efficient Algorithm for Mining Association Rules in Large.. (context) - Savasere, Omiecinski et al. - 1995
145   SPRINT: A Scalable Parallel Classifier for Data Mining - Shafer, Agrawal et al. - 1996
136   Parallel Programming with Polaris (context) - Blume, Doallo et al. - 1996
115   ScalableParallel Datamining for Association Rules - Han, Karypis et al. - 1997
115   Scalable Parallel Datamining for Association Rules - Han, Karypis et al. - 2000
111   SLIQ: A Fast Scalable Classifier for Data Mining - Mehta, Agrawal et al. - 1996
100   Database Mining: A Performance Perspective - Agrawal, Imielinski et al. - 1993
94   Run-Time Parallelization and Scheduling of Loops (context) - Saltz, Mirchandaney et al. - 1991
56   Parallel Mining of Association Rules - Agrawal, Shafer - 1996
45   Parallel Data Mining for Association Rules on Shared Memory .. - Zaki, Ogihara et al. - 1996
45   Parallel Data Mining for Association Rules on Shared-Memory .. - Parthasarathy, Zaki et al. - 2000
44   Fast Sequential and Parallel Algorithms for Association Rule.. - Mueller - 1995
39   Parallel and Distributed Association Mining: A Survey - Zaki - 1999
39   A Data-Clustering Algorithm on Distributed Memory Multiproce.. - Dhillon, Modha - 1999
38   Automatic Construction of Decision Trees from Data: A Multi-.. - Murthy - 1998
37   Boat---- Optimistic Decision Tree Construction - Gehrke, Ganti et al. - 1999
35   A Survey of Methods for Scaling up Inductive Algorithms - Provost, Kolluri - 1999
31   Rainforest---A Framework for Fast Decision Tree Construction.. - Gehrke, Ramakrishnan et al. - 1998
29   Compiler and Software Distributed Shared Memory Support for .. - Lu, Cox et al. - 1997
27   Scalparc: A New Scalable and Efficient Parallel Classificati.. - Joshi, Karypis et al. - 1998
22   Efficient Synchronization: Let Them Eat QOLB (context) - Kagi, Burger et al. - 1997
21   the Automatic Parallelization of Sparse and Irregular Fortra.. - Lin, Padua - 1998
17   Adaptive Reduction Parallelization Techniques - Yu, Rauchwerger - 2000
14   Memory Placement Techniques for Parallel Association Mining - Parthasarathy, Zaki et al. - 1998
13   Parallel Formulations of Decision-Tree Classification Algori.. - Srivastava, Han et al. - 1998
13   Strategies for Parallel Data Mining - Skillicorn - 1999
12   A Compiler Method for the Parallel Execution of Irregular Re.. - Gutierrez, Plata et al. - 2000
11   Parallel Classification for Data Mining on Shared-Memory Mul.. - Zaki, Ho et al. - 1999
11   Compiling Object-Oriented Data Intensive Computations (context) - Ferreira, Agrawal et al. - 2000
8   Mining of Association Rules in Very Large Databases: A Struc.. - Becuzzi, Coppola et al. - 1999
7   PARSIMONY: An Infrastructure for Parallel Multidimensional A.. (context) - Goil, Choudhary - 2001
7   Clouds: Classification for Large or Out-of-Core Datasets (context) - Alsabti, Ranka et al. - 1998
5   A Middleware for Developing Parallel Data Mining Implementat.. (context) - Jin, Agrawal - 2001
4   Performance Prediction for Random Write Reductions: A Case S.. - Jin, Agrawal - 2002
4   An Effecitive Hash Based Algorithm for Mining Association Ru.. (context) - Park, Chen et al. - 1995
4   Efficient Parallel Classification Using Dimensional Aggregat.. - Goil, Choudhary - 1999
4   Architectural Considerations for Parallel Query Evaluation A.. - Shatdal - 1999
3   Distributed Data Clustering Can be Efficient and Exact (context) - Forman, Zhang - 2000
2   Compiler and Runtime Support for Shared Memory Parallelizati.. - Li, Jin et al. - 2002
2   Mechanisms for Efficient Shared-Memory, Lock-Based Sychroniz.. - Kagi - 1999
2   An Efficient Implementation of Apriori Association Mining on.. (context) - Jin, Agrawal - 2001
2   Oracle Parallel Processing (context) - Mahapatra, Mishra - 2000
2   Density Biases Sampling: An Improved Method for Data Mining .. (context) - Palmer, Faloutsos - 2000
1   Bayesian Classification (Autoclass): Theory and Practice (context) - Cheeseman, Stutz - 1996
1   A Parallel, Multithreaded Decision Tree Builder - Narlikar - 1998
1   A Compilation Framework for Distributed Memory Parallelizatt.. (context) - Li, Jin et al. - 2002
1   Efficient C4.5 (context) - Ruggieri - 1999
1   Universal Database Goes Parallel with Enterprise and Enterpr.. (context) - Db - 1999

Documents on the same site (http://www.cse.ohio-state.edu/~jinr/):   More
Thesis Proposal - Ruoming Jin Department   (Correct)
Efficient Decision Tree Construction on Streaming Data - Jin, Agrawal (2003)   (Correct)
Shared Memory Parallelization of Data Mining Algorithms.. - Jin, Agrawal (2002)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC