See this document in CiteSeerX!

A Middleware for Developing Parallel Data Mining Applications (2001)  (Make Corrections)  (1 citation)
Ruoming Jin, Gagan Agrawal



  Home/Search   Context   Related

 
View or download:
udel.edu/~agrawal/Data...middleware1.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  udel.edu/~agrawal/DataMin...Intro (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: As the amount of information available for analysis is increasing, scalability of data mining applications is becoming a critical factor. To this end, parallel versions of most of the commonly used data mining algorithms have been developed in recent years. However, developing, maintaining, and optimizing a parallel data mining application on today's parallel systems is an extremely time-consuming task. (Update)

Cited by:   More
Distributed Data Mining Bibliography - Hillol   (Correct)

Similar documents (at the sentence level):   More
42.4%:   An Efficient Association Mining Implementation on Cluster of SMPs - Jin, Agrawal (2001)   (Correct)
17.1%:   Compiler and Middleware Support for Scalable Data Mining - Agrawal, Jin, Li   (Correct)
8.1%:   Shared Memory Parallelization of Data Mining Algorithms.. - Jin, Agrawal (2002)   (Correct)

Active bibliography (related documents):   More   All
1.0:   Advanced Compiler and Runtime Support for Data Intensive.. - Ferreira, Agrawal, Saltz   (Correct)
0.8:   Compiling Data Intensive Applications with Spatial.. - Ferreira, Agrawal, Jin.. (2000)   (Correct)
0.6:   Efficient Parallel Frequency Mining Based On A Novel Top-Down.. - Özkural (2002)   (Correct)

Similar documents based on text:   More   All
0.3:   Using Distributed Query Result Caching to Evaluate.. - Taylor, Stoffel.. (1998)   (Correct)
0.3:   Strategies for Parallel Data Mining - Skillicorn (1999)   (Correct)
0.3:   Large Scale Data Mining: The Challenges and The Solutions - Chattratichat.. (1997)   (Correct)

Related documents from co-citation:   More   All
2:   Supporting the Optimisation of Distributed Data Mining by Predicting Application.. - Krishnaswamy, Loke et al. - 2002
2:   Robust order statistics based ensembles for distributed data mining (context) - Tumer, Ghosh - 1999
2:   Parallel data mining for association rules on shared-memory multi-processors - Zaki, Ogihara et al. - 1996

BibTeX entry:   (Update)

R. Jin and G. Agrawal. A Middleware for Developing Parallel Data Mining Applications. In 2001, editor, Proceedings of the First SIAM International Conference on Data Mining, April Chicago, IL. http://citeseer.ist.psu.edu/jin01middleware.html   More

@misc{ jin01middleware,
  author = "R. Jin and G. Agrawal",
  title = "A Middleware for Developing Parallel Data Mining Applications",
  text = "R. Jin and G. Agrawal. A Middleware for Developing Parallel Data Mining
    Applications. In 2001, editor, Proceedings of the First SIAM International
    Conference on Data Mining, April Chicago, IL.",
  year = "2001",
  url = "citeseer.ist.psu.edu/jin01middleware.html" }
Citations (may not include all citations):
805   Algorithms for Clustering Data (context) - Jain, Dubes - 1988  ACM
225   Data Mining: Concepts and Techniques (context) - Han, Kamber - 2000
159   The LRPD test: Speculative run-time parallelization of loops.. - Rauchwerger, Padua - 1999  DBLP
117   IEEE Transactions on Knowledge and Data Engineering (context) - Agrawal, Shafer et al. - 1996
115   Scalable parallel datamining for association rules - Han, Karypis et al. - 2000
115   Scalable parallel datamining for association rules - Han, Karypis et al.
67   ACM Transactions on Computer Systems (context) - Corbett, Feitelson et al. - 1996
47   Titan: A high performance remote-sensing database - Chang, Moon et al.
45   Digital dynamic telepathology - the Virtual Microscope - Afework, Beynon et al.
42   Mining very large databases with parallel processing (context) - Freitas, Lavington - 1998  ACM
39   An extended two-phase method for accessing sections of out-o.. - Thakur, Choudhary - 1996  ACM
39   Parallel and distributed association mining: A survey - Zaki - 1999
39   A data-clustering algorithm on distributed memory multiproce.. - Dhillon, Modha - 1999
38   Infrastructure for building parallel database systems for mu.. - Chang, Ferreira et al. - 1999
32   Server-directed collective I/O in Panda - Seamons, Chen et al. - 1995  ACM   DBLP
29   Coupling multiple simulations via a high performance customi.. (context) - Kurc, Sussman et al. - 1999  DBLP
26   The Virtual Microscope - Ferreira, Moon et al.
19   A customizable parallel database for multi-dimensional data (context) - Chang, Acharya et al. - 1998
19   Passion: Optimized I/O for parallel applications (context) - Thakur, Choudhary et al. - 1996  DBLP
14   Implementation issues in the design of i/o intensive data mi.. - Baraglia, Laforenza et al.  ACM   DBLP
14   The design and evaluation of a high-performance earth scienc.. - Shock, Chang et al. - 1998  ACM   DBLP
13   Strategies for parallel data mining - Skillicorn - 1999  ACM
12   Run-time parallelization and scheduling of loops (context) - Saltz, Mirchandaney et al. - 1991  ACM   DBLP
11   Query planning for range queries with user-defined aggregati.. - Chang, Kurc et al. - 1999
11   Compiling object-oriented data intensive computations (context) - Ferriera, Agrawal et al.
8   Mining of association rules in very large databases: A struc.. - Becuzzi, Coppola et al. - 1999
7   Scheduling in a high performance remote-sensing data server (context) - Chang, Sussman et al. - 1997  DBLP
7   Designing a kernel for data mining (context) - Anand - 1997
7   A requirements analysis for parallel kdd systems - Maniatty, Zaki  ACM   DBLP
6   Memory placement techniques for parallel association mining - Parthasarathy, Zaki et al. - 1998  DBLP
6   Parallel data mining for association rules on shared-memory .. - Parthasarathy, Zaki et al. - 2000  DBLP
5   High-level programming methodologies for data intensive comp.. - Agrawal, Ferreira et al. - 2000
5   Theory and practice (context) - Cheeseman, Stutz et al. - 1996  ACM   DBLP
4   Performance models for co-ordinating parallel data classific.. - Darlington, Ghanem et al. - 1997
4   Scalable parallel clustering for data mining on multicompute.. - Foti, Lipari et al.  ACM   DBLP
4   Language extensions and compilation techniques for data inte.. - Agrawal, Ferriera et al. - 2000
4   Towards data mining benchmarking: A test bed for performance.. (context) - Pei, Mao et al.
3   Parallel kh mean clustering large dataset (context) - Stoffel, Parallel et al. - 1999
3   Strategies for parallelizing data mining - Skillicorn
2   Large scale data mining: The challenges and the solutions - Chattratichat, Darlington et al. - 1997
2   Parallel algorithms for data mining - Joshi, Sam et al. - 2000

Documents on the same site (http://www.cis.udel.edu/~agrawal/DataMining/Intro/Intro.html):   More
An Efficient Association Mining Implementation on Cluster of SMPs - Jin, Agrawal (2001)   (Correct)
Shared Memory Parallelization of Data Mining Algorithms.. - Jin, Agrawal (2002)   (Correct)
Mining Residue Contacts in Proteins Using Local Structure.. - Zaki, Jin, Bystroff (2000)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC