(Enter summary)
Abstract: As the amount of information available for analysis is increasing, scalability of data mining applications is
becoming a critical factor. To this end, parallel versions of most of the commonly used data mining algorithms
have been developed in recent years. However, developing, maintaining, and optimizing a parallel data mining
application on today's parallel systems is an extremely time-consuming task. (Update)
Cited by: More
Distributed Data Mining Bibliography - Hillol
(Correct)
Similar documents (at the sentence level): More
42.4%: An Efficient Association Mining Implementation on Cluster of SMPs - Jin, Agrawal (2001)
(Correct)
17.1%: Compiler and Middleware Support for Scalable Data Mining - Agrawal, Jin, Li
(Correct)
8.1%: Shared Memory Parallelization of Data Mining Algorithms.. - Jin, Agrawal (2002)
(Correct)
Active bibliography (related documents): More All
1.0: Advanced Compiler and Runtime Support for Data Intensive.. - Ferreira, Agrawal, Saltz
(Correct)
0.8: Compiling Data Intensive Applications with Spatial.. - Ferreira, Agrawal, Jin.. (2000)
(Correct)
0.6: Efficient Parallel Frequency Mining Based On A Novel Top-Down.. - Özkural (2002)
(Correct)
Similar documents based on text: More All
0.3: Using Distributed Query Result Caching to Evaluate.. - Taylor, Stoffel.. (1998)
(Correct)
0.3: Strategies for Parallel Data Mining - Skillicorn (1999)
(Correct)
0.3: Large Scale Data Mining: The Challenges and The Solutions - Chattratichat.. (1997)
(Correct)
Related documents from co-citation: More All
2: Supporting the Optimisation of Distributed Data Mining by Predicting Application..
- Krishnaswamy, Loke et al. - 2002
2: Robust order statistics based ensembles for distributed data mining (context) - Tumer, Ghosh - 1999
2: Parallel data mining for association rules on shared-memory multi-processors
- Zaki, Ogihara et al. - 1996
BibTeX entry: (Update)
R. Jin and G. Agrawal. A Middleware for Developing Parallel Data Mining Applications. In 2001, editor, Proceedings of the First SIAM International Conference on Data Mining, April Chicago, IL. http://citeseer.ist.psu.edu/jin01middleware.html More
@misc{ jin01middleware,
author = "R. Jin and G. Agrawal",
title = "A Middleware for Developing Parallel Data Mining Applications",
text = "R. Jin and G. Agrawal. A Middleware for Developing Parallel Data Mining
Applications. In 2001, editor, Proceedings of the First SIAM International
Conference on Data Mining, April Chicago, IL.",
year = "2001",
url = "citeseer.ist.psu.edu/jin01middleware.html" }
Citations (may not include all citations):
805
Algorithms for Clustering Data (context) - Jain, Dubes - 1988 ACM
225
Data Mining: Concepts and Techniques (context) - Han, Kamber - 2000
159
The LRPD test: Speculative run-time parallelization of loops..
- Rauchwerger, Padua - 1999 DBLP
117
IEEE Transactions on Knowledge and Data Engineering (context) - Agrawal, Shafer et al. - 1996
115
Scalable parallel datamining for association rules
- Han, Karypis et al. - 2000
115
Scalable parallel datamining for association rules
- Han, Karypis et al.
67
ACM Transactions on Computer Systems (context) - Corbett, Feitelson et al. - 1996
47
Titan: A high performance remote-sensing database
- Chang, Moon et al.
45
Digital dynamic telepathology - the Virtual Microscope
- Afework, Beynon et al.
42
Mining very large databases with parallel processing (context) - Freitas, Lavington - 1998 ACM
39
An extended two-phase method for accessing sections of out-o..
- Thakur, Choudhary - 1996 ACM
39
Parallel and distributed association mining: A survey
- Zaki - 1999
39
A data-clustering algorithm on distributed memory multiproce..
- Dhillon, Modha - 1999
38
Infrastructure for building parallel database systems for mu..
- Chang, Ferreira et al. - 1999
32
Server-directed collective I/O in Panda
- Seamons, Chen et al. - 1995 ACM DBLP
29
Coupling multiple simulations via a high performance customi.. (context) - Kurc, Sussman et al. - 1999 DBLP
26
The Virtual Microscope
- Ferreira, Moon et al.
19
A customizable parallel database for multi-dimensional data (context) - Chang, Acharya et al. - 1998
19
Passion: Optimized I/O for parallel applications (context) - Thakur, Choudhary et al. - 1996 DBLP
14
Implementation issues in the design of i/o intensive data mi..
- Baraglia, Laforenza et al. ACM DBLP
14
The design and evaluation of a high-performance earth scienc..
- Shock, Chang et al. - 1998 ACM DBLP
13
Strategies for parallel data mining
- Skillicorn - 1999 ACM
12
Run-time parallelization and scheduling of loops (context) - Saltz, Mirchandaney et al. - 1991 ACM DBLP
11
Query planning for range queries with user-defined aggregati..
- Chang, Kurc et al. - 1999
11
Compiling object-oriented data intensive computations (context) - Ferriera, Agrawal et al.
8
Mining of association rules in very large databases: A struc..
- Becuzzi, Coppola et al. - 1999
7
Scheduling in a high performance remote-sensing data server (context) - Chang, Sussman et al. - 1997 DBLP
7
Designing a kernel for data mining (context) - Anand - 1997
7
A requirements analysis for parallel kdd systems
- Maniatty, Zaki ACM DBLP
6
Memory placement techniques for parallel association mining
- Parthasarathy, Zaki et al. - 1998 DBLP
6
Parallel data mining for association rules on shared-memory ..
- Parthasarathy, Zaki et al. - 2000 DBLP
5
High-level programming methodologies for data intensive comp..
- Agrawal, Ferreira et al. - 2000
5
Theory and practice (context) - Cheeseman, Stutz et al. - 1996 ACM DBLP
4
Performance models for co-ordinating parallel data classific..
- Darlington, Ghanem et al. - 1997
4
Scalable parallel clustering for data mining on multicompute..
- Foti, Lipari et al. ACM DBLP
4
Language extensions and compilation techniques for data inte..
- Agrawal, Ferriera et al. - 2000
4
Towards data mining benchmarking: A test bed for performance.. (context) - Pei, Mao et al.
3
Parallel kh mean clustering large dataset (context) - Stoffel, Parallel et al. - 1999
3
Strategies for parallelizing data mining
- Skillicorn
2
Large scale data mining: The challenges and the solutions
- Chattratichat, Darlington et al. - 1997
2
Parallel algorithms for data mining
- Joshi, Sam et al. - 2000
Documents on the same site (http://www.cis.udel.edu/~agrawal/DataMining/Intro/Intro.html): More
An Efficient Association Mining Implementation on Cluster of SMPs - Jin, Agrawal (2001)
(Correct)
Shared Memory Parallelization of Data Mining Algorithms.. - Jin, Agrawal (2002)
(Correct)
Mining Residue Contacts in Proteins Using Local Structure.. - Zaki, Jin, Bystroff (2000)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC