(Enter summary)
Abstract: Classification of very large datasets is a challenging
problem in data mining. It is desirable to have decision-tree
classifiers that can handle large datasets, because a large
dataset often increases the accuracy of the resulting classification
model. Classification tree algorithms can benefit
from parallelization because of large memory and computation
requirements for handling large datasets. Clusters
of shared-memory multiprocessors (SMPs), in which each
shared-memory node has a small... (Update)
Context of citations to this paper: More
.... visualization and processing of digitized microscopy images [3] visualization of largescale data [5,8,29,42,61] and data mining [4,7,34,68]. Although the datasets used for analysis and the data products generated by applications that manipulate those datasets may differ...
Cited by: More
Processing Large-Scale Multidimensional Data in.. - Beynon, Chang.. (2002)
(Correct)
Active bibliography (related documents): More All
0.7: Communication and Memory Efficient Parallel Decision Tree.. - Jin, Agrawal (2003)
(Correct)
0.5: Data Mining Architectures - A Comparative Study - Thomas, Jayakumar, Muthukumaran
(Correct)
0.5: Parallel and Distributed Computing for Data Mining - Zomaya, al. (1999)
(Correct)
Similar documents based on text: More All
0.6: Efficient Execution of Multiple Query Workloads - In Data Analysis
(Correct)
0.6: On Cache Replacement Policies for Servicing Mixed.. - Andrade, Kurc.. (2002)
(Correct)
0.5: Scheduling Multiple Data Visualization Query Workloads .. - Andrade, Kurc..
(Correct)
Related documents from co-citation: More All
2: Improving the performance and functionality of the virtual microscope (context) - Catalyurek, Kurc et al. - 2001
BibTeX entry: (Update)
H. Andrade, T. Kurc, A. Sussman, and J. Saltz. Decision tree construction for data mining on clusters of shared-memory multiprocessors. Technical Report CS-TR-4203 and UMIACS-TR- http://citeseer.ist.psu.edu/388245.html More
@misc{ andrade-decision,
author = "H. Andrade and T. Kurc and A. Sussman and J. Saltz",
title = "Decision tree construction for data mining on clusters of shared-memory
multiprocessors",
text = "H. Andrade, T. Kurc, A. Sussman, and J. Saltz. Decision tree construction
for data mining on clusters of shared-memory multiprocessors. Technical
Report CS-TR-4203 and UMIACS-TR-",
url = "citeseer.ist.psu.edu/388245.html" }
Citations (may not include all citations):
145
SPRINT: A scalable parallel classifier for data mining
- Shafer, Agrawal et al. - 1996
117
IEEE Transactions on Knowledge and Data Engineering (context) - Agrawal, Imielinski et al. - 1993
55
High-performance sorting on networks of workstations (context) - Arpaci-Dusseau, Arpaci-Dusseau et al. - 1997
38
Automatic construction of decision trees from data: A multi-..
- Murthy - 1998
36
Designing and mining multi-terabyte astronomy archives: The ..
- Szalay, Kunszt et al. - 1999
35
A survey of methods for scaling up inductive algorithms
- Provost, Kolluri - 1999
15
Scientific and Engineering Computation Series (context) - Snir, Otto et al. - 1996
13
Parallel formulations of decision-tree classification algori..
- Srivastava, Han et al. - 1999
12
A comprehensive bibliography of distributed shared memory (context) - Eskicioglu - 1996
11
Parallel classification for data mining on shared-memory mul..
- Zaki, Ho et al. - 1999
11
Density biased sampling: An improved method for data mining ..
- Palmer, Faloutsos - 2000
9
CLOUDS: A decision tree classifier for large datasets
- Alsabti, Ranka et al. - 1998
8
Update protocols and cluster-based shared memory
- Keleher - 1999
6
Taming the giants and the monsters: Mining large databasesfo..
- Fayyad - 1998
4
Efficient parallel classification using dimensional aggregat..
- Goil, Choudhary - 1999
3
multithreaded decision tree builder (context) - Narlikar, parallel - 1998
3
Sorting on clusters of SMPs
- Helman, Jaja - 1998
2
Darwin: A scalable integrated system for data mining (context) - Tamayo, Berlin et al. - 1997
2
Parallel classification on SMP systems
- Zaki, Ho et al. - 1998
1
Space-Effcient Multithreading (context) - Narlikar - 1999
1
Dynamic load balancing of unstructured computations in decis..
- Srivastava, Han et al. - 1998
1
ScalParC: Anew scalable and efficient parallel classificatio.. (context) - Joshi, Karypis et al. - 1998
Documents on the same site (http://larva.cs.umd.edu/misc/publist.asp): More
Compiler and Runtime Support for Programming in.. - Edjlali, Agrawal, .. (1995)
(Correct)
Runtime Compilation Techniques for Data Partitioning.. - Ponnusamy, Saltz.. (1993)
(Correct)
Interleaved Parallel Hybrid Arnoldi Method for a Parallel.. - Edjlali, Petiton (1996)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC