See this document in CiteSeerX!

Decision Tree Construction for Data Mining on Cluster of Shared-Memory Multiprocessors  (Make Corrections)  (1 citation)
Henrique Andrade, Tahsin Kurc, Alan Sussman, Joel Saltz



  Home/Search   Context   Related

 
View or download:
umd.edu/pub/hpsl/p...1dataminetr.ps.Z
umd.edu/pub/hpsl/p...01dataminetr.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  umd.edu/misc/publist (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Classification of very large datasets is a challenging problem in data mining. It is desirable to have decision-tree classifiers that can handle large datasets, because a large dataset often increases the accuracy of the resulting classification model. Classification tree algorithms can benefit from parallelization because of large memory and computation requirements for handling large datasets. Clusters of shared-memory multiprocessors (SMPs), in which each shared-memory node has a small... (Update)

Context of citations to this paper:   More

.... visualization and processing of digitized microscopy images [3] visualization of largescale data [5,8,29,42,61] and data mining [4,7,34,68]. Although the datasets used for analysis and the data products generated by applications that manipulate those datasets may differ...

Cited by:   More
Processing Large-Scale Multidimensional Data in.. - Beynon, Chang.. (2002)   (Correct)

Active bibliography (related documents):   More   All
0.7:   Communication and Memory Efficient Parallel Decision Tree.. - Jin, Agrawal (2003)   (Correct)
0.5:   Data Mining Architectures - A Comparative Study - Thomas, Jayakumar, Muthukumaran   (Correct)
0.5:   Parallel and Distributed Computing for Data Mining - Zomaya, al. (1999)   (Correct)

Similar documents based on text:   More   All
0.6:   Efficient Execution of Multiple Query Workloads - In Data Analysis   (Correct)
0.6:   On Cache Replacement Policies for Servicing Mixed.. - Andrade, Kurc.. (2002)   (Correct)
0.5:   Scheduling Multiple Data Visualization Query Workloads .. - Andrade, Kurc..   (Correct)

Related documents from co-citation:   More   All
2:   Improving the performance and functionality of the virtual microscope (context) - Catalyurek, Kurc et al. - 2001

BibTeX entry:   (Update)

H. Andrade, T. Kurc, A. Sussman, and J. Saltz. Decision tree construction for data mining on clusters of shared-memory multiprocessors. Technical Report CS-TR-4203 and UMIACS-TR- http://citeseer.ist.psu.edu/388245.html   More

@misc{ andrade-decision,
  author = "H. Andrade and T. Kurc and A. Sussman and J. Saltz",
  title = "Decision tree construction for data mining on clusters of shared-memory
    multiprocessors",
  text = "H. Andrade, T. Kurc, A. Sussman, and J. Saltz. Decision tree construction
    for data mining on clusters of shared-memory multiprocessors. Technical
    Report CS-TR-4203 and UMIACS-TR-",
  url = "citeseer.ist.psu.edu/388245.html" }
Citations (may not include all citations):
145   SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
117   IEEE Transactions on Knowledge and Data Engineering (context) - Agrawal, Imielinski et al. - 1993
55   High-performance sorting on networks of workstations (context) - Arpaci-Dusseau, Arpaci-Dusseau et al. - 1997
38   Automatic construction of decision trees from data: A multi-.. - Murthy - 1998
36   Designing and mining multi-terabyte astronomy archives: The .. - Szalay, Kunszt et al. - 1999
35   A survey of methods for scaling up inductive algorithms - Provost, Kolluri - 1999
15   Scientific and Engineering Computation Series (context) - Snir, Otto et al. - 1996
13   Parallel formulations of decision-tree classification algori.. - Srivastava, Han et al. - 1999
12   A comprehensive bibliography of distributed shared memory (context) - Eskicioglu - 1996
11   Parallel classification for data mining on shared-memory mul.. - Zaki, Ho et al. - 1999
11   Density biased sampling: An improved method for data mining .. - Palmer, Faloutsos - 2000
9   CLOUDS: A decision tree classifier for large datasets - Alsabti, Ranka et al. - 1998
8   Update protocols and cluster-based shared memory - Keleher - 1999
6   Taming the giants and the monsters: Mining large databasesfo.. - Fayyad - 1998
4   Efficient parallel classification using dimensional aggregat.. - Goil, Choudhary - 1999
3   multithreaded decision tree builder (context) - Narlikar, parallel - 1998
3   Sorting on clusters of SMPs - Helman, Jaja - 1998
2   Darwin: A scalable integrated system for data mining (context) - Tamayo, Berlin et al. - 1997
2   Parallel classification on SMP systems - Zaki, Ho et al. - 1998
1   Space-Effcient Multithreading (context) - Narlikar - 1999
1   Dynamic load balancing of unstructured computations in decis.. - Srivastava, Han et al. - 1998
1   ScalParC: Anew scalable and efficient parallel classificatio.. (context) - Joshi, Karypis et al. - 1998

Documents on the same site (http://larva.cs.umd.edu/misc/publist.asp):   More
Compiler and Runtime Support for Programming in.. - Edjlali, Agrawal, .. (1995)   (Correct)
Runtime Compilation Techniques for Data Partitioning.. - Ponnusamy, Saltz.. (1993)   (Correct)
Interleaved Parallel Hybrid Arnoldi Method for a Parallel.. - Edjlali, Petiton (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC