See this document in CiteSeerX!

SPRINT: A Scalable Parallel Classifier for Data Mining (1996)  (Make Corrections)  (145 citations)
John Shafer, Rakesh Agrawal, Manish Mehta
Proc. 22nd Int. Conf. Very Large Databases, VLDB



  Home/Search   Context   Related

Links:   DBLP

 
View or download:
barbera.cnuce.cnr.i...vldb96_sprint.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help
Problem Downloading?
From:  barbera.cnuce.cnr.it/~...articles (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Classification is an important data mining problem. Although classification is a well-studied problem, most of the current classification algorithms require that all or a portion of the the entire dataset remain permanently in memory. This limits their suitability for mining over large databases. We present a new decision-tree-based classification algorithm, called SPRINT that removes all of the memory restrictions, and is fast and scalable. The algorithm has also been designed to be easily... (Update)

Cited by:   More
Pipelining of Fuzzy ARTMAP without Matchtracking.. - Castro, Secretan, ..   (Correct)
A Data Partitioning Approach to speed up the Fuzzy ARTMAP.. - Castro, al.   (Correct)
The Use of Emerging Patterns in the Analysis of Gene.. - Dong, Li, Wong (2003)   (Correct)

Similar documents (at the sentence level):
65.6%:   SPRINT: A Scalable Parallel Classifier for Data Mining - Shafer, Agrawal, Mehta (1996)   (Correct)

Active bibliography (related documents):   More   All
0.5:   Learning Features that Predict Cue Usage - Di Eugenio, Moore, Paolucci (1997)   (Correct)
0.2:   Fast Similarity Search in the Presence of Noise.. - Agrawal, Lin, Sawhney, .. (1995)   (Correct)
0.2:   A Linear Method for Deviation Detection in Large Databases - Arning, Agrawal, Raghavan (1996)   (Correct)

Similar documents based on text:   More   All
0.2:   Parallel Classification on SMP Systems - Zaki, Ho, Agrawal (1998)   (Correct)
0.2:   The Quest Data Mining System - Agrawal, Mehta, Shafer, Srikant.. (1996)   (Correct)
0.2:   Statistical Behavior and Consistency of Classification Methods.. - Zhang (2001)   (Correct)

Related documents from co-citation:   More   All
46:   SLIQ: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996
41:   Programs for machine learning (context) - Quinlan - 1993
39:   Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984

BibTeX entry:   (Update)

J. Shafer, R. Agrawal, M. Mehta. SPRINT: A scalable parallel classifier for data mining. In 22nd VLDB Conference, Sept 1996. http://citeseer.ist.psu.edu/shafer96sprint.html   More

@inproceedings{ shafer96sprint,
    author = "John C. Shafer and Rakesh Agrawal and Manish Mehta",
    title = "{SPRINT}: {A} Scalable Parallel Classifier for Data Mining",
    booktitle = "Proc. 22nd Int. Conf. Very Large Databases, {VLDB}",
    month = "3--6~",
    publisher = "Morgan Kaufmann",
    editor = "T. M. Vijayaraman and Alejandro P. Buchmann and C. Mohan and Nandlal L. Sarda",
    isbn = "1-55860-382-4",
    pages = "544--555",
    year = "1996",
    url = "citeseer.ist.psu.edu/shafer96sprint.html" }
Citations (may not include all citations):
6   cation and Regression Trees (context) - Stone - 1984
5   Technical Report STAN-CS (context) - STAN, University - 1979
4   IEEE Transactions on Knowledge and Data Engineering - mining, perspective - 1993
2   on and Prediction Methods from Statistics, Neural Nets, Mach.. (context) - that, Classi - 1991
1   er for database mining applications (context) - Iyer, Swami et al. - 1992
1   Neural and Statistical Classi- #cation (context) - Learning - 1994
1   The Gamma database machine project (context) - Bricker, Hsiao et al. - 1990  ACM   DBLP
1   GA23-2475-02 edition (context) - Version - 1992



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://barbera.cnuce.cnr.it/~palmeri/datam/articles.html):   More
CACTUS - Clustering Categorical Data Using Summaries - Ganti, Gehrke, Ramakrishnan (1999)   (Correct)
Mining Very Large Databases - Ganti, Gehrke, Ramakrishnan (1999)   (Correct)
Automatic Subspace Clustering of High Dimensional.. - Agrawal, Gehrke.. (1998)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC