Abstract: Classification is an important data mining problem. Although classification is a well-studied problem, most of the current classification algorithms require that all or a portion of the entire dataset remain permanently in memory. This limits their suitability for mining over large databases. We present a new decision-tree-based classification algorithm, called SPRINT, that removes all of the memory restrictions and is fast and scalable. The algorithm has also been designed to be easily...
Cited by:
Pipelining of Fuzzy ARTMAP without Matchtracking.. - Castro, Secretan, ..
A Data Partitioning Approach to speed up the Fuzzy ARTMAP.. - Castro, al.
The Use of Emerging Patterns in the Analysis of Gene.. - Dong, Li, Wong (2003)
Similar documents (at the sentence level):
65.6%: SPRINT: A Scalable Parallel Classifier for Data Mining - Shafer, Agrawal, Mehta (1996)
Active bibliography (related documents):
0.5: Learning Features that Predict Cue Usage - Di Eugenio, Moore, Paolucci (1997)
0.2: Fast Similarity Search in the Presence of Noise.. - Agrawal, Lin, Sawhney, .. (1995)
0.2: A Linear Method for Deviation Detection in Large Databases - Arning, Agrawal, Raghavan (1996)
Similar documents based on text:
0.2: Parallel Classification on SMP Systems - Zaki, Ho, Agrawal (1998)
0.2: The Quest Data Mining System - Agrawal, Mehta, Shafer, Srikant.. (1996)
0.2: Statistical Behavior and Consistency of Classification Methods.. - Zhang (2001)
Related documents from co-citation:
46: SLIQ: A fast scalable classifier for data mining - Mehta, Agrawal et al. - 1996
41: Programs for machine learning - Quinlan - 1993
39: Classification and Regression Trees - Breiman, Friedman et al. - 1984
BibTeX entry:
J. Shafer, R. Agrawal, M. Mehta. SPRINT: A scalable parallel classifier for data mining. In 22nd VLDB Conference, Sept 1996. http://citeseer.ist.psu.edu/shafer96sprint.html
@inproceedings{ shafer96sprint,
author = "John C. Shafer and Rakesh Agrawal and Manish Mehta",
title = "{SPRINT}: {A} Scalable Parallel Classifier for Data Mining",
booktitle = "Proc. 22nd Int. Conf. Very Large Databases, {VLDB}",
month = "3--6~" # sep,
publisher = "Morgan Kaufmann",
editor = "T. M. Vijayaraman and Alejandro P. Buchmann and C. Mohan and Nandlal L. Sarda",
isbn = "1-55860-382-4",
pages = "544--555",
year = "1996",
url = "http://citeseer.ist.psu.edu/shafer96sprint.html" }
Documents on the same site (http://barbera.cnuce.cnr.it/~palmeri/datam/articles.html):
CACTUS - Clustering Categorical Data Using Summaries - Ganti, Gehrke, Ramakrishnan (1999)
Mining Very Large Databases - Ganti, Gehrke, Ramakrishnan (1999)
Automatic Subspace Clustering of High Dimensional.. - Agrawal, Gehrke.. (1998)