(Enter summary)
Abstract: Data mining is a problem for which cluster computing provides a competitive
alternative to specialized high performance computers for mining large data sets.
Distribued clusters provide a natural infrastructure for mining large distributed
data sets. Distributed clusters can be connected by commodity networks to form
what we call meta-clusters and by high performance networks to form what we
call super-clusters. In this paper, we describe the design of a system called
Papyrus which is designed... (Update)
Cited by: More
Multi-Database Mining - Shichao Zhang Xindong (2003)
(Correct)
A DataSpace Infrastructure for Astronomical Data - Grossman, Creel, Mazzucco.. (2001)
(Correct)
Merging Multiple Data Streams on Common Keys over.. - Mazzucco.. (2002)
(Correct)
Similar documents (at the sentence level):
15.3%: Papyrus: A System for Data Mining over Local and.. - Bailey, Grossman, .. (1999)
(Correct)
9.1%: A High Performance Implementation of the Data.. - Bailey, Creel.. (1999)
(Correct)
Active bibliography (related documents): More All
0.6: The Management and Mining of Multiple Predictive Models.. - Robert Grossman National (1999)
(Correct)
0.5: Impact of High Performance Sockets on Data Intensive.. - Pavan Balaji Jiesheng (2003)
(Correct)
0.5: Ubiquitous Data Stream Mining - Gaber, Krishnaswamy, Zaslavsky
(Correct)
Similar documents based on text: More All
0.2: WitanWeb and the Software Engineering of Web-based Applications - Johnson, MacKay
(Correct)
0.2: Evolution of the Supercluster-Void Network - Frisch, Einasto, Einasto.. (1994)
(Correct)
0.1: Papyrus: A History-Based VLSI Design Process Management System - Tzi-Cker Chiueh Randy (1994)
(Correct)
Related documents from co-citation: More All
8: Jam: Java agents for meta-learning over distributed databases
- Stolfo, Prodromidis et al. - 1997
6: The WoRLD: Knowledge Discovery from Multiple Distributed Databases
- Aronis, Kolluri et al. - 1997
6: Distributed cooperative bayesian learning strategies (context) - Yamanishi - 1997
BibTeX entry: (Update)
Grossman, R., Bailey, S., Kasif, S., Mon, D., Ramu, A., & Malhi, B. (1998). The preliminary design of papyrus: A system for high performance, distributed data mining over clusters, meta-clusters and superclusters. http://citeseer.ist.psu.edu/grossman99preliminary.html More
@misc{ grossman98preliminary,
author = "R. Grossman and S. Bailey and S. Kasif and D. Mon and A. Ramu and B. Malhi",
title = "The preliminary design of papyrus: A system for high performance",
text = "Grossman, R., Bailey, S., Kasif, S., Mon, D., Ramu, A., & Malhi, B. (1998).
The preliminary design of papyrus: A system for high performance, distributed
data mining over clusters, meta-clusters and superclusters.",
year = "1998",
url = "citeseer.ist.psu.edu/grossman99preliminary.html" }
Citations (may not include all citations):
2177
Programs for Machine Learning (context) - Quinlan - 1993
663
The Grid: Blueprint for a New Computing Infrastructure (context) - Foster, Kesselman - 1999
659
Globus: A Metacomputing Infrastructure Toolkit
- Foster, Kesselman - 1997
289
The Legion Vision of a Worldwide Virtual Computer (context) - Grimshaw, Wulf - 1997
262
From Data Mining to Knowledge Discovery: An Overview (context) - Fayyad, Piatetsky-Shapiro et al. - 1996
173
Networks of Workstations (context) - Anderson, Culler et al. - 1995
137
Machine Learning Research: Four Current Directions
- Dietterich - 1997
86
JAM: Java Agents for MetaLearning over Distributed Databases
- Stolfo, Prodromidis et al. - 1997
71
A Comparative Evaluation of Voting and MetaLearning on Parti..
- Chan, Stolfo - 1995
37
Transportable Information Agents
- Gray, Rus et al. - 1996
20
Distributed Data Mining Using an Agent Based Architecture (context) - Kargupta, Hamzaoglu et al. - 1997
12
The Management and Mining of Multiple Predictive Models Usin..
- Grossman, Bailey et al. - 1999
11
Object-Based Approaches (context) - Gannon, Grimshaw - 1999
8
An Architecture for Distributed Data Mining (context) - Subramonian, Parthasarathy
7
Data Mining and Tree-based Optimization
- Grossman, Bodek et al. - 1996
7
The Grid: Blueprint for a New Computing Infrastructure (context) - Moore, Baru et al. - 1999
5
MetaLearnig for Parallel Data Mining (context) - Guo, Rueger et al. - 1997
5
Mobile Agents With Aglets (context) - Lange, Oshima et al. - 1998
4
EM Learning on A Generalised Finite Mixture Model for Combin.. (context) - Xu, Jordan - 1993
4
Optimal Strategies for Distributed Data Mining using Data an.. (context) - Turinsky, Grossman
3
Data Mining Using Light Weight Object Management in Clustere.. (context) - Grossman, Bailey et al. - 1997
3
Neural Networks Volume (context) - Wolpert, Generalization - 1992
2
The Predictive Model Mark Up Language Version (context) - Model, Language et al. - 1998
2
Supporting the Data Mining Process with Next Generation Data.. (context) - Grossman - 1998
2
The Grid: Blueprint for a New Computing Infrastructure (context) - Guerin, Schulzrinne et al. - 1999
1
QoS Requirements for Internet (context) - Teitelbaum, Hanss
1
Papyrus: A System for Data Mining on Clusters (context) - Grossman
1
Scalable Data Mining from Vertically Partitioned Feature Spa.. (context) - Karagupta, Johnson et al.
http://www.w3c.org
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.rgrossman.com/pubs.htm): More
A High Performance Implementation of the Data.. - Bailey, Creel.. (1999)
(Correct)
Experimental Studies Using Photonic Data Services at.. - Grossman, Gu.. (2002)
(Correct)
Models for Free Nilpotent Lie Algebras - Grayson, Grossman (1988)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC