MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Executing Multiple Pipelined Data Analysis Operations in the Grid

Download:
pdf
by Matthew Spencer Z, Renato Ferreira Z, Michael Beynon Y, Tahsin Kurc Z, Umit Catalyurek Z, Alan Sussman Y, Joel Saltz Z
http://www.sc2002.org/paperpdfs/pap.pap258.pdf
Add To MetaCart

Abstract:

Processing of data in many data analysis applications can be represented as an acyclic, coarse grain data flow, from data sources to the client. This paper is concerned with scheduling of multiple data analysis operations, each of which is represented as a pipelined chain of processing on data. We define the scheduling problem for effectively placing components onto Grid resources, and propose two scheduling algorithms. Experimental results are presented using a visualization application. 1

Citations

373 Grid Information Services for Distributed Resource Sharing – Czajkowski, Fitzgerald, et al. - 2001
157 Marching cubes: a high resolution 3d surface reconstruction algorithm – Lorensen, Cline - 1987
133 Supporting Dynamic Data Structures on Distributed Memory Machines – Rogers, Carlisle, et al. - 1995
93 The AppLeS Project: A Status Report – Berman, Wolski - 1997
88 Static scheduling algorithms for allocating directed task graphs to multiprocessors – Kwok, Ahmad - 1999
87 Economy Driven Resource Management Architecture for – Buyya, Abramson, et al. - 2000
83 Determining average program execution times and their variance – Sarkar - 1989
62 Replica Selection in the Globus Data Grid – Vazhkudai, Tuecke, et al. - 2001
53 ACDS: Adapting computational data streams for high performance – Isert, Schwan - 2000
51 efficient data transport and replica management for high-performance data-intensive computing – “Secure
46 A Dynamic matching and scheduling algorithm for heterogeneous computing systems – MAHESWARAN, SIEGEL - 1998
30 dQUOB: Managing large data flows using dynamic embedded queries – Plale, Schwan - 2000
28 Task Scheduling Algorithms for Heterogeneous Processors – Topcuoglu, Hariri, et al. - 1999
27 Distributed processing of very large datasets with DataCutter – Beynon, Kurc, et al.
26 Monte Carlo simulation of neuromuscular transmitter release using MCell, a general simulator of cellular physiological processes – Stiles, Bartol, et al. - 1998
25 Design of a framework for data-intensive wide-area applications – Beynon, Kurc, et al. - 2000
25 A comparison study of static mapping heuristics for a class of meta-tasks on heterogeneous computing systems – Braun, Siegel, et al. - 1999
21 Dynamic, competitive scheduling of multiple DAGs in a distributed heterogeneous environment – Iverson, Ozguner - 1998
21 Armada: A parallel file system for computational – Oldfield, Kotz - 2001
20 Snp: A program for nonparametric time series analysis – Gallant, Tauchen - 1997
17 Harnessing the Capacity of Computational Grids for High Energy Physics – Basney, Livny, et al. - 2000
15 Optimizing retrieval and processing of multi-dimensional scientific datasets – Chang, Kurc, et al. - 2000
15 A study of deadline scheduling for client-server systems on the computational grid – Takefusa, Matsuoka, et al. - 2001
12 A component-based implementation of iso-surface rendering for visualizing large datasets – Beynon, Kurc, et al. - 2001
9 Processing large-scale multidimensional data in parallel and distributed environments – Beynon, Chang, et al. - 2002
8 Run-time support for scheduling parallel applications in heterogeneous nows – Weissman, Zhao - 1997
5 AIRES users guide and reference manual, version 2.0.0 – Sciutto - 1999
5 Bricks: A performance evaluation system for scheduling algorithms on the grids – Takefusa - 2001
4 High Performance Fortran interface to the Parallel C – Yang, Gannon, et al. - 1994
3 Object-space parallel polygonrendering on hypercubes – Kurc, Aykanat, et al. - 1998
3 Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures – Majumdar - 2000
3 The network weather service: A distributedresource performance forecasting service for metacomputing – Wolski, Spring, et al. - 1999
2 Efficient manipulation of large datasets on heterogeneous storage systems – Beynon, Kurc, et al. - 2002
2 Mapping heterogeneous task graphs ontp heterogeneous system graphs – Eshaghian, Wu - 1997
1 Modeling photochemical pollutionusing parallel and distributed computing platforms – Abramson, Cope, et al. - 1994
1 Supporting Data Intensive Applications in a Heterogeneous Environment – Beynon - 2001
1 G-Commerce: The study and building of computational economies for the computational grid – Plank, Wolski, et al. - 2000
1 Expressing and enforcing distributed storage sharing agreements – Zhao, Karamcheti - 2000