
Performance Modelling and Experimental Evaluation of Systems that Perform N Tasks using P fault-prone Processors in Parallel (2002)
Don Moses Gehan Weerasinghe



View or download:
uconn.edu/~lester/papers/gehan.ps

From: uconn.edu/~lester/

Abstract: This thesis presents a family of Markov models for analyzing the performance of parallel/distributed systems that execute a job consisting of N independent and idempotent tasks using P fault-prone processors in parallel. A prototype implemented using an extended version of ACMPI is used for experiments based on simulated task times and processor failures. The model is a Markov chain whose states represent the service and failure rates with k (0 ≤ k ≤ P) active processors. The...

Similar documents (at the sentence level):
7.5%:   A Distributed Fault-Tolerant Asynchronous Algorithm for.. - Weerasinghe, Lipsky (2001)
7.3%:   An Analytic Performance Model Of Parallel Systems That .. - Weerasinghe.. (2001)

Active bibliography (related documents):
2.9:   Theoretical and Experimental Results of Processing N Tasks.. - Weerasinghe, Lipsky (2002)
2.0:   Performance Analysis of Distributed Systems that Perform N.. - Weerasinghe, Lipsky
0.5:   Structured Performability Analysis Of Fault Tolerant Parallel.. - Dougherty (1998)

Similar documents based on text:
0.1:   Performing Tasks on Synchronous Restartable.. - Chlebus, De Prisco.. (2001)
0.1:   Moses: An Automatic Code Generation Tool for Client-Server.. - Fulbright (1994)

BibTeX entry:

@misc{ weerasinghe-performance,
  author = "Don Moses Gehan Weerasinghe",
  title = "Performance Modelling and Experimental Evaluation of Systems that Perform
    N Tasks using P fault-prone Processors in Parallel",
  year = "2002",
  url = "citeseer.ist.psu.edu/weerasinghe02performance.html" }

Citations (may not include all citations):
1749   An Introduction to Probability Theory and its Applications - Feller - 1971
912   MPI: A Message-Passing Interface Standard - Message Passing Interface Forum - 1995
288   Introduction to Parallel Computing: Design and Analysis of A.. - Kumar, Grama et al. - 1994
260   Validity of the Single Processor Approach to Achieving Large.. - Amdahl
176   Matrix-Geometric Solutions in Stochastic Models - Neuts - 1981
141   Using Process Groups to Implement Failure Detection in Async.. - Ricciardi, Birman - 1991
80   Performance and Reliability Analysis of Computer Systems: An.. - Sahner, Trivedi et al. - 1996
60   On Evaluating the Performability of Degradable Computing Sys.. - Meyer - 1980
50   Supporting Fault-Tolerant Parallel Programming in Linda - Bakken, Schlichting - 1995
42   Fault-Tolerant Parallel Computation - Kanellakis, Shvartsman - 1997
39   Queueing Theory: A Linear Algebraic Approach - Lipsky - 1992
37   High Performance Cluster Computing: Architectures and System.. - Buyya - 1999
23   Probability and Statistics with Reliability, Queuing, and Computer Science Applications - Trivedi - 1982
22   Starfish: Fault Tolerant Dynamic MPI Programs on Clusters of.. - Agbaria, Friedman - 1999
14   Analysis of a Fault-Tolerant Multiprocessor Scheduling Algor.. - Mosse, Melhem et al. - 1994
14   The Importance of Power-tail Distributions for Modeling Queu.. - Greiner, Jobmann et al. - 1999
14   Performing Work Efficiently in the Presence of Faults - Dwork, Halpern et al. - 1992
13   Performability Analysis: A New Algorithm - Nabli, Sericola - 1996
13   Egida: An Extensible Toolkit for Low-overhead Fault-Toleranc.. - Rao, Alvisi et al. - 1998
12   Determining Redundancy Levels for Fault Tolerant Real-Time S.. - Wang, Ramamritham et al. - 1995
12   Micro Time Cost Analysis of Parallel Computations - Qin, Sholl et al. - 1991
12   Executing Multithreaded Programs Efficiently - Blumofe - 1995
11   Analysis of a Composite Performance Reliability Measure for .. - Donatiello, Iyer - 1987
11   Adaptive Fault Tolerance and Graceful Degradation Under Dyna.. - Gonzalez, Shrikumar et al. - 1997
9   Metacomputing with MILAN - Baratloo, Dasgupta et al. - 1999
9   The Completion Time of Programs on Processors Subject to Fai.. - Trivedi, Chimento - 1993
9   Time-Optimal Message-Efficient Work Performance in the Prese.. - De Prisco, Mayer et al. - 1994
8   Performing Tasks on Synchronous Restartable Message-Passing .. - Chlebus, De Prisco et al. - 2001
8   Fault Detection Using Hints from the Socket Layer - Neves, Fuchs - 1997
8   A New Methodology for Calculating Distributions of Reward Ac.. - Qureshi, Sanders - 1996
8   A System for Fault Tolerant Execution of Data and Compute In.. - Smith, Shrivastava - 1996
7   The Hector Distributed Run-Time Environment - Russ, Robinson et al. - 1998
7   A Practical Model of Parallel Computation - Culler - 1996
6   Fault-Injection-Based Testing of Fault-Tolerant Algorithms i.. - Blough, Torii - 1997
6   The Performance of Parallel Computers: Order Statistics and .. - Lipsky, Zhang et al. - 1996
5   Diskless Checkpointing - Plank, Li et al. - 1998
5   A Distributed Fault-Tolerant Asynchronous Algorithm for Perf.. - Weerasinghe, Lipsky - 2001
5   Asynchronous Parallel Simulation of Parallel Programs - Prakash, Deelman et al. - 2000
5   Hierarchical Modeling of Availability in Distributed Systems - Hariri, Mutlu - 1995
4   Performance Evaluation of Gracefully Degrading Systems - Gay, Ketelsen - 1979
3   Value-Driven Resource Assignment in Object-Oriented Real-Tim.. - Bondavalli, Giandomenico et al. - 1997
3   An Asynchronous Model of Communication and Computation for M.. - Weerasinghe, Greenshields - 2000
3   An Analytic Performance Model Of Parallel Systems That Perfo.. - Weerasinghe, Antonios et al. - 2002
2   Free Performance and Fault Tolerance: using system idle capa.. - Tridandapani, Dahbura et al. - 1995
2   Performance of Fault Tolerant Networks of Workstations - Morris - 1999
2   Reliability Analysis of Clustered Computing Systems - Mendiratta - 1998
2   A fault-tolerant parallel heuristic for assignment problems - Talbi, Geib et al. - 1998
2   Application-Dependent Performability Evaluation of Fault-Tol.. - Dalibor, Hein et al. - 1996
2   A Generalized Analytic Performance Model Of Distributed Syst.. - Weerasinghe, Antonios et al. - 2002
2   Conjoint Simulation - a Technique for the Combined Performan.. - Hein, Goswami - 1996
1   Incorporating Thread-safe Communication into MPI - Weerasinghe, Greenshields - 1999
1   Fault-Tolerant Task Management and Load Redistribution on Ma.. - Ahmad, Ghafoor - 1992
1   Supporting Fault-Tolerance in Heterogenous Distributed Algor.. - Maheshwari, Ouyang - 1997
1   Optimization of the Processing Rate of an Unreliable Server - Tatashev - 1992
1   Theoretical and Experimental Results of Processing N Tasks i.. - Weerasinghe, Lipsky - 2002
1   On Markov Reward Modeling with FSPNs - Wolter, Zisowsky - 2000
1   Fault Tolerant Communications in Embedded Supercomputing - Efthivoulidis, Verentziotis et al. - 1998
1   The Simulation of a Fault Tolerant Computer System - Griffin, Comfort - 1985
1   Centralized Failure Injection for Distributed Fault-Tolerant .. - Alvarez, Cristian - 1997
1   Processor Allocation and Checkpointing Interval Selection in.. - Plank, Thomason - 2001
1   the Effect of Recovery Block Scheme on System Performance - Abulnaja, Hosseini et al. - 1997

Documents on the same site (http://www.engr.uconn.edu/~lester/):
An Analytic Performance Model Of Parallel Systems That .. - Weerasinghe.. (2001)
A Distributed Fault-Tolerant Asynchronous Algorithm for.. - Weerasinghe, Lipsky (2001)
Energy Levels and Classifications of Triply Excited States.. - Conneely, Lipsky
