Results 1 - 10 of 53,898
ANALYSIS ON DISTRIBUTED MEMORY PARALLEL SYSTEMS
"... A parallel computational scheme for analyzing large-scale molecular vibration on distributed memory computing platforms is presented in this paper. This method combines the implicitly restarted Lanczos algorithm with a state-of-art parallel sparse direct solver to compute a set of low frequency vibr ..."
partitioning scheme one can achieve scalable performance on lightly coupled distributed memory parallel systems. In particular, we demonstrate performance enhancement achieved by using the latency tolerant “selective inversion” scheme in the sparse triangular substitution phase of the computation.
Computation Migration: Enhancing Locality for Distributed-Memory Parallel Systems
"... We describe computation migration, a new technique that is based on compile-time program transformations, for accessing remote data in a distributed-memory parallel system. In contrast with RPC-style access, where the access is performed remotely, and with data migration, where the data is moved so ..."
Cited by 55 (4 self)
A Messages Density Monitoring Strategy For Distributed Memory Parallel Systems
"... This paper proposes a distributed strategy for monitoring the messages density within a distributed memory parallel system. Each processor keeps track of the communication delays with its contiguous neighbours; when this time exceeds a certain average value, the data requests are not sent to those n ..."
Cited by 2 (0 self)
Large-scale normal coordinate analysis on distributed memory parallel systems, technical report
- Edmond Chow, Steve Lee, Panayot Vassilevski, Carol Woodward Carnegie Mellon University
"... A parallel computational scheme for analyzing large-scale molecular vibration on distributed memory computing platforms is presented in this paper. This method combines the implicitly restarted Lanczos algorithm with a state-of-art parallel sparse direct solver to compute a set of low frequency vi ..."
Cited by 1 (0 self)
and data partitioning scheme one can achieve scalable performance on lightly coupled distributed memory parallel systems. In particular, we demonstrate performance enhancement achieved by using the latency tolerant “selective inversion” scheme in the sparse triangular substitution phase
Integrated Flow and Stress Using an Unstructured Mesh on Distributed Memory Parallel Systems.
, 1995
"... this paper, we give a brief overview of the domain decomposition method used for the parallelisation of UIFS and present some performance results. The paper concludes with an examination of the load balancing difficulties presented by the inhomogeneous problem space typical of multi-physical models ..."
Cited by 10 (10 self)
this paper, we give a brief overview of the domain decomposition method used for the parallelisation of UIFS and present some performance results. The paper concludes with an examination of the load balancing difficulties presented by the inhomogeneous problem space typical of multi-physical models such as fluid-structure interaction and metals casting. * Sponsored by the Engineering and Physical Science Research Council.
Benchmarking for Distributed Memory Parallel Systems: Gaining Insight from Numbers
"... The inherent difficulties of benchmarking parallel computers and the motivation behind the development of the GENESIS benchmark suite introduce the paper. Some of the broader issues that must be taken into account with any rigorous benchmarking effort on a distributed memory system are then discus ..."
Cited by 3 (0 self)
EXPERIENCES ON PARALLELIZING COMPILATION FOR DEVELOPMENT AND PORTING OF LARGE SCALE APPLICATIONS ON DISTRIBUTED MEMORY PARALLEL SYSTEMS
"... parallel applications and porting of legacy codes to parallel architectures have not yet gained a broad acceptance and diffusion. Common objections claim difficulty of performance tuning, limitation of its application to regular, data parallel computations, and lack of robustness of parallelizing HP ..."
Global Trees: A Framework for Linked Data Structures on Distributed Memory Parallel Systems
"... This paper describes the Global Trees (GT) system that provides a multi-layered interface to a global address space view of distributed tree data structures, while providing scalable performance on distributed memory systems. The Global Trees system utilizes coarse-grained data movement to enhance l ..."
Cited by 2 (1 self)
Distributed-Memory Multiprocessors
, 1993
"... Executing multi-threaded programs on multiprocessors can exploit the inherent parallelism among threads. However, the load imbalance and frequent remote data access factors may degrade the performance in distributed-memory parallel systems. To cope with these problems, one of the solutions is to e ..."
Thread Migration On Distributed-Memory Multiprocessors
- In Proc. of High Performance Computing Symp
, 1994
"... Executing multi-threaded programs on multiprocessors can exploit the inherent parallelism among threads. However, the load imbalance and frequent remote data access factors may degrade the performance in distributed-memory parallel systems. To cope with these problems, one of the solutions is to enh ..."
Cited by 3 (3 self)