Results 1 - 10 of 53,898

ANALYSIS ON DISTRIBUTED MEMORY PARALLEL SYSTEMS

by Chao Yang, Padma Raghavan, Lloyd Arrowood, Donald W. Noid, Bobby G. Sumpter, Robert E. Tuzun
"... A parallel computational scheme for analyzing large-scale molecular vibration on distributed memory computing platforms is presented in this paper. This method combines the implicitly restarted Lanczos algorithm with a state-of-art parallel sparse direct solver to compute a set of low frequency vibr ..."
partitioning scheme one can achieve scalable performance on lightly coupled distributed memory parallel systems. In particular, we demonstrate performance enhancement achieved by using the latency tolerant “selective inversion” scheme in the sparse triangular substitution phase of the computation.

Computation Migration: Enhancing Locality for Distributed-Memory Parallel Systems

by Wilson C. Hsieh, Paul Wang, William E. Weihl
"... We describe computation migration, a new technique that is based on compile-time program transformations, for accessing remote data in a distributed-memory parallel system. In contrast with RPC-style access, where the access is performed remotely, and with data migration, where the data is moved so ..."
Cited by 55 (4 self)

A Messages Density Monitoring Strategy For Distributed Memory Parallel Systems

by Luís Paulo Santos, Alan Chalmers, Alberto Proença
"... This paper proposes a distributed strategy for monitoring the messages density within a distributed memory parallel system. Each processor keeps track of the communication delays with its contiguous neighbours; when this time exceeds a certain average value, the data requests are not sent to those n ..."
Cited by 2 (0 self)

Large-scale normal coordinate analysis on distributed memory parallel systems, technical report

by Chao Yang, Padma Raghavan, Lloyd Arrowood, Donald W. Noid, Bobby G. Sumpter, Robert E. Tuzun - Edmond Chow, Steve Lee, Panayot Vassilevski, Carol Woodward Carnegie Mellon University
"... A parallel computational scheme for analyzing large-scale molecular vibration on distributed memory computing platforms is presented in this paper. This method combines the implicitly restarted Lanczos algorithm with a state-of-art parallel sparse direct solver to compute a set of low frequency vi ..."
Cited by 1 (0 self)
and data partitioning scheme one can achieve scalable performance on lightly coupled distributed memory parallel systems. In particular, we demonstrate performance enhancement achieved by using the latency tolerant “selective inversion” scheme in the sparse triangular substitution phase

Integrated Flow and Stress Using an Unstructured Mesh on Distributed Memory Parallel Systems.

by K. McManus, M. Cross, S. Johnson, 1995
"... this paper, we give a brief overview of the domain decomposition method used for the parallelisation of UIFS and present some performance results. The paper concludes with an examination of the load balancing difficulties presented by the inhomogeneous problem space typical of multi-physical models such as fluid-structure interaction and metals casting. * Sponsored by the Engineering and Physical Science Research Council. ..."
Cited by 10 (10 self)

Benchmarking for Distributed Memory Parallel Systems: Gaining Insight from Numbers

by C. A. Addison, V. S. Getov, A. J. G. Hey, R. W. Hockney, I. C. Wolton
"... The inherent difficulties of benchmarking parallel computers and the motivation behind the development of the GENESIS benchmark suite introduce the paper. Some of the broader issues that must be taken into account with any rigorous benchmarking effort on a distributed memory system are then discus ..."
Cited by 3 (0 self)

EXPERIENCES ON PARALLELIZING COMPILATION FOR DEVELOPMENT AND PORTING OF LARGE SCALE APPLICATIONS ON DISTRIBUTED MEMORY PARALLEL SYSTEMS

by B. Di Martino, S. Briguglio, M. Celino, G. Fogaccia, G. Vlad, M. Briscolini
"... parallel applications and porting of legacy codes to parallel architectures have not yet gained a broad acceptance and diffusion. Common objections claim difficulty of performance tuning, limitation of its application to regular, data parallel computations, and lack of robustness of parallelizing HP ..."

Global Trees: A Framework for Linked Data Structures on Distributed Memory Parallel Systems

by D. Brian Larkins, James Dinan, Sriram Krishnamoorthy, Srinivasan Parthasarathy, Atanas Rountev
"... This paper describes the Global Trees (GT) system that provides a multi-layered interface to a global address space view of distributed tree data structures, while providing scalable performance on distributed memory systems. The Global Trees system utilizes coarse-grained data movement to enhance l ..."
Cited by 2 (1 self)

Distributed-Memory Multiprocessors

by Vernon J. Rego, 1993
"... Executing multi-threaded programs on multiprocessors can exploit the inherent parallelism among threads. However, the load imbalance and frequent remote data access factors may degrade the performance in distributed-memory parallel systems. To cope with these problems, one of the solutions is to e ..."

Thread Migration On Distributed-Memory Multiprocessors

by Janche Sang, Geoffrey W. Peters, Vernon Rego - In Proc. of High Performance Computing Symp , 1994
"... Executing multi-threaded programs on multiprocessors can exploit the inherent parallelism among threads. However, the load imbalance and frequent remote data access factors may degrade the performance in distributed-memory parallel systems. To cope with these problems, one of the solutions is to enh ..."
Cited by 3 (3 self)

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University