MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  z

Download:
pdf | ps
by Honghui Lu, Y. Charlie Hu, Willy Zwaenepoel
http://moe.space.swri.edu/sc98/TechPapers/sc98_FullAbstracts/Lu1105/text.ps
Add To MetaCart

Abstract:

We describe an implementation of a sizable subset of OpenMP on networks of workstations (NOWs). By extending the availability of OpenMP to NOWs, we overcome one of its primary drawbacks compared to MPI, namely lack of portability to environments other than hardware shared memory machines. In order to support OpenMP execution on NOWs, our compiler targets a software distributed shared memory system (DSM) which provides multi-threaded execution and memory consistency. This paper presents two contributions. First, we identify two aspects of the current OpenMP standard that make an implementation on NOWs hard, and suggest simple modifications to the standard that remedy the situation. These problems reflect differences in memory architecture between software and hardware shared memory and the high cost of synchronization on NOWs. Second, we present performance results of a prototype implementation of an OpenMP subset on a NOW, and compare them with hand-coded software DSM and MPI results for the same applications on the same platform. We use five applications (ASCI Sweep3d, NAS 3D-FFT, SPLASH-2 Water, QSORT, and TSP) exhibiting various styles of parallelization, including pipelined execution, data parallelism, coarse-grained parallelism, and task queues. The measurements show little difference between OpenMP and hand-coded software DSM, but both are still lagging behind MPI. Further work will concentrate on compiler optimization to reduce these differences. 1

Citations

724 The SPLASH-2 programs: Characterization and methodological considerations – Woo, Ohara, et al. - 1995
462 The NAS Parallel Benchmarks – Bailey, Barton, et al. - 1991
372 TreadMarks: Shared Memory Computing on Networks of Workstations – Amza, Cox, et al. - 1996
115 Programming with POSIX threads – Butenhof - 1997
106 The SUIF compiler for scalable parallel machines – Amarasinghe, Anderson, et al. - 1995
89 Operating System Concepts – Peterson, Silberschatz - 1985
78 An Integrated Compile-Time/Run-Time Software Distributed Shared Memory System – Dwarkadas, Cox, et al. - 1996
45 Enhancing software DSM for compiler-parallelized applications – Keleher, Tseng - 1997
40 Quantifying the Performance Differences between PVM and TreadMarks – Lu, Cox, et al. - 1997
33 Evaluating the Performance of Software Distributed Shared Memory as a Target for Parallelizing Compilers – Cox, Dwarkadas, et al. - 1997