P. Liu, "The parallel implementation of N-body algorithms," PhD thesis, Yale University, 1994.
....used different algorithms, different problem sizes and parameters controlling the accuracies. The Barnes-Hut O(N log N) algorithm has been implemented using the message-passing programming paradigm by Salmon and Warren [14, 15, 16] on the Intel Touchstone Delta and by Liu and Bhatt [17, 18] on the CM-5. Both groups used assembly language for time-critical kernels and achieved efficiencies in the range 24-28% and 30%, respectively. Zhao and Johnsson [19] developed a data-parallel implementation of Zhao's method on the CM-2, and achieved an efficiency of 12% for expansions in ....
....are evaluated hierarchically.

Table 1. Efficiencies of various parallel implementations of hierarchical N-body methods

| Author | Programming model | Efficiency (% of peak) | Machine |
|---|---|---|---|
| Salmon, Warren-Salmon [14, 15, 16] | F77, message passing | 24-28 | 512-node Intel Delta |
| Liu-Bhatt [17, 18] | C, message passing, assembly | 30 | 256-node CM-5 |
| Leathrum-Board [20, 21] | F77 | 20 | 32-node KSR-1 |
| Elliott-Board [22] | F77 | 14 | 32-node KSR-1 |
| Zhao-Johnsson [19] | Lisp, assembly | 12 | 256-node (8k) CM-2 |
| Hu-Johnsson [this article] | CMF | 27-35 | 256-node CM-5 |

The hierarchy of computational elements ....
P. Liu, "The parallel implementation of N--body algorithms," PhD thesis, Yale University, 1994.
....method on any parallel machine as well as the first implementation of an O(N) N-body algorithm in a data-parallel language. Barnes-Hut O(N log 2 N) has been implemented using the message-passing programming paradigm by Salmon et al. [25, 32, 33] on the Intel Touchstone Delta and by Liu [21] on the CM-5. Salmon et al. achieved efficiencies in the range 24-28%, while Liu, using assembly language for time-critical kernels, achieved 30% efficiency. Zhao and Johnsson developed a data-parallel implementation of Zhao's method on the CM-2, and achieved an efficiency of 12% for ....
Pangfeng Liu. The parallel implementation of N--body algorithms. PhD thesis, Yale University, 1994.
....the locality of reference is to avoid multiple requests of data from the same node by aggregating all such requests into a single network request followed by local sharing of data. This technique is used, for instance, in the gather-scatter routines of the CMSSL, and has also been used by Liu [21] and Salmon et al. [28]. In many design and analysis problems for which MPPs are used, a great deal is known about data reference patterns. For arrays and other easily described data structures this global knowledge is exceedingly valuable, and the routing and scheduling of data can be made ....
Pangfeng Liu. The parallel implementation of N--body algorithms. PhD thesis, Yale University, 1994.
....programming models, as summarized in Table 3.4. Barnes and Hut's O(N log N) method has been implemented using the message-passing programming paradigm by Salmon et al. [Sal90, WS92, WS93] on the Intel Touchstone Delta and by Liu and Bhatt [Liu94, LB94] on the CM-5. Salmon et al. achieved efficiencies in the range 24-28%, while Liu, using assembly language for time-critical kernels, achieved 30% efficiency. Zhao and Johnsson developed a SIMD implementation of Zhao's method on the CM-2, and achieved an efficiency of 12% for ....
Pangfeng Liu. The parallel implementation of N--body algorithms. PhD thesis, Yale University, 1994.
....in memory usage. Table 1 gives a summary of sequential and parallel implementations of hierarchical N-body methods. Barnes and Hut's O(N log N) method has been implemented using the message-passing programming paradigm by Salmon et al. [27, 33, 34] on the Intel Touchstone Delta and by Liu [23] on the CM-5. Salmon et al. achieved efficiencies in the range 24-28%, while Liu, using assembly language for time-critical kernels, achieved 30% efficiency. Zhao and Johnsson developed a data-parallel implementation of Zhao's method on the CM-2, and achieved an efficiency of 12% for ....
....paper.

| Author | Method / error | Prog. model | N | t (sec.) | P | Efficiency | Machine |
|---|---|---|---|---|---|---|---|
| Salmon [27] | BH, quadrupole | msg. passing | | | | | Ncube |
| Warren-Salmon 92 [33] | BH, quadrupole | msg. passing | 8.78M | 77 | 512 | 26 | Intel Delta |
| Warren-Salmon 93 [34] | BH, ε_1 = 10^{-3} | msg. passing | 8.78M | 114 | 512 | 28 | Intel Delta |
| Liu [23] | BH, quadrupole | msg. passing | 10M | 59 | 256 | 30 | CM-5 |
| Singh et al. [30] | BH | | | | | | DASH, KSR-1 |
| Leathrum-Board [22, 18] | GR, p=8 | | 100,000 | 500 | 1 | 65 | IBM RS 6000 360 |
| | GR, p=8 | | 1M | 1520 | 32 | 20 | KSR-1 |
| Elliott-Board [9, 18] | GR, FFT, p=8 | | 100,000 | 400 | 1 | 73 | IBM RS 6000 360 |
| | GR, FFT, p=8 | | 1M | 1420 | 32 | 14 | KSR-1 |

Schmidt-Lee ....
Pangfeng Liu. The parallel implementation of N--body algorithms. PhD thesis, Yale University, 1994.
....Costzones (Singh, Hennessy and Gupta 1992)
3. Tree Traversal: Latency-hiding tree traversal (Warren and Salmon 1992)
4. Locally essential data: (Receiver oriented) Gather essential data for force computation (Warren and Salmon 1992); (Sender oriented) Send essential data to processor needing it (Liu P. 1994)
5. Incremental Updates: (Sender oriented) Incremental tree updates (Liu P. 1994); Incremental updates of locally essential data (Liu P. 1994)
Table 2.1: Parallel approaches to N-body treecodes
2.1.3 Data Partitioning. A distributed octree or a k-d tree representation is maintained on a set of processors. Each processor owns ....
[Article contains additional citation context not shown here]
P. Liu. The Parallel Implementation of N-body Algorithms. PhD thesis, Rutgers University, 1994.