The Implementation of Cashmere 1
Abstract:
Cashmere is a software distributed shared memory (SDSM) system designed for today's high performance cluster architectures. These clusters typically consist of symmetric multiprocessors (SMPs) connected by a low-latency system area network. Cashmere introduces several novel techniques for delegating intra-node sharing to the hardware coherence mechanism available within the SMPs, and also for leveraging advanced network features such as remote memory access. The efficacy of the Cashmere design has been borne out through head-to-head comparisons with other well-known, mature SDSMs and with Cashmere variants that do not take advantage of the various hardware features. In this paper, we describe the implementation of the Cashmere SDSM. Our discussion is organized around the core components that comprise Cashmere. We discuss both component interactions and lowlevel implementation details. We hope this paper provides researchers with the background needed to
Citations
| 383 | TreadMarks: Shared memory computing on networks of workstations – Amza, Cox, et al. - 1996 |
| 122 | Cashmere-2L: Software coherent shared memory on a dlustered remote write network – Stets, Dwarkadas, et al. - 1997 |
| 87 | ATOM: A Flexible Interface for Building High Performance Program Analysis – Eustace, Srivastava - 1994 |
| 49 | VM-Based Shared Memory on Low-Latency, Remote-MemoryAccess Networks – Kontothanassis, Hunt, et al. - 1997 |
| 32 | Memory Channel: An Optimized Cluster Interconnect – Gillett - 1996 |
| 21 | Architecture and Implementation of Memory Channel 2 – Fillo, Gillett - 1997 |
| 14 | The Effect of Network Total Order, Broadcast, and Remote-Write Capability on Network-Based Shared Memory Computing – Stets, Dwarkadas, et al. - 2000 |
| 2 | Understanding Parallel Program Performance Using Cause-Effect Analysis – Meira - 1997 |
| 2 | Leveraging Symmetric Multiprocessors and System Area Networks in Software Distributed Shared – Stets - 1999 |

