Download:
|
by Zheng Zhang, Christos Karamanolis
in Proc. of the 20th Symposium on Reliable Distributed Systems
http://www.hpl.hp.com/research/itc/csl/ssp/papers/SRDS-2001.ps.gz
Add To MetaCart
Abstract:
A number of ongoing research projects follow a partition-based approach to provide highly scalable distributed storage services. These systems maintain namespaces that reference objects distributed across multiple locations in the system. Typically, atomic commitment protocols, such as 2-phase commit, are used for updating the namespace, in order to guarantee its consistency even in the presence of failures. Atomic commitment protocols are known to impose a high overhead to failure-free execution. Furthermore, they use conservative recovery procedures and may considerably restrict the concurrency of overlapping operations in the system. This paper proposes a set of new protocols implementing the fundamental operations in a distributed namespace. The protocols impose a minimal overhead to failure-free execution. They are robust against both communication and host failures, and use aggressive recovery procedures to re-execute incomplete operations. The proposed protocols are compared with their 2-phase commit counterparts and are shown to outperform them in all critical performance factors: communication roundtrips, synchronous I/O, operation concurrency.
Citations
|
703
|
Scale and performance in a distributed file system
– Howard, Kazar, et al.
- 1988
|
|
178
|
A Quorum-Based Commit Protocol
– Skeen
- 1982
|
|
79
|
Metadata Update Performance in File Systems
– Ganger, Patt
- 1994
|
|
72
|
Interposed Request Routing for Scalable Network Storage
– Anderson, Chase, et al.
- 2000
|
|
25
|
Archipelago: An island-based file system for highly available and scalable internet services
– Ji, Felten, et al.
- 2000
|
|
24
|
A New Presumed Commit Optimization for Two Phase Commit
– Lampson, Lomet
- 1993
|
|
8
|
CIFS: A Common Internet File System. Microsoft Interactive Developer
– Leach, Perry
- 1996
|
|
8
|
Implementing Journaling in a Linux Shared Disk File System
– Preslan, Barry, et al.
- 2000
|
|
7
|
Transaction Management in the R* Distributed Data Base Management System
– Mohan, Lindsay, et al.
- 1986
|
|
5
|
An Architecture for Scalable and Manageable File Services,” Hewlett-Packard Labs, Palo Alto
– Karamanolis, Liu, et al.
- 2001
|
|
4
|
Site Optimal Termination Protocols for a Distributed Database under Network Partitioning
– Cheung, Kameda
- 1985
|
|
2
|
Understanding the Linux Kernel. 1st ed
– Bovet, Cesati
- 2001
|
|
1
|
Cross-Partition Protocols in a Distributed File Service", Hewlet-Packard Labs
– Zhang, Karamanolis, et al.
- 2001
|