This paper presents and evaluates PAST, a large-scale peer-to-peer persistent storage utility. PAST is based on a self-configuring, Internet based overlay network of storage nodes that cooperatively route file queries, store multiple replicas of files, and cache additional copies of popular files. In the PAST system, storage nodes and files are each assigned uniformly distributed identifiers, and replicas of a file are stored at nodes whose identifier matches most closely the file's identifier. This statistical assignment of files to storage nodes approximately balances the number of files stored on each node. However, non-uniform storage node capacities and file sizes require more explicit storage load balancing to permit graceful behavior under high global storage utilization; and, non-uniform popularity of files requires caching to minimize fetch distance and to balance the query load. We present and evaluate PAST, with an emphasis on its storage management and caching system. Extensive trace-driven experiments show that the system minimizes fetch distance, that it balances the query load for popular files, and that it displays graceful degradation of performance as the global storage utilization increases beyond
|
2238
|
Chord: A scalable peerto-peer lookup service for internet applications
– Stoica, Morris, et al.
- 2001
|
|
1854
|
A scalable content-addressable network
– Ratnasamy, Francis, et al.
- 2001
|
|
1203
|
Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems
– Rowstron, Druschel
- 2001
|
|
733
|
A Measurement Study of Peer-to-Peer File Sharing Systems
– Saroiu, Gummadi, et al.
- 2002
|
|
678
|
Oceanstore: An architecture for global-scale persistent storage
– Kubiatowicz, Bindel, et al.
- 2000
|
|
675
|
Freenet: A Distributed Anonymous Information Storage
– Clarke, Sandberg, et al.
- 2000
|
|
610
|
Widearea cooperative storage with cfs
– Dabek, Kaashoek, et al.
- 2001
|
|
575
|
Web caching and Zipf-like distributions: Evidence and implications
– Breslau, Cao, et al.
- 1999
|
|
416
|
Cost-Aware WWW Proxy Caching Algorithms
– Cao, Irani
- 1997
|
|
398
|
Accessing nearby copies of replicated objects in a distributed environment
– Plaxton, Rajaraman, et al.
- 1997
|
|
396
|
A scalable location service for geographic ad-hoc routing
– Li, Jannotti, et al.
- 2002
|
|
380
|
Serverless network file systems
– Anderson, Dahlin, et al.
- 1995
|
|
375
|
Overcast: Reliable multicasting with an overlay network
– Jannotti, Gifford, et al.
- 2000
|
|
366
|
The design and implementation of an intentional naming system
– Adjie-Winoto, Schwartz, et al.
- 1999
|
|
213
|
Feasibility of a Serverless Distributed File System Deployed on an Existing Set of Desktop PCs
– BOLOSKY, DOUCEUR, et al.
- 2000
|
|
182
|
Tapestry: An infrastructure for fault-resilient wide-area location and routing
– Zhao, Kubiatowicz, et al.
- 2001
|
|
169
|
Past: A Large-Scale Persistent Peer-to-Peer Storage Utility
– Druschel, Rowstron
- 2001
|
|
158
|
The Free Haven project: Distributed anonymous storage service
– DINGLEDINE, FREEDMAN, et al.
- 2000
|
|
156
|
Publius: A robust, tamper-evident, censorship-resistant, web publishing system
– WALDMAN, RUBIN, et al.
- 2000
|
|
150
|
Separating key management from file system security
– MAZIÈRES, KAMINSKY, et al.
- 1999
|
|
148
|
The eternity service
– Anderson
- 1996
|
|
136
|
et al. Oceanstore: An architecture for global-scale persistent storage
– Kubiatowicz
- 2000
|
|
127
|
Fast and Secure Distributed Read-Only File System
– Fu, Kaashoek, et al.
- 2002
|
|
108
|
Designing a Global Name Service
– Lampson
- 1986
|
|
107
|
A tutorial on Reed-Solomon coding for faulttolerance in RAID-like systems
– Plank
- 1997
|
|
86
|
Piconet: Embedded mobile networking
– Bennett, Clarke, et al.
- 1997
|
|
43
|
Decentralizing a Global Naming Service for Improved Performance and Fault Tolerance
– Cheriton, Mann
- 1989
|
|
36
|
Univers: An Attribute-based Name Server
– Bowman, Peterson, et al.
- 1990
|
|
31
|
Discover: A resource discovery system based on content routing
– Sheldon, Duda, et al.
- 1995
|
|
25
|
Performance evaluation of redirection schemes in content distribution networks
– Kangasharju, Roberts, et al.
- 1999
|
|
24
|
Seamlessly Selecting the Best Copy from Internet-Wide Replicated Web Servers
– Amir, Peterson, et al.
- 1998
|
|
16
|
A measurement study of peer-to-peer sharing systems
– Saroiu, Gummadi, et al.
- 2002
|
|
15
|
A replicated architecture for the domain name system
– Kangasharju, Ross
- 2000
|
|
14
|
Serverless network systems
– Anderson, Dahlin, et al.
- 1995
|
|
8
|
Feasibility of a serverless distributed system deployed on an existing set of desktop PCs
– Bolosky, Douceur, et al.
- 2000
|
|
5
|
RFC 1309: Technical overview of directory services using the x.500 protocol
– Reynolds
- 1992
|
|
1
|
Pastry: Scalable, distributed content location and routing for large-scale peer-to-peer systems
– Anonymous
- 2001
|