MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility (2001) [482 citations — 27 self]

Download:
Download as a PDF | Download as a PS
by Antony Rowstron, Peter Druschel
http://www.research.microsoft.com/~antr/PAST/past-sosp.ps
Add To MetaCart

Abstract:

This paper presents and evaluates PAST, a large-scale peer-to-peer persistent storage utility. PAST is based on a self-configuring, Internet based overlay network of storage nodes that cooperatively route file queries, store multiple replicas of files, and cache additional copies of popular files. In the PAST system, storage nodes and files are each assigned uniformly distributed identifiers, and replicas of a file are stored at nodes whose identifier matches most closely the file's identifier. This statistical assignment of files to storage nodes approximately balances the number of files stored on each node. However, non-uniform storage node capacities and file sizes require more explicit storage load balancing to permit graceful behavior under high global storage utilization; and, non-uniform popularity of files requires caching to minimize fetch distance and to balance the query load. We present and evaluate PAST, with an emphasis on its storage management and caching system. Extensive trace-driven experiments show that the system minimizes fetch distance, that it balances the query load for popular files, and that it displays graceful degradation of performance as the global storage utilization increases beyond

Citations

2238 Chord: A scalable peerto-peer lookup service for internet applications – Stoica, Morris, et al. - 2001
1854 A scalable content-addressable network – Ratnasamy, Francis, et al. - 2001
1203 Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems – Rowstron, Druschel - 2001
733 A Measurement Study of Peer-to-Peer File Sharing Systems – Saroiu, Gummadi, et al. - 2002
678 Oceanstore: An architecture for global-scale persistent storage – Kubiatowicz, Bindel, et al. - 2000
675 Freenet: A Distributed Anonymous Information Storage – Clarke, Sandberg, et al. - 2000
610 Widearea cooperative storage with cfs – Dabek, Kaashoek, et al. - 2001
575 Web caching and Zipf-like distributions: Evidence and implications – Breslau, Cao, et al. - 1999
416 Cost-Aware WWW Proxy Caching Algorithms – Cao, Irani - 1997
398 Accessing nearby copies of replicated objects in a distributed environment – Plaxton, Rajaraman, et al. - 1997
396 A scalable location service for geographic ad-hoc routing – Li, Jannotti, et al. - 2002
380 Serverless network file systems – Anderson, Dahlin, et al. - 1995
375 Overcast: Reliable multicasting with an overlay network – Jannotti, Gifford, et al. - 2000
366 The design and implementation of an intentional naming system – Adjie-Winoto, Schwartz, et al. - 1999
213 Feasibility of a Serverless Distributed File System Deployed on an Existing Set of Desktop PCs – BOLOSKY, DOUCEUR, et al. - 2000
182 Tapestry: An infrastructure for fault-resilient wide-area location and routing – Zhao, Kubiatowicz, et al. - 2001
169 Past: A Large-Scale Persistent Peer-to-Peer Storage Utility – Druschel, Rowstron - 2001
158 The Free Haven project: Distributed anonymous storage service – DINGLEDINE, FREEDMAN, et al. - 2000
156 Publius: A robust, tamper-evident, censorship-resistant, web publishing system – WALDMAN, RUBIN, et al. - 2000
150 Separating key management from file system security – MAZIÈRES, KAMINSKY, et al. - 1999
148 The eternity service – Anderson - 1996
136 et al. Oceanstore: An architecture for global-scale persistent storage – Kubiatowicz - 2000
127 Fast and Secure Distributed Read-Only File System – Fu, Kaashoek, et al. - 2002
108 Designing a Global Name Service – Lampson - 1986
107 A tutorial on Reed-Solomon coding for faulttolerance in RAID-like systems – Plank - 1997
86 Piconet: Embedded mobile networking – Bennett, Clarke, et al. - 1997
43 Decentralizing a Global Naming Service for Improved Performance and Fault Tolerance – Cheriton, Mann - 1989
36 Univers: An Attribute-based Name Server – Bowman, Peterson, et al. - 1990
31 Discover: A resource discovery system based on content routing – Sheldon, Duda, et al. - 1995
25 Performance evaluation of redirection schemes in content distribution networks – Kangasharju, Roberts, et al. - 1999
24 Seamlessly Selecting the Best Copy from Internet-Wide Replicated Web Servers – Amir, Peterson, et al. - 1998
16 A measurement study of peer-to-peer sharing systems – Saroiu, Gummadi, et al. - 2002
15 A replicated architecture for the domain name system – Kangasharju, Ross - 2000
14 Serverless network systems – Anderson, Dahlin, et al. - 1995
8 Feasibility of a serverless distributed system deployed on an existing set of desktop PCs – Bolosky, Douceur, et al. - 2000
5 RFC 1309: Technical overview of directory services using the x.500 protocol – Reynolds - 1992
1 Pastry: Scalable, distributed content location and routing for large-scale peer-to-peer systems – Anonymous - 2001