Zebra is a network file system that increases throughput by striping file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an approach similar to a log-structured file system. This provides high performance for writes of small files as well as for reads and writes of large files. Zebra also writes parity information in each stripe in the style of RAID disk arrays; this increases storage costs slightly but allows the system to continue operating even while a single storage server is unavailable. A prototype implementation of Zebra, built in the Sprite operating system, provides 4 to 5 times the throughput of the standard Sprite file system or NFS for large files and a 15% to 300% improvement for writing small files.
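The striping-with-parity idea described above can be illustrated with a minimal sketch. It is not the Zebra implementation: the function names (`make_stripes`, `rebuild_fragment`), the fixed fragment size, the zero padding of the final stripe, and the choice to always place parity in the last slot are assumptions made for illustration. The sketch treats one client's new data as a single byte stream, cuts it into fixed-size fragments, groups the fragments into stripes with one XOR parity fragment each, and rebuilds a fragment that lived on an unavailable server.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length fragments."""
    return bytes(x ^ y for x, y in zip(a, b))


def make_stripes(log: bytes, num_servers: int, fragment_size: int):
    """Cut a client's log into stripes of (num_servers - 1) data fragments
    plus one parity fragment. Hypothetical helper, not Zebra's API."""
    if num_servers < 2:
        raise ValueError("need at least one data server and one parity server")
    data_per_stripe = num_servers - 1
    stripe_bytes = data_per_stripe * fragment_size
    stripes = []
    for start in range(0, len(log), stripe_bytes):
        # Assumption: pad the last, partially filled stripe with zero bytes.
        chunk = log[start:start + stripe_bytes].ljust(stripe_bytes, b"\0")
        fragments = [chunk[i:i + fragment_size]
                     for i in range(0, stripe_bytes, fragment_size)]
        parity = reduce(xor_bytes, fragments)
        # Simplification: parity always occupies the last slot of the stripe.
        stripes.append(fragments + [parity])
    return stripes


def rebuild_fragment(stripe, missing_index: int) -> bytes:
    """Recover the fragment held by one unavailable server by XOR-ing
    every surviving fragment of the same stripe (data and parity)."""
    survivors = [f for i, f in enumerate(stripe) if i != missing_index]
    return reduce(xor_bytes, survivors)


# Usage: several small writes are batched into one log before striping.
client_log = b"small-file-A" + b"small-file-B" + b"x" * 10
stripes = make_stripes(client_log, num_servers=4, fragment_size=8)
recovered = rebuild_fragment(stripes[0], missing_index=1)
assert recovered == stripes[0][1]
```

Because the small files are concatenated into one log before striping, each stripe is written as a few large fragments rather than many small per-file pieces, which is the property the abstract credits for good small-file write performance. The parity fragment costs one extra fragment per stripe, so the storage overhead shrinks as the number of servers grows.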