In this paper we propose models for both temporal and spatial locality of reference in streams of requests arriving at Web servers. We show that simple models based on document popularity alone are insufficient for capturing either temporal or spatial locality. Instead, we rely on an equivalent, but numerical, representation of a reference stream: a stack distance trace. We show that temporal locality can be characterized by the marginal distribution of the stack distance trace, and we propose models for typical distributions and compare their cache performance to our traces. We also show that spatial locality in a reference stream can be characterized using the notion of self-similarity. Self-similarity describes longrange correlations in the dataset, which is a property that previous researchers have found hard to incorporate into synthetic reference strings. We show that stack distance strings appear to be stongly self-similar, and we provide measurements of the degree of self-similarity in our traces. Finally, we discuss methods for generating synthetic Web traces that exhibit the properties of temporal and spatial locality that we measured in our data. 1
|
1367
|
On the self-similar nature of Ethernet traffic
– Leland, Taqqu, et al.
- 1993
|
|
703
|
Scale and performance in a distributed file system
– Howard, Kazar, et al.
- 1988
|
|
680
|
The Fractal Geometry of Nature
– Mandelbrot
- 1983
|
|
542
|
Human Behavior and the Principle of Least-Effort
– Zipf
- 1949
|
|
537
|
Cache memories
– Smith
- 1982
|
|
297
|
Broadcast Disks: Data Management for Asymmetric Communications Environments
– Acharya, Alonso, et al.
- 1994
|
|
264
|
Cooperative Caching: Using Remote Client Memory to Improve File System Performance. Operating Systems Design and Implementation
– Dahlin, Wang, et al.
- 1994
|
|
255
|
Characteristics of WWW Client-based Traces
– Cunha, Bestavros, et al.
- 1995
|
|
240
|
The case for geographical push caching
– Gwertzman, Seltzer
- 1995
|
|
199
|
Removal policies in network caches for World-Wide Web documents
– Williams, Abrams, et al.
- 1996
|
|
194
|
Evaluation techniques for storage hierarchies
– Mattson, Gecsei, et al.
|
|
175
|
Techniques in Fractal Geometry
– Falconer
- 2003
|
|
166
|
A Caching Relay for the World Wide Web
– Glassman
- 1994
|
|
137
|
Caching Proxies: Limitations and Potentials
– Abrams, Standridge, et al.
- 1995
|
|
100
|
Crovella and Azer Bestavros. Self-similarity in World Wide Web traffic: Evidence and possible causes
– Mark
- 1997
|
|
90
|
Using speculation to reduce server load and service time on the WWW
– Bestavros
|
|
88
|
Multi-level caching in distributed file systems - or - your cache ain’t nuthin’ but trash
– Muntz, Honeyman
- 1992
|
|
84
|
A Case for Caching File Objects Inside Internetworks
– Danzig, Hall, et al.
- 1993
|
|
78
|
Fast Approximation of Self-Similar Network Traffic
– Paxson
- 1995
|
|
67
|
Modeling and simulation of self-similar variable bit rate compressed video: a uni ed approach
– Huang, Devetsikiotis, et al.
- 1995
|
|
61
|
Main memory caching of Web documents
– Markatos
- 1996
|
|
55
|
Demand-based document dissemination to reduce tra c and balance load in distributed information systems
– Bestavros
- 1995
|
|
49
|
On the fractal dimension of computer programs and its application to the prediction of the cache miss ratio
– Thiebaut
- 1989
|
|
39
|
Performance engineering of the World Wide Web: Application to dimensioning and cache design
– Bolot, Hoschka
- 1996
|
|
31
|
An efficient scheme for dynamic data replication
– Acharya, Zdonik
- 1993
|
|
25
|
Statistics for Long-Memory Processes. Monographs on Statistics and Applied Probability
– Beran
- 1994
|
|
24
|
Distance string models for program behavior
– Spirn
- 1976
|
|
22
|
Abdelsalam Heddaya, and Sulaiman Mirdad. Application level document caching in the internet
– Bestavros, Carter, et al.
- 1995
|
|
21
|
Caching in Large Scale Distributed File Systems
– Blaze
- 1993
|
|
21
|
Rangan. Information caching for delivery of personalized video programs on home entertainment channels
– Papadimitriou, Ramanathan, et al.
- 1994
|
|
19
|
Locality in page reference strings
– Shedler, ‘Iimg
- 1972
|
|
16
|
A prefetching protocol using client speculation for the www
– Bestavros, Cunha
- 1995
|
|
16
|
Properties of the working set model
– Denning, Schwartz
- 1972
|
|
13
|
Fractal nature of software-cache interaction
– Voldman, Mandelbrot, et al.
- 1983
|
|
12
|
Predicting document access in large, multimedia repositories
– Recker, Pitkow
- 1994
|