  Characterizing reference locality in the WWW (1996) [174 citations — 23 self]

by Azer Bestavros, Mark Crovella, Adriana De Oliveira
http://www.cs.bu.edu/fac/best/res/papers/pdis96.ps

Abstract:

In this paper we propose models for both temporal and spatial locality of reference in streams of requests arriving at Web servers. We show that simple models based on document popularity alone are insufficient for capturing either temporal or spatial locality. Instead, we rely on an equivalent, but numerical, representation of a reference stream: a stack distance trace. We show that temporal locality can be characterized by the marginal distribution of the stack distance trace, and we propose models for typical distributions and compare their cache performance to our traces. We also show that spatial locality in a reference stream can be characterized using the notion of self-similarity. Self-similarity describes long-range correlations in the dataset, which is a property that previous researchers have found hard to incorporate into synthetic reference strings. We show that stack distance strings appear to be strongly self-similar, and we provide measurements of the degree of self-similarity in our traces. Finally, we discuss methods for generating synthetic Web traces that exhibit the properties of temporal and spatial locality that we measured in our data.
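
The stack distance trace mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes an LRU stack, a 1-based depth convention, equal-size documents, and records first references as None, and the names stack_distances and lru_hit_ratio are hypothetical. Under those assumptions, a request hits in an LRU cache of C documents exactly when its stack distance is at most C, which is why the marginal distribution of the trace is enough to estimate cache performance.

```python
from typing import Hashable, Iterable, List, Optional


def stack_distances(requests: Iterable[Hashable]) -> List[Optional[int]]:
    """Convert a request stream into LRU stack distances.

    Each entry is the 1-based depth at which the requested document was
    found in the LRU stack, or None if it had not been requested before.
    (Both conventions are assumptions of this sketch.)
    """
    stack: List[Hashable] = []  # most recently requested document first
    distances: List[Optional[int]] = []
    for doc in requests:
        if doc in stack:
            depth = stack.index(doc) + 1  # 1-based depth in the stack
            stack.pop(depth - 1)          # remove it from its old position
        else:
            depth = None                  # first reference to this document
        distances.append(depth)
        stack.insert(0, doc)              # requested document moves to the top
    return distances


def lru_hit_ratio(distances: List[Optional[int]], cache_size: int) -> float:
    """Fraction of requests that would hit in an LRU cache holding
    cache_size equal-size documents."""
    if not distances:
        return 0.0
    hits = sum(1 for d in distances if d is not None and d <= cache_size)
    return hits / len(distances)


if __name__ == "__main__":
    trace = ["a", "b", "a", "c", "b", "b"]  # made-up request stream
    dists = stack_distances(trace)
    print(dists)
    print(lru_hit_ratio(dists, cache_size=2))
```

The list-based stack keeps the sketch short at the cost of linear-time lookups per request; a long trace would call for a tree-based or other indexed structure.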

Citations

1367 On the self-similar nature of Ethernet traffic – Leland, Taqqu, et al. - 1993
703 Scale and performance in a distributed file system – Howard, Kazar, et al. - 1988
680 The Fractal Geometry of Nature – Mandelbrot - 1983
542 Human Behavior and the Principle of Least-Effort – Zipf - 1949
537 Cache memories – Smith - 1982
297 Broadcast Disks: Data Management for Asymmetric Communications Environments – Acharya, Alonso, et al. - 1994
264 Cooperative Caching: Using Remote Client Memory to Improve File System Performance – Dahlin, Wang, et al. - 1994
255 Characteristics of WWW Client-based Traces – Cunha, Bestavros, et al. - 1995
240 The case for geographical push caching – Gwertzman, Seltzer - 1995
199 Removal policies in network caches for World-Wide Web documents – Williams, Abrams, et al. - 1996
194 Evaluation techniques for storage hierarchies – Mattson, Gecsei, et al.
175 Techniques in Fractal Geometry – Falconer - 2003
166 A Caching Relay for the World Wide Web – Glassman - 1994
137 Caching Proxies: Limitations and Potentials – Abrams, Standridge, et al. - 1995
100 Self-similarity in World Wide Web traffic: Evidence and possible causes – Crovella, Bestavros - 1997
90 Using speculation to reduce server load and service time on the WWW – Bestavros
88 Multi-level caching in distributed file systems - or - your cache ain’t nuthin’ but trash – Muntz, Honeyman - 1992
84 A Case for Caching File Objects Inside Internetworks – Danzig, Hall, et al. - 1993
78 Fast Approximation of Self-Similar Network Traffic – Paxson - 1995
67 Modeling and simulation of self-similar variable bit rate compressed video: a unified approach – Huang, Devetsikiotis, et al. - 1995
61 Main memory caching of Web documents – Markatos - 1996
55 Demand-based document dissemination to reduce traffic and balance load in distributed information systems – Bestavros - 1995
49 On the fractal dimension of computer programs and its application to the prediction of the cache miss ratio – Thiebaut - 1989
39 Performance engineering of the World Wide Web: Application to dimensioning and cache design – Bolot, Hoschka - 1996
31 An efficient scheme for dynamic data replication – Acharya, Zdonik - 1993
25 Statistics for Long-Memory Processes. Monographs on Statistics and Applied Probability – Beran - 1994
24 Distance string models for program behavior – Spirn - 1976
22 Application level document caching in the internet – Bestavros, Carter, et al. - 1995
21 Caching in Large Scale Distributed File Systems – Blaze - 1993
21 Information caching for delivery of personalized video programs on home entertainment channels – Papadimitriou, Ramanathan, et al. - 1994
19 Locality in page reference strings – Shedler, Tung - 1972
16 A prefetching protocol using client speculation for the www – Bestavros, Cunha - 1995
16 Properties of the working set model – Denning, Schwartz - 1972
13 Fractal nature of software-cache interaction – Voldman, Mandelbrot, et al. - 1983
12 Predicting document access in large, multimedia repositories – Recker, Pitkow - 1994