Aliasing on the World Wide Web: Prevalence and Performance Implications

by Terence Kelly
http://www.eecs.umich.edu/~tpkelly/papers/p525-kelly.pdf

Abstract:

Aliasing occurs in Web transactions when requests containing different URLs elicit replies containing identical data payloads. Aliasing can cause cache misses, and there is reason to suspect that off-the-shelf Web authoring tools might increase aliasing on the Web. Existing research literature, however, says little about the prevalence of aliasing in user-initiated transactions or its impact on end-to-end performance in large multi-level cache hierarchies. This paper quantifies the performance impact and extent of aliasing using a large client trace from WebTV Networks. Fewer than 5% of reply payloads are aliased (referenced via multiple URLs) but over 54% of successful transactions involve aliased payloads. Aliased payloads account for under 3.1% of the trace's "working set size" (sum of payload sizes) but over 36% of bytes transferred. Aliasing accounts for roughly 10% of browser cache misses and 12–23% of proxy misses in the workload studied, assuming infinite-capacity caches. The performance impact of aliasing is similar in a large proxy trace from Compaq Corporation. Aliasing does not entirely explain the large number of redundant proxy-to-browser payload transfers previously reported in the WebTV system. We consider other possible causes of redundant transfers (e.g., reply metadata and browser cache management policies) and discuss a simple hop-by-hop protocol extension that completely eliminates all redundant transfers, regardless of cause.
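
The four prevalence figures quoted above (aliased payloads, transactions involving aliased payloads, share of working set size, share of bytes transferred) can be illustrated with a minimal sketch. It is not the paper's measurement code: the function name `alias_stats`, the in-memory trace of `(url, payload)` pairs, and the use of SHA-256 to decide that two payloads are identical are all assumptions made for illustration.

```python
import hashlib
from collections import defaultdict


def alias_stats(transactions):
    """Summarize aliasing in a trace of (url, payload_bytes) pairs.

    Hypothetical helper: a payload counts as aliased when it was returned
    for more than one distinct URL. Payload identity is approximated here
    by a SHA-256 digest, which is an assumption of this sketch.
    """
    urls_by_digest = defaultdict(set)   # digest -> distinct URLs seen
    hits_by_digest = defaultdict(int)   # digest -> number of transactions
    size_by_digest = {}                 # digest -> payload size in bytes

    for url, payload in transactions:
        digest = hashlib.sha256(payload).hexdigest()
        urls_by_digest[digest].add(url)
        hits_by_digest[digest] += 1
        size_by_digest[digest] = len(payload)

    aliased = {d for d, urls in urls_by_digest.items() if len(urls) > 1}

    def ratio(numerator, denominator):
        # Avoid division by zero on an empty trace or all-empty payloads.
        return numerator / denominator if denominator else 0.0

    # Working set size: each distinct payload counted once.
    working_set = sum(size_by_digest.values())
    # Bytes transferred: each payload counted once per transaction.
    transferred = sum(size_by_digest[d] * hits_by_digest[d]
                      for d in hits_by_digest)

    return {
        "aliased_payload_fraction": ratio(len(aliased), len(urls_by_digest)),
        "aliased_transaction_fraction": ratio(
            sum(hits_by_digest[d] for d in aliased),
            sum(hits_by_digest.values())),
        "aliased_working_set_fraction": ratio(
            sum(size_by_digest[d] for d in aliased), working_set),
        "aliased_bytes_transferred_fraction": ratio(
            sum(size_by_digest[d] * hits_by_digest[d] for d in aliased),
            transferred),
    }


# Toy trace with made-up URLs: the same logo is served under two URLs.
trace = [
    ("http://a.example/logo.gif", b"LOGO"),
    ("http://b.example/img/logo.gif", b"LOGO"),
    ("http://a.example/index.html", b"<html>home</html>"),
    ("http://a.example/logo.gif", b"LOGO"),
]
stats = alias_stats(trace)
```

In the toy trace, one of the two distinct payloads is aliased, yet it appears in three of the four transactions. This is the same kind of skew the abstract reports: a small share of payloads can account for a large share of transactions and bytes transferred.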
