Abstract---Caching dynamic pages at a server site is beneficial in reducing server resource demands and it also helps dynamic page caching at proxy sites. Previous work has used fine-grain dependence graphs among individual dynamic pages and underlying data sets to enforce result consistency. This paper proposes a complementary solution for applications that require coarse-grain cache management. The key idea is to partition dynamic pages into classes based on URL patterns so that an application can specify page identification and data dependence, and invoke invalidation for a class of dynamic pages. To make this scheme time-efficient with small space requirement, lazy invalidation is used to minimize slow disk accesses when IDs of dynamic pages are stored in memory with a digest format. Selective precomputing is further proposed to refresh stale pages and smoothen load peaks. A data structure is developed for efficient URL class searching during lazy or eager invalidation. This paper also presents design and implementation of a caching system called Cachuma which integrates the above techniques, runs in tandem with standard Web servers, and allows Web sites to add dynamic page caching capability with minimal changes. The experimental results show that the proposed techniques are effective in supporting coarse-grain cache management and reducing server response times for tested applications. I.
|
556
|
Generating Representative Web Workloads for Network and Server Performance Evaluation
– Barford, Crovella
- 1998
|
|
554
|
Web Caching and Zipf-like Distributions: Evidence and Implications
– Breslau, Cao, et al.
- 1999
|
|
542
|
Human Behavior and the Principle of Least Effort
– Zipf
- 1949
|
|
492
|
Art of Computer Programming, Volume 3: Sorting and Searching (2nd Edition
– Knuth
- 1998
|
|
491
|
The MD5 Message-Digest Algorithm
– Rivest
- 1992
|
|
401
|
Cost-Aware WWW Proxy Caching Algorithms
– Cao, Irani
- 1997
|
|
310
|
Clusterbased scalable network services
– Fox, Gribble, et al.
- 1997
|
|
240
|
Locality-aware Request Distribution in Cluster-based Network Servers
– Pai, Aron, et al.
- 1998
|
|
176
|
Exploring the bounds of web latency reduction from caching and prefetching
– Kroeger, Long, et al.
- 1997
|
|
151
|
IO-Lite: A Unified I/O Buffering and Caching System
– Pai, Druschel, et al.
- 1999
|
|
142
|
World-Wide Web Cache Consistency
– Gwertzman, Seltzer
- 1996
|
|
138
|
Active Cache: Caching Dynamic Contents on the Web
– Cao, Zhang, et al.
- 1998
|
|
124
|
Improving Web Server Performance by Caching Dynamic Data
– Iyengar, Challenger
- 1997
|
|
113
|
A scalable system for consistently caching dynamic web data
– Challenger, Iyengar, et al.
- 1999
|
|
92
|
Scalable Kernel Performance for Internet Servers under Realistic Loads
– Banga, Mogul
- 1998
|
|
86
|
A Scalable Web Cache Consistency Architecture
– Yu, Breslau, et al.
- 1999
|
|
83
|
Optimistic deltas for WWW latency reduction
– Banga, Douglis, et al.
- 1997
|
|
77
|
HPP: HTML macropreprocessing to support dynamic document caching
– Douglis, Haro, et al.
- 1997
|
|
77
|
Adaptive Leases: A Strong Consistency Mechanism for the World Wide
– Duvvuri, Shenoy, et al.
- 2000
|
|
76
|
Web prefetching between lowbandwidth clients and proxies: Potential and performance
– Fan, Cao, et al.
- 1999
|
|
53
|
Hierarchical cache consistency in a WAN
– YIN, ALVISI, et al.
- 1999
|
|
47
|
Business-oriented resource management policies for e-commerce servers. Performance Evaluation 42
– MENASCÉ, ALMEIDA, et al.
- 2000
|
|
46
|
Analysis of task assignment policies in scalable distributed Web-server systems
– COLAJANNI, YU, et al.
- 1998
|
|
42
|
Cooperative caching of dynamic content on a distributed web server
– Holmedahl, Smith, et al.
- 1998
|
|
35
|
Scalable web caching of frequently updated objects using reliable multicast
– Li, Cheriton
- 1999
|
|
27
|
Transparent result caching
– Vahdat, Anderson
- 1997
|
|
22
|
Squid Internet Object Cache
– Wessels
- 1996
|
|
22
|
Adaptive load sharing for clustered digital library servers
– Zhu, Yang, et al.
- 2000
|
|
21
|
SWEB: Towards a Scalable WWW Server on MultiComputers
– Andresen, Yang, et al.
- 1996
|
|
20
|
Neptune: Scalable replication management and programming support for cluster-based network services
– Shen, Yang, et al.
- 2001
|
|
19
|
Scheduling Optimization for Resource-Intensive Web Requests on Server Clusters
– Zhu, Smith, et al.
- 1999
|
|
18
|
Exploiting Result Equivalence in Caching Dynamic Web Content
– Smith, Acharya, et al.
- 1999
|
|
9
|
The WWW prototype of the Alexandria Digital Library
– Andresen
- 1995
|
|
6
|
Adaptive Algorithms for Cache-efficient Trie Search
– Acharya, Zhu, et al.
- 1999
|
|
1
|
Menasce and Vigilio Almeida, Scaling for E-business: technologies, models, performance, and capacity planning
– Daniel
- 2000
|