| T. KELLY AND J. C. MOGUL. Aliasing on the World Wide Web: Prevalence and Performance Implications. In Proc. of WWW Conference (2002). |
....Identical files When an identical file appears multiple times in a dataset, it can be trivially encoded against another instance through the use of hash functions such as MD5. Past studies have investigated the prevalence of mirrors on the web [4] and techniques for suppressing duplicate payloads [12]. We chose to suppress duplicates from consideration in our analysis, since they are trivially handled through other means, except when a file contained in a zip archive is duplicated (since two zip files may have many identical files and some changed content, and our unzip rezip procedure would ....
....requires a separate cache of packets exchanged in the past, which may compete with the browser cache and other applications for resources. In some cases, the suppression of redundancy is at a very coarse level, for instance identifying when an entire payload is identical to an earlier payload [12], or when a particular region of a file has not changed. Examples of system taking this approach include rsync [26] a popular protocol for remote file copying, and the Lowbandwidth File System (LBFS) 17] However, there are applications for which identifying an appropriate base version is ....
Terence Kelly and Jeffrey Mogul. Aliasing on the World Wide Web: Prevalence and Performance Implications. In Proceedings of the 11th International World Wide Web Conference, May 2002.
....packets exchanged in the past, which may compete with the browser cache and other applications for resources. Bharat, et al. 3, 4] give methods for detecting mirrors (systematic replication of content) Mirrored URLs are likely to have similar content and are good candidates for delta encoding [12]. The extent of duplication on the web is estimated at 30 40 [7, 21] Here we discuss some possible new methods for improving the effectiveness of delta encoding in HTTP, in a method that allows for widespread deployment without the tight cache coupling of earlier approaches. We consider ....
....the delta encoding IFC, and specified in its own Internet draft [13] also can support multiple base files. Figure 1: A Venn diagram showing the relationship between clients, servers, and proxies using multi file delta encoding. The prevalence of aliasing in the web, as found by Kelly and Mogul [12], and the finding by Spring and Weatherall that about 20 of redundancy came from different sites [22] suggest that limiting deltas to resources from the same site may be too restrictive. However, Chan and Woo found that comparing across sites, based simply on URLs, was ineffective. We postulate ....
[Article contains additional citation context not shown here]
Terence Kelly and Jeffrey Mogul. Aliasing on the World Wide Web: Prevalence and Perfor- mance Implications. In Proceedings of the 11th International World Wide Web Conference, May 2002.
....trace. Note that #### only depends on the probabilities of occurrence of the different objects, and not on the relative order in which they occur. It has been shown that the number of distinct references in a segment of a trace increases with the size of the segment, even for very large traces [7]. Thus, to compare different traces, we need to normalize #### for the number of distinct objects referenced; the appropriate normalization is the largest possible value of ####, which is ### # # . Hence we define normalized entropy # as: ### # # (2) where # is the number of distinct ....
Terrence Kelly and Jeffrey Mogul. Aliasing on the world wide web: Prevalence and performance implications. In Performance Track, 11th Intl. World Wide Web Conference, 2002.
No context found.
T. KELLY AND J. C. MOGUL. Aliasing on the World Wide Web: Prevalence and Performance Implications. In Proc. of WWW Conference (2002).
No context found.
T. Kelly and J. Mogul. Aliasing on the World Wide Web: Prevalence and Performance Implications. In Proceedings of the 11th International World Wide Web Conference, May 2002.
No context found.
T. Kelly and J. Mogul. Aliasing on the World Wide Web: Prevalence and Performance Implications. In Proceedings of World Wide Web (WWW) Conference, 2002.
No context found.
T. Kelly and J. Mogul. Aliasing on the world wide web: Prevalence and performance implications. Proc. of the 11th International World Wide Web Conference (2002.
No context found.
T. Kelly and J. Mogul. Aliasing on the World Wide Web: Prevalence and Performance Implications. In Proceedings of the 11th International World Wide Web Conference, May 2002.
No context found.
Terence Kelly and Jeffrey Mogul. Aliasing on the World Wide Web: Prevalence and performance implications. In Proc. of the 11th Intl. WWW Conf., 2002.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC