Pei Cao
Department of Computer Sciences
University of Wisconsin, Madison
1210 West Dayton Street
Madison, WI 53706 USA
cao@cs.wisc.edu
The WisWeb research group at the University of Wisconsin-Madison is focusing on two aspects of Web characterization: studying Web proxy traffic and building a Web proxy benchmark. This paper reports our progress and current plans.
Using six traces from proxies at academic institutions, corporations, and ISPs, we have studied a range of characteristics of the requests seen by the proxies. The traces include a 26-day proxy log from DEC, a 19-day trace from UC Berkeley, a three-month trace from the CS Department of the Università di Pisa, Italy, a 7-day trace from Questnet (which operates parent proxies serving child proxies in Australia), a one-day log from NLANR's proxies, and a 10-day log from FUNET, a regional ISP for academic and research communities in Finland. Our main findings are reported in detail in our paper ``Web Caching and Zipf-like Distributions: Evidence and Implications'', available at http://www.cs.wisc.edu/~cao/papers/zipf-implications.html. Due to space limitations we do not elaborate further here.
We developed a simple proxy benchmark, the Wisconsin Proxy Benchmark (WPB) 1.0, in fall 1997, and used it to compare a variety of commercial and freeware proxy software [1]. The benchmark has also been used by others to measure proxy performance and to project the performance benefits of proxy caching. It emulates server delays and models temporal locality in the request stream. However, use of the benchmark also exposed its weaknesses, including overhead at the client end, failure to model persistent connections and HTTP 1.1, and failure to capture spatial locality and URL path length.
We are in the process of developing Wisconsin Proxy Benchmark (WPB) 2.0. It uses the core engine of httperf [3], a very lightweight Web server benchmarking and measurement tool. The benchmark already supports persistent connections and HTTP 1.1, and supports trace replay with as much accuracy as possible at the user level. We are now adding temporal locality, spatial locality, and a variety of other features described below.
From our experience using WPB 1.0 to compare proxy products, we find that a proxy benchmark should at least reflect the following characteristics of real-life proxy traffic:
Some proxy benchmarks issue requests directly over the Internet to real Web servers; however, doing so makes results dependent on prevailing network conditions and hard to reproduce. Short of that, a proxy benchmark must include a pseudo-server module that delays its responses. For portability, the pseudo-server module is most likely implemented at the user level, which means that it can at best emulate packet arrival delay (by issuing sends with some delay), but cannot emulate features such as delays in connection establishment and network loss.
WPB 1.0 includes a pseudo-server module that delays sending back each reply by a configurable duration. WPB 2.0 will include a more elaborate delay mechanism, which amortizes the delay over the packets in a reply.
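One way to amortize a configured delay over a reply is to split the reply into packet-sized chunks and pause for an equal share of the total delay before each send. The following sketch illustrates the idea only; the function name and the 1460-byte packet size are assumptions, not WPB's actual interface.

```python
def packet_delay_schedule(reply_size, total_delay_ms, packet_size=1460):
    """Split a reply into packet-size chunks and spread the configured
    latency evenly across them (illustrative sketch, not WPB code)."""
    n_packets = max(1, -(-reply_size // packet_size))  # ceiling division
    per_packet_ms = total_delay_ms / n_packets
    return [per_packet_ms] * n_packets

# A 4000-byte reply with 300 ms of total delay becomes three packets,
# each preceded by a 100 ms pause before its send() is issued.
schedule = packet_delay_schedule(4000, 300.0)
```

A real pseudo-server would sleep for each entry in the schedule before writing the corresponding chunk to the socket, so the client observes the delay spread across the body rather than concentrated before the first byte.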
For each persistent connection, the benchmark must generate a proper distribution of the number of requests served. Here again there is no trace data. We hope that information on the distribution of the number of embedded objects in HTML pages can help, since Internet Explorer fetches all embedded objects over one connection.
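Under that assumption, the number of requests on a connection would be one container page plus a draw from the embedded-object count distribution. The sketch below uses a placeholder distribution, since, as noted above, no trace data is available; the counts and weights are invented for illustration only.

```python
import random

def requests_per_connection(rng, embedded_counts, weights):
    """Draw the number of requests for one persistent connection:
    the HTML page itself plus its embedded objects, assuming the
    browser fetches all embedded objects over the same connection.
    The caller supplies the distribution; no measured data is implied."""
    embedded = rng.choices(embedded_counts, weights=weights, k=1)[0]
    return 1 + embedded

rng = random.Random(42)
# Placeholder distribution: pages with 0, 1, 2, 5, or 10 embedded objects.
n = requests_per_connection(rng, [0, 1, 2, 5, 10], [30, 25, 20, 15, 10])
```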
While we have a relatively good understanding of temporal locality and access distributions in proxy requests, we do not understand spatial locality very well. We are working on this problem.
The benchmark should also generate the proper percentage of requests carrying cookies and of responses carrying cookies. Cookies are ubiquitous today, and different proxies process them differently.
Our results on the correlation between response size and object popularity aid the benchmark design here. Since the correlation is very low, one can decouple the code that generates the reference URL from the code that calculates the size of the object. We are also looking into the correlation between response latency and object popularity.
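Because the correlation is so low, the two generators can simply be independent samplers, as in the sketch below. The Zipf exponent and the lognormal size model are illustrative assumptions, not parameters taken from the traces.

```python
import math
import random

def zipf_url_index(rng, n_docs, alpha=0.8):
    """Sample a document index from a Zipf-like popularity distribution
    by inverse-CDF over the weights 1/i^alpha (alpha is an assumption)."""
    weights = [1.0 / (i ** alpha) for i in range(1, n_docs + 1)]
    r = rng.random() * sum(weights)
    acc = 0.0
    for i, w in enumerate(weights, start=1):
        acc += w
        if r <= acc:
            return i
    return n_docs

def response_size(rng, median=4000, sigma=1.0):
    """Sample a response size independently of popularity (the lognormal
    shape is an illustrative choice, not a claim about the traces)."""
    return int(median * math.exp(rng.gauss(0.0, sigma)))

rng = random.Random(1)
url_index = zipf_url_index(rng, 1000)   # which document is requested
size = response_size(rng)               # how large its reply is
```

Since neither sampler consults the other's output, the generated stream has zero built-in popularity-size correlation, matching the low correlation observed in the traces.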
Finally, the benchmark should measure not only client latency, outgoing traffic, and errors, but also the fairness of the proxy. We have seen that process-based proxies can introduce significant unfairness in client latency, whereas event-driven proxies such as Squid treat requests much more fairly.
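One candidate metric for summarizing such unfairness is Jain's fairness index over the per-client mean latencies; the source does not specify which metric WPB 2.0 will report, so this is offered only as a possibility.

```python
def jains_fairness(latencies):
    """Jain's fairness index: (sum x)^2 / (n * sum x^2).
    Returns 1.0 when all clients see equal latency, and approaches
    1/n when one client absorbs nearly all the delay. Offered as one
    possible fairness metric, not necessarily the one WPB 2.0 uses."""
    n = len(latencies)
    return sum(latencies) ** 2 / (n * sum(x * x for x in latencies))

fair = jains_fairness([100.0, 100.0, 100.0])   # equal service → 1.0
skewed = jains_fairness([100.0, 300.0])        # unequal service → below 1.0
```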
We are now constructing WPB 2.0, which consists of client-side and server-side code. All request generation and distribution fitting is done in the client-side code. In other words, the client code generates a request, sets its URL, and then generates its response status code, type, size, and latency. The server part of the benchmark is a simple pseudo-server that generates the specified number of random bytes with the specified status code, document type, and size, and emulates packet delays based on the specified latency.
Our client and server code is built through modifications of the httperf tool. httperf is extremely lightweight: it uses no threads or processes, but rather an event-driven architecture. It handles various scalability bottlenecks on the client side, including the limited number of ephemeral ports (see [3]). It supports persistent connections and HTTP 1.1 range requests. The original httperf implements only the client part; we have changed httperf extensively to provide a server counterpart.
Our benchmark can replay proxy logs faithfully. The client-side code reads the trace and generates each request carrying a specification of size, latency, etc.; the server-side code then responds accordingly.
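A simple way to carry that specification is to encode it in the request URL itself, so the pseudo-server needs no copy of the trace. The wire format below is a hypothetical illustration; WPB 2.0's actual encoding is not described in this paper.

```python
def encode_request_url(doc_id, size, latency_ms):
    """Client side: embed the desired response specification in the URL
    path (hypothetical wire format, not WPB 2.0's actual encoding)."""
    return "/replay/%d/size=%d/latency=%d" % (doc_id, size, latency_ms)

def decode_request_url(path):
    """Server side: recover (doc_id, size, latency_ms) from the path so
    the pseudo-server can build and delay the matching reply."""
    parts = path.strip("/").split("/")
    doc_id = int(parts[1])
    size = int(parts[2].split("=")[1])
    latency_ms = int(parts[3].split("=")[1])
    return doc_id, size, latency_ms

spec = decode_request_url(encode_request_url(17, 2048, 120))
```

With this round trip in place, the pseudo-server can reproduce each logged response's size and latency without sharing any state with the client beyond the requests themselves.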
The trace replay tool offers a valuable service to any institution that wants to evaluate the benefit of a caching proxy: the institution can replay a portion of its own log and immediately obtain numbers such as user latency reduction and Internet traffic reduction.
We are now working on the modeling part of the client-side code, hoping to incorporate all of the items listed above.
We have described our current results from analyzing six Web proxy traces and our plans for building the next version of the Wisconsin Proxy Benchmark. A few data items needed to build a realistic benchmark are still missing, including the average URL path component length, the average number of requests serviced by a persistent connection, the percentage of persistent connections, etc. New traces that can provide such information would be highly appreciated.
Characterization of Web Proxy Traffic and Wisconsin Proxy Benchmark 2.0
(Position Paper for W3C Web Characterization Workshop)
The translation was initiated by Pei Cao on 10/12/1998