next up previous
Next: Handling Failures Up: Architecture and Rationale Previous: Architecture and Rationale

The Costs of a Central Map

The CRISP mapping service allows independent proxy caches (e.g., at the department or branch level) to be coalesced to form a single large cache. However, this central structure imposes new costs to store the central map, and message exchanges to probe the map and notify it of fetches or evictions at participating proxies.

At first approximation, the dollar cost of the mapping service is the price of a machine with enough physical memory to hold what amounts to a replica of the directories of all participating proxy caches. The map is simply a hash of URLs, which are small relative to the Internet objects they name. A recent study [3] indicated that the average object is about 10K bytes. If we conservatively estimate the average URL size at 100 bytes, then the data that passes through the mapping service is about 1% of the aggregate size of the CRISP cache, e.g., a gigabyte of storage for a 100G cache. However, we require that the map reside in physical memory since disk accesses would add unacceptably to the mapping server latency.

The central mapping service is not a latency bottleneck or a significant barrier to scalability for three basic reasons:

The key limitation of CRISP is that the round-trip network latency between each proxy and the mapping service must be low, say, 20 ms at most. This may in practice limit the geographical size of the cache. In general, we envision that mapping service traffic will be carried as high-priority traffic over an ISP's internal network.


next up previous
Next: Handling Failures Up: Architecture and Rationale Previous: Architecture and Rationale

Syam Gadde
Fri Mar 28 10:09:42 EST 1997