
   
Duality Between Prefetching and Queued Writing with Parallel Disks

D. A. Hutchinson, P. Sanders, and J. S. Vitter. "Duality Between Prefetching and Queued Writing with Parallel Disks," SIAM Journal on Computing, 34(6), 1443-1463, June 2005. An extended abstract appears in Proceedings of the 9th Annual European Symposium on Algorithms (ESA '01), Århus, Denmark, August 2001, published in Lecture Notes in Computer Science, 2161, Springer-Verlag, Berlin, Germany.


Parallel disks promise to be a cost-effective means of achieving high bandwidth in applications involving massive data sets, but algorithms for parallel disks can be difficult to devise. To combat this problem, we define a useful and natural duality between writing to parallel disks and the seemingly more difficult problem of prefetching. We first explore this duality for applications involving read-once accesses using parallel disks. We obtain a simple linear-time algorithm for computing optimal prefetch schedules and analyze the efficiency of the resulting schedules for randomly placed data and for arbitrary interleaved accesses to striped sequences. Duality also provides an optimal schedule for the integrated caching and prefetching problem, in which blocks can be accessed multiple times. Another application of this duality gives us the first parallel disk sorting algorithms that are provably optimal up to lower-order terms. One of these algorithms is a simple and practical variant of multiway merge sort, addressing a question that has been open for some time.
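To make the duality concrete for the read-once case, the following is a minimal Python sketch, not the paper's own code. It assumes a simple model that the abstract does not spell out: each block lives on one fixed disk, a buffer holds at most `buffer_size` blocks, and one I/O step moves at most one block per disk. The sketch simulates greedy queued writing on the reversed read sequence and then reverses the resulting output steps to obtain a prefetch schedule. The function names, the `disk_of` mapping, and the exact greedy write policy are illustrative assumptions.

```python
from collections import deque


def greedy_write_schedule(blocks, disk_of, num_disks, buffer_size):
    """Simulate queued writing (assumed greedy policy).

    Blocks arrive in the given order and wait in per-disk FIFO queues.
    Whenever the buffer is full, one output step writes the head of
    every nonempty queue. Remaining blocks are flushed at the end.
    Returns a list of output steps, each a list of blocks.
    """
    queues = [deque() for _ in range(num_disks)]
    buffered = 0
    steps = []

    def output_step():
        nonlocal buffered
        # At most one block per disk is written in a single step.
        step = [q.popleft() for q in queues if q]
        buffered -= len(step)
        steps.append(step)

    for block in blocks:
        if buffered == buffer_size:
            output_step()
        queues[disk_of[block]].append(block)
        buffered += 1

    while buffered:
        output_step()
    return steps


def prefetch_schedule_by_duality(read_order, disk_of, num_disks, buffer_size):
    """Derive a prefetch schedule from a write schedule.

    Write the reversed read sequence, then read the output steps
    backwards: the last block written becomes the first block fetched.
    """
    write_steps = greedy_write_schedule(
        list(reversed(read_order)), disk_of, num_disks, buffer_size
    )
    return list(reversed(write_steps))


if __name__ == "__main__":
    # Hypothetical placement: three blocks on disk 0, three on disk 1.
    disk_of = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1, "f": 1}
    read_order = ["a", "b", "c", "d", "e", "f"]
    schedule = prefetch_schedule_by_duality(
        read_order, disk_of, num_disks=2, buffer_size=3
    )
    for i, step in enumerate(schedule, start=1):
        print(f"input step {i}: fetch {step}")
```

Each pass over the sequence does constant work per block, which is in the spirit of the linear-time claim above. The sketch makes no attempt to reproduce the optimality analysis from the paper.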


Jeff Vitter
2008-07-05