Duke DBGroup Logo

CPS 216: Advanced Database Systems
(Data-Intensive Computing Systems, Fall 2010)

Course information
Course schedule and notes
Assignments
Readings
Project
Extra Materials
The course schedule will be posted here.
WeekDateTopicLecture slides and reference
108-30Introduction and overview Notes 1: ppt, pdf
09-01Introduction to MapReduce and Hadoop Notes 2: ppt, pdf
209-06Introduction to MapReduce and Hadoop (contd.) Notes 2: ppt, pdf
09-08Some MapReduce Algorithms Notes 3: pdf
309-13Some MapReduce Algorithms (contd.) Notes 3: pdf
09-15How Hadoop Works Notes 4: ppt, pdf
409-20How Hadoop Works (contd.) Notes 4: ppt, pdf
09-22 Overview of query processing Notes 5: ppt, pdf
509-27Pipelining (iterators) and Materialization Notes 6: ppt, pdf
09-29Rule-based optimization Notes 6: ppt, pdf
610-04Block-based data storage Notes 7: ppt, pdf
10-06 Index-based access Notes 8: ppt, pdf
710-11 Fall break (no class)
10-13 Index-based access (contd.) Notes 9: ppt, pdf
810-18 Sort processing Notes 10: ppt, pdf
10-20Introduction to Join processing Notes 10: ppt, pdf
910-25Sort-merge joins, Block and Index nested-loop joins Notes 10: ppt, pdf
10-27Midterm
1011-01Introduction to Pig and Pig Latin Notes 11: ppt, pdf
11-03Hash joins Notes 10: ppt, pdf
1111-08Cost-based Query Optimization Notes 12: ppt, pdf
11-10Failure recovery, Logging Notes 13: ppt, pdf
1111-15HBase: The Hadoop Database Chapter 12 in textbook
by Tom White
11-17Programming and Debugging Large-Scale
Data Processing Workflows
Colloquium by Chris Olston,
Yahoo! Research
1211-22 HBase, Concurrency control, Serializability Notes 14: ppt, pdf, Exercises
1311-29 Scalable key-value stores,
Yahoo! Cloud Serving Benchmark (YCSB)
YCSB page