Bellman Equations

For each state x, define V(x) to be the expected number of turns it takes to finish the game, following an optimal policy. V(x) can be defined in terms of the states it reaches in one step.

\begin{displaymath}
V(x) = \sum_{d=1}^6 \min_c 1/6\; (W(T(x,d,c))),\end{displaymath}

where d is the die roll, c is the choice given the roll, T(x,d,c) is the state reached given choice c is taken from state x on a die roll of d.

The W(x) function gives the expected number of steps to the end of the game, given that we've just landed on x. W(x) =


next up previous
Up: TRIVIAL PURSUIT Previous: MDP