Bellman Equation for MDPs
For all
,
,
This form of the equation is for ``undiscounted'' MDPs, in which future rewards are not scaled down by a discount factor, for reasons that we might not discuss.
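As a rough illustration of what an undiscounted Bellman equation asks for, the sketch below repeatedly applies the standard update "value of a state = best, over actions, of immediate reward plus expected value of the next state" to a tiny made-up MDP. Everything here is an assumption for illustration: the names R, P, STATES and bellman_backup, the two-state problem, and the convention that a state with no actions is terminal with value 0. The notation used in these notes may differ.

```python
# Hypothetical toy MDP; all names and numbers are illustrative assumptions.
# R[(s, a)] is the immediate reward for taking action a in state s.
# P[(s, a)] maps each possible next state to its probability.
R = {("start", "walk"): -3.0, ("start", "bus"): -1.0}
P = {
    ("start", "walk"): {"goal": 1.0},
    ("start", "bus"): {"goal": 0.5, "start": 0.5},
}
STATES = ["start", "goal"]  # "goal" has no actions: treated as terminal, value 0


def bellman_backup(V):
    """Apply the undiscounted Bellman optimality update once to every state."""
    new_V = {}
    for s in STATES:
        actions = [a for (state, a) in R if state == s]
        if not actions:
            # Terminal state: no further reward can be collected.
            new_V[s] = 0.0
            continue
        # No discount factor multiplies the expected next-state value.
        new_V[s] = max(
            R[(s, a)] + sum(p * V[s2] for s2, p in P[(s, a)].items())
            for a in actions
        )
    return new_V


# Start from all zeros and iterate the update many times.
V = {s: 0.0 for s in STATES}
for _ in range(100):
    V = bellman_backup(V)
print(V)
```

In this toy problem the iteration settles because every policy eventually reaches the terminal state; without a discount factor, that kind of condition is what keeps the values finite. The fixed point for "start" solves V = max(-3, -1 + 0.5 V), which gives V = -2 (take the bus).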