Planning Under Uncertainty (14)
Michael L. Littman
March 9th, 1999
BLACKJACK
Background
Rules
Markov Chain: Review
Markov Chain: Reward
Decisions, Decisions
Expected Value of a Hit
Optimal Strategy
Multiple Hits
Computing the Optimal Strategy
Markov Decision Processes
Single Agent Problems
Definition
Bellman Equation for MDPs
Connection to Other Problems
Solving MDPs
TRIVIAL PURSUIT
MDP
Bellman Equations
Next:
BLACKJACK