Long-Term Values in Markov Decision Processes and Repeated Games, and a New Distance for Probability Spaces
Article
Mathematics of Operations Research
Issue number:
Issue in Advance: Issue 4 (November 2016)
Publisher:
Informs
Year:
2017
We study long-term Markov decision processes (MDPs) and gambling houses, with applications to any partial observation MDPs with finitely many states and zero-sum repeated games with an informed controller.