Long-Term Values in Markov Decision Processes and Repeated Games, and a New Distance for Probability Spaces

Article

Authors:

Jérôme Renault, Xavier Venel

Mathematics of Operations Research

Issue number:

Issue in Advance: Issue 4 (November 2016)

Publisher:

INFORMS

Year:

2017

Journal Article

We study long-term Markov decision processes (MDPs) and gambling houses, with applications to partial observation MDPs with finitely many states and to zero-sum repeated games with an informed controller.

Tags:

Game Theory & Graphs
