Discounted and undiscounted value-iteration in Markov decision processes: A survey
Abstract
A survey is given of the present state of the art of value-iteration and related successive approximation methods, as well as of resulting turnpike properties, in both the discounted and undiscounted versions of finite state and action Markov Decision Problems.
Citation
Federgruen, Awi, and P. J. Schweitzer. "Discounted and undiscounted value-iteration in Markov decision processes: A survey." In Dynamic Programming and its Applications, 23-53. Ed. Martin L. Puterman. Orlando, FL: Academic Press, 1979.