Rachelson, Emmanuel and Quesnel, Gauthier and Garcia, Frédérick and Fabiani, Patrick
A Simulation-based Approach for Solving Temporal Markov Problems.
(2008)
In: European Conference on Artificial Intelligence (ECAI 2008), 21 August 2008 - 25 August 2008 (Patras, Greece).
|
(Document in English)
PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader 251kB |
Official URL: http://ebooks.iospress.nl/volume/ecai-2008
Abstract
Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of decision under uncertainty. In this paper, after reviewing and comparing MDP frameworks designed to deal with temporal problems, we focus on Generalized Semi-Markov Decision Processes (GSMDP) with observable time. We highlight the inherent structure and complexity of these problems and present the differences with classical reinforcement learning problems. Finally, we introduce a new simulation-based reinforcement learning method for solving GSMDP, bringing together results from simulation-based policy iteration, regression techniques and simulation theory. We illustrate our approach on a subway network control example.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | ECAI 2008 18th European Conference on Artificial Intelligence Volume 178 Frontiers in Artificial Intelligence and Applications Edited by: Ghallab, M., Spyropoulos, C.D., Fakotakis, N., Avouris, N. July 2008, 972 pp., softcover ISBN: 978-1-58603-891-5 |
Audience (conference): | International conference proceedings |
Uncontrolled Keywords: | |
Institution: | French research institutions > Institut National de la Recherche Agronomique - INRA (FRANCE) French research institutions > Office National d'Etudes et Recherches Aérospatiales - ONERA (FRANCE) |
Laboratory name: | |
Statistics: | download |
Deposited On: | 30 Nov 2017 09:15 |
Repository Staff Only: item control page