Learning and Planning with Timing Information in Markov Decision Processes.

PL Bacon, B Balle, D Precup - UAI, 2015 - auai.org
Abstract
We consider the problem of learning and planning in Markov decision processes with temporally extended actions represented in the options framework. We propose to use predictions about the duration of extended actions to represent the state and show that this leads to a compact predictive state representation model independent of the set of primitive actions. Then we develop a consistent and efficient spectral learning algorithm for such models. Using just the timing information to represent states allows for faster improvement in the planning performance. We illustrate our approach with experiments in both synthetic and robot navigation domains.