Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Robot planning in partially observable continuous domains
Porta, Josep M.; Spaan, Matthijs T. J.; Vlassis, Nikos
2005In Proc. Robotics: Science and Systems
Peer reviewed
 

Files


Full Text
download.pdf
Publisher postprint (327.25 kB)
http://www.roboticsproceedings.org/rss01/p29.pdf
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] We present a value iteration algorithm for learning to act in Partially Observable Markov Decision Processes (POMDPs) with continuous state spaces. Mainstream POMDP research focuses on the discrete case and this complicates its application to, e.g., robotic problems that are naturally modeled using continuous state spaces. The main difficulty in defining a (belief-based) POMDP in a continuous state space is that expected values over states must be defined using integrals that, in general, cannot be computed in closed from. In this paper, we first show that the optimal finite-horizon value function over the continuous infinite-dimensional POMDP belief space is piecewise linear and convex, and is defined by a finite set of supporting α-functions that are analogous to the α-vectors (hyperplanes) defining the value function of a discrete-state POMDP. Second, we show that, for a fairly general class of POMDP models in which all functions of interest are modeled by Gaussian mixtures, all belief updates and value iteration backups can be carried out analytically and exact. A crucial difference with respect to the α-vectors of the discrete case is that, in the continuous case, the α-functions will typically grow in complexity (e.g., in the number of components) in each value iteration. Finally, we demonstrate PERSEUS, our previously proposed randomized point-based value iteration algorithm, in a simple robot planning problem with a continuous domain, where encouraging results are observed.
Disciplines :
Computer science
Identifiers :
UNILU:UL-ARTICLE-2011-720
Author, co-author :
Porta, Josep M.
Spaan, Matthijs T. J.
Vlassis, Nikos ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
Language :
English
Title :
Robot planning in partially observable continuous domains
Publication date :
2005
Event name :
Proc. Robotics: Science and Systems
Event date :
2005
Main work title :
Proc. Robotics: Science and Systems
Pages :
217-224
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 17 November 2013

Statistics


Number of views
90 (0 by Unilu)
Number of downloads
112 (0 by Unilu)

Bibliography


Similar publications



Contact ORBilu