Contribution to collective works (Parts of books)
Bayesian Reinforcement Learning
Vlassis, Nikos; Ghavamzadeh, Mohammad; Mannor, Shie et al.
2012In Wiering, Marco; van Otterlo, Martijn (Eds.) Reinforcement Learning: State of the Art
Peer reviewed
 

Files


Full Text
BRLchapter.pdf
Author postprint (201.27 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] This chapter surveys recent lines of work that use Bayesian techniques for reinforcement learning. In Bayesian learning, uncertainty is expressed by a prior distribution over unknown parameters and learning is achieved by computing a posterior distribution based on the data observed. Hence, Bayesian reinforcement learning distinguishes itself from other forms of reinforcement learning by explic- itly maintaining a distribution over various quantities such as the parameters of the model, the value function, the policy or its gradient. This yields several benefits: a) domain knowledge can be naturally encoded in the prior distribution to speed up learning; b) the exploration/exploitation tradeoff can be naturally optimized; and c) notions of risk can be naturally taken into account to obtain robust policies.
Research center :
Luxembourg Centre for Systems Biomedicine (LCSB): Machine Learning (Vlassis Group)
Disciplines :
Computer science
Identifiers :
UNILU:UL-CHAPTER-2012-428
Author, co-author :
Vlassis, Nikos ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
Ghavamzadeh, Mohammad
Mannor, Shie
Poupart, Pascal
External co-authors :
yes
Language :
English
Title :
Bayesian Reinforcement Learning
Publication date :
2012
Main work title :
Reinforcement Learning: State of the Art
Editor :
Wiering, Marco
van Otterlo, Martijn
Publisher :
Springer
ISBN/EAN :
978-3-642-27645-3
Pages :
359-386
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 04 July 2013

Statistics


Number of views
160 (11 by Unilu)
Number of downloads
3364 (7 by Unilu)

Scopus citations®
 
47
Scopus citations®
without self-citations
47
OpenCitations
 
23

Bibliography


Similar publications



Contact ORBilu