Learning model-free robot control by a Monte Carlo EM algorithm

VLASSIS, Nikos; Toussaint, Marc; Kontes, Georgios; Piperidis, Savas

doi:10.1007/s10514-009-9132-0

Download

Article (Scientific journals)

Learning model-free robot control by a Monte Carlo EM algorithm

VLASSIS, Nikos; Toussaint, Marc; Kontes, Georgios et al.

2009 • In Autonomous Robots, 27 (2), p. 123-130

Peer reviewed

Permalink
https://hdl.handle.net/10993/3368

DOI
10.1007/s10514-009-9132-0

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

09-vlassis-et-al-auro.pdf

Author postprint (769.81 kB)

The final publication is available at link.springer.com

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Model-free robot control; Reinforcement learning; Probabilistic inference; EM algorithm

Abstract :

[en] We address the problem of learning robot control by model-free reinforcement learning (RL). We adopt the probabilistic model for model-free RL of Vlassis and Toussaint (Proceedings of the international conference on machine learning, Montreal, Canada, 2009), and we propose a Monte Carlo EM algorithm (MCEM) for control learning that searches directly in the space of controller parameters using information obtained from randomly generated robot trajectories. MCEM is related to, and generalizes, the PoWER algorithm of Kober and Peters (Proceedings of the neural information processing systems, 2009). In the finite-horizon case MCEM reduces precisely to PoWER, but MCEM can also handle the discounted infinite-horizon case. An interesting result is that the infinite-horizon case can be viewed as a 'randomized' version of the finite-horizon case, in the sense that the length of each sampled trajectory is a random draw from an appropriately constructed geometric distribution. We provide some preliminary experiments demonstrating the effects of fixed (PoWER) vs randomized (MCEM) horizon length in two simulated and one real robot control tasks.

Disciplines :

Computer science

Identifiers :

UNILU:UL-ARTICLE-2011-698

Author, co-author :

VLASSIS, Nikos ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)

Toussaint, Marc

Kontes, Georgios

Piperidis, Savas

Language :

English

Title :

Learning model-free robot control by a Monte Carlo EM algorithm

Publication date :

2009

Journal title :

Autonomous Robots

ISSN :

0929-5593

Publisher :

Springer Science & Business Media B.V.

Volume :

Issue :

Pages :

123-130

Peer reviewed :

Peer reviewed

Additional URL :

http://link.springer.com/content/pdf/10.1007%2Fs10514-009-9132-0.pdf

Available on ORBilu :

since 04 July 2013

Statistics

Number of views

111 (2 by Unilu)

Number of downloads

673 (6 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

OpenCitations

OpenAlex citations

WoS citations^™