Learning navigation attractors for mobile robots with reinforcement learning and reservoir computing

ANTONELO, Eric Aislan; Depeweg, Stefan; Schrauwen, Benjamin

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

Learning navigation attractors for mobile robots with reinforcement learning and reservoir computing

ANTONELO, Eric Aislan; Depeweg, Stefan; Schrauwen, Benjamin

2011 • In Proceedings of the X Brazilian Congress on Computational Intelligence (CBIC)

Peer reviewed

Permalink
https://hdl.handle.net/10993/36470

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

2011_eric_cbic.pdf

Author postprint (2.08 MB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

reservoir computing; reinforcement learning; robot navigation

Abstract :

[en] Autonomous robot navigation in partially observable environments is a complex task because the state of the environment can not be completely determined only by the current sensory readings of a robot. This work uses the recently introduced paradigm for training recurrent neural networks (RNNs), called reservoir computing (RC), to model multiple navigation attractors in partially observable environments. In RC, the RNN with randomly generated fixed weights, called reservoir, projects the input into a high-dimensional dynamic space. Only the readout output layer is trained using standard linear regression techniques, and in this work, is used to approximate the state-action value function. By using a policy iteration framework, where an alternating sequence of policy improvement (samples generation from environment interaction) and policy evaluation (network training) steps are performed, the system is able to shape navigation attractors so that, after convergence, the robot follows the correct trajectory towards the goal. The experiments are accomplished using an e-puck robot extended with 8 distance sensors in a rectangular environment with an obstacle between the robot and the target region. The task is to reach the goal through the correct side of the environment, which is indicated by a temporary stimulus previously observed at the beginning of the episode. We show that the reservoir-based system (with short-term memory) can model these navigation attractors, whereas a feedforward network without memory fails to do so.

Disciplines :

Computer science

Author, co-author :

ANTONELO, Eric Aislan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

Depeweg, Stefan

Schrauwen, Benjamin

External co-authors :

yes

Language :

English

Title :

Learning navigation attractors for mobile robots with reinforcement learning and reservoir computing

Publication date :

2011

Event name :

X Brazilian Congress on Computational Intelligence (CBIC)

Event date :

08-11-2011 to 10-11-2011

Main work title :

Proceedings of the X Brazilian Congress on Computational Intelligence (CBIC)

Peer reviewed :

Peer reviewed

Additional URL :

http://ericantonelo.com/biblio/learning-navigation-attractors-mobile-robots-reinforcement-learning-and-reservoir-computing

Available on ORBilu :

since 29 August 2018

Statistics

Number of views

221 (0 by Unilu)

Number of downloads

194 (0 by Unilu)

More statistics