Autonomous mobile robots must accomplish tasks in unknown and noisy environments. In this context, learning robot behaviors by imitation is desirable both for service robotics and for learning robots. In this work, we use Reservoir Computing (RC) to learn robot behaviors by demonstration. In RC, a randomly generated recurrent neural network, the reservoir, projects the input into a dynamic temporal space. The reservoir states are mapped to a readout output layer, which is the only part that is trained, using standard linear regression. We use a two-layer modular structure: the first layer comprises two RC networks, each learning one primitive behavior, namely obstacle avoidance and target seeking; the second layer is a single RC network for behavior combination and coordination. The hierarchical RC network learns from examples given by simple controllers that implement the primitive behaviors. We use a simulation model of the e-puck robot, whose distance sensors and camera provide the input to our system. The experiments show that, after training, the robot learns to coordinate the Goal Seeking (GS) and the Object Avoidance (OA) behaviors in unknown environments, capturing targets and navigating efficiently.
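The RC scheme summarized in the abstract (a fixed random reservoir plus a readout fitted by linear regression) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the reservoir size, the spectral radius of 0.9, the eight synthetic "distance sensor" inputs, and the `teacher_controller` function are all assumptions chosen for illustration, standing in for the simple controllers that provide the demonstrations.

```python
import numpy as np

def make_reservoir(n_inputs, n_units, spectral_radius=0.9, seed=0):
    """Create fixed random input and recurrent weights (never trained)."""
    rng = np.random.default_rng(seed)
    w_in = rng.uniform(-1.0, 1.0, size=(n_units, n_inputs))
    w = rng.normal(size=(n_units, n_units))
    # Rescale so the largest eigenvalue magnitude equals spectral_radius.
    w *= spectral_radius / np.max(np.abs(np.linalg.eigvals(w)))
    return w_in, w

def run_reservoir(w_in, w, inputs):
    """Drive the reservoir with an input sequence and collect its states."""
    x = np.zeros(w.shape[0])
    states = np.zeros((len(inputs), w.shape[0]))
    for t, u in enumerate(inputs):
        x = np.tanh(w_in @ u + w @ x)
        states[t] = x
    return states

def add_bias(states):
    """Append a constant column so the readout can learn an offset."""
    return np.hstack([states, np.ones((len(states), 1))])

def train_readout(states, targets):
    """Fit the linear readout by ordinary least squares."""
    w_out, *_ = np.linalg.lstsq(add_bias(states), targets, rcond=None)
    return w_out

def teacher_controller(sensors):
    """Hypothetical demonstrator: each wheel slows down when the
    sensors on one half of the robot report nearby obstacles."""
    left = 1.0 - sensors[:, :4].mean(axis=1)
    right = 1.0 - sensors[:, 4:].mean(axis=1)
    return np.stack([left, right], axis=1)

# Synthetic demonstration data: 500 steps of 8 sensor readings in [0, 1].
rng = np.random.default_rng(1)
sensors = rng.uniform(0.0, 1.0, size=(500, 8))
targets = teacher_controller(sensors)

w_in, w = make_reservoir(n_inputs=8, n_units=50)
states = run_reservoir(w_in, w, sensors)
w_out = train_readout(states, targets)
pred = add_bias(states) @ w_out
mse = float(np.mean((pred - targets) ** 2))
```

Only `w_out` is fitted; `w_in` and `w` stay at their random initial values. In a two-layer arrangement like the one described, the outputs of two such networks would plausibly serve as inputs to a third, but that wiring is not shown here.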
Disciplines:
Computer science
Author, co-author:
Waegeman, Tim
ANTONELO, Eric Aislan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
wyffels, Francis
Schrauwen, Benjamin
External co-authors:
yes
Document language:
English
Title:
Modular reservoir computing networks for imitation learning of multiple robot behaviors
Publication/release date:
2009
Event name:
IEEE Int. Symp. on Computational Intelligence in Robotics and Automation (CIRA)
Event date:
15-12-2009 to 18-12-2009
Event scope:
International
Title of the main work:
Proc. of the 2009 IEEE International Symposium on Computational Intelligence in Robotics and Automation - (CIRA)