Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Robust Exploration/Exploitation Trade-Offs in Safety-Critical Applications
Tokic, M.; Ertle, P.; Palm, G. et al.
2012In 8th IFAC Int. Symposium on Fault Detection, Supervision and Safety for Technical Processes, Mexico City 29-31 August 2012
Peer reviewed
 

Files


Full Text
SAFEPROCESS-2012.pdf
Author postprint (372.15 kB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Robotics; Safety; Learning
Abstract :
[en] With regard to future service robots, unsafe exceptional circumstances can occur in complex systems that are hardly to foresee. In this paper, the assumption of having no knowledge about the environment is investigated using reinforcement learning as an option for learning behavior by trial-and-error. In such a scenario, action-selection decisions are made based on future reward predictions for minimizing costs in reaching a goal. It is shown that the selection of safetycritical actions leading to highly negative costs from the environment is directly related to the exploration/exploitation dilemma in temporal-di erence learning. For this, several exploration policies are investigated with regard to worst- and best-case performance in a dynamic environment. Our results show that in contrast to established exploration policies like epsilon-Greedy and Softmax, the recently proposed VDBE-Softmax policy seems to be more appropriate for such applications due to its robustness of the exploration parameter for unexpected situations.
Disciplines :
Computer science
Electrical & electronics engineering
Identifiers :
UNILU:UL-CONFERENCE-2013-025
Author, co-author :
Tokic, M.
Ertle, P.
Palm, G.;  University of Ulm
Soeffker, D.;  University of Duisburg-Essen
Voos, Holger  ;  University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Engineering Research Unit ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Language :
English
Title :
Robust Exploration/Exploitation Trade-Offs in Safety-Critical Applications
Publication date :
2012
Event name :
8th IFAC Int. Symposium on Fault Detection,Supervision and Safety for Technical Processes
Event place :
Mexico City, Mexico
Event date :
29-31 August 2012
Audience :
International
Main work title :
8th IFAC Int. Symposium on Fault Detection, Supervision and Safety for Technical Processes, Mexico City 29-31 August 2012
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 06 December 2013

Statistics


Number of views
63 (1 by Unilu)
Number of downloads
0 (0 by Unilu)

Scopus citations®
 
5
Scopus citations®
without self-citations
3

Bibliography


Similar publications



Contact ORBilu