Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Metaheuristics-based Exploration Strategies for Multi-Objective Reinforcement Learning
Felten, Florian; Danoy, Grégoire; Talbi, El-Ghazali et al.
2022In Proceedings of the 14th International Conference on Agents and Artificial Intelligence
Peer reviewed
 

Files


Full Text
109891.pdf
Publisher postprint (953.07 kB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Reinforcement Learning; Multi-objective; Metaheuristics; Pareto Sets
Abstract :
[en] The fields of Reinforcement Learning (RL) and Optimization aim at finding an optimal solution to a problem, characterized by an objective function. The exploration-exploitation dilemma (EED) is a well known subject in those fields. Indeed, a consequent amount of literature has already been proposed on the subject and shown it is a non-negligible topic to consider to achieve good performances. Yet, many problems in real life involve the optimization of multiple objectives. Multi-Policy Multi-Objective Reinforcement Learning (MPMORL) offers a way to learn various optimised behaviours for the agent in such problems. This work introduces a modular framework for the learning phase of such algorithms, allowing to ease the study of the EED in Inner- Loop MPMORL algorithms. We present three new exploration strategies inspired from the metaheuristics domain. To assess the performance of our methods on various environments, we use a classical benchmark - the Deep Sea Treasure (DST) - as well as propose a harder version of it. Our experiments show all of the proposed strategies outperform the current state-of-the-art ε-greedy based methods on the studied benchmarks.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > Parallel Computing & Optimization Group (PCOG)
Disciplines :
Computer science
Author, co-author :
Felten, Florian  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PCOG
Danoy, Grégoire;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PCOG
Talbi, El-Ghazali ;  University of Lille, CNRS/CRIStAL, Inria Lille, France
Bouvry, Pascal ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PCOG
External co-authors :
yes
Language :
English
Title :
Metaheuristics-based Exploration Strategies for Multi-Objective Reinforcement Learning
Publication date :
2022
Event name :
14th International Conference on Agents and Artificial Intelligence
Event date :
from 3-02-2022 to 5-02-2022
Audience :
International
Main work title :
Proceedings of the 14th International Conference on Agents and Artificial Intelligence
Publisher :
SCITEPRESS - Science and Technology Publications, Online Streaming, Unknown/unspecified
ISBN/EAN :
978-989-758-547-0
Pages :
662--673
Peer reviewed :
Peer reviewed
FnR Project :
FNR14762457 - Automating The Design Of Autonomous Robot Swarms, 2020 (01/05/2021-30/04/2024) - Gregoire Danoy
Funders :
FNR - Fonds National de la Recherche [LU]
Available on ORBilu :
since 22 February 2022

Statistics


Number of views
463 (123 by Unilu)
Number of downloads
4 (4 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
0
WoS citations
 
1

Bibliography


Similar publications



Contact ORBilu