Paper published in a book (Scientific congresses, symposiums and conference proceedings)
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning
FELTEN, Florian; Alegre, Lucas N.; Nowé, Ann et al.
2024In A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning
Peer reviewed
 

Files


Full Text
512_a_toolkit_for_reliable_benchma (1).pdf
Author postprint (33.52 MB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Multi-objective; Reinforcement Learning; Benchmarking
Abstract :
[en] Multi-objective reinforcement learning algorithms (MORL) extend standard reinforcement learning (RL) to scenarios where agents must optimize multiple---potentially conflicting---objectives, each represented by a distinct reward function. To facilitate and accelerate research and benchmarking in multi-objective RL problems, we introduce a comprehensive collection of software libraries that includes: (i) MO-Gymnasium, an easy-to-use and flexible API enabling the rapid construction of novel MORL environments. It also includes more than 20 environments under this API. This allows researchers to effortlessly evaluate any algorithms on any existing domains; (ii) MORL-Baselines, a collection of reliable and efficient implementations of state-of-the-art MORL algorithms, designed to provide a solid foundation for advancing research. Notably, all algorithms are inherently compatible with MO-Gymnasium; and (iii) a thorough and robust set of benchmark results and comparisons of MORL-Baselines algorithms, tested across various challenging MO-Gymnasium environments. These benchmarks were constructed to serve as guidelines for the research community, underscoring the properties, advantages, and limitations of each particular state-of-the-art method.
Research center :
ULHPC - University of Luxembourg: High Performance Computing
Disciplines :
Computer science
Author, co-author :
FELTEN, Florian   ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PCOG ; Farama Foundation
Alegre, Lucas N. ;  Federal University of Rio Grande do Sul > Institute of Informatics ; VUB - Vrije Universiteit Brussel [BE] > Artificial Intelligence Lab ; Farama Foundation
Nowé, Ann;  VUB - Vrije Universiteit Brussel [BE] > Artificial Intelligence Lab
L. C. Bazzan, Ana;  Federal University of Rio Grande do Sul > Institute of Informatics
TALBI, El-Ghazali ;  University of Luxembourg ; ULille - University of Lille [FR] > CNRS/CRIStAL
DANOY, Grégoire  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; Unilu - Université du Luxembourg [LU] > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > PCOG
C. da Silva, Bruno;  University of Massachusetts
 These authors have contributed equally to this work.
External co-authors :
yes
Language :
English
Title :
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning
Publication date :
2024
Event name :
Thirty-seventh Conference on Neural Information Processing Systems
Event place :
United States
Event date :
10/12/2023
Audience :
International
Main work title :
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning
Publisher :
Curran Associates, United States
Peer reviewed :
Peer reviewed
FnR Project :
FNR14762457 - Automating The Design Of Autonomous Robot Swarms, 2020 (01/05/2021-30/04/2024) - Gregoire Danoy
Name of the research project :
R-AGR-3933 - C20/IS/14762457/ADARS (01/05/2021 - 30/04/2024) - DANOY Grégoire
Funders :
FNR - Luxembourg National Research Fund [LU]
Available on ORBilu :
since 11 January 2024

Statistics


Number of views
100 (9 by Unilu)
Number of downloads
0 (0 by Unilu)

Scopus citations®
 
5
Scopus citations®
without self-citations
4

Bibliography


Similar publications



Contact ORBilu