References of "Talbi, El-Ghazali 50035271"
     in
Bookmark and Share    
Full Text
Peer Reviewed
See detailMO-Gym: A Library of Multi-Objective Reinforcement Learning Environments
Alegre, Lucas Nunes; Felten, Florian UL; Talbi, El-Ghazali UL et al

Scientific Conference (2022, November)

Detailed reference viewed: 192 (92 UL)
Full Text
Peer Reviewed
See detailLearning to Optimise a Swarm of UAVs
Duflo, Gabriel UL; Danoy, Grégoire UL; Talbi, El-Ghazali UL et al

in Applied Sciences (2022), 12(19 9587),

The use of Unmanned Aerial Vehicles (UAVs) has shown a drastic increase in interest in the past few years. Current applications mainly depend on single UAV operations, which face critical limitations such ... [more ▼]

The use of Unmanned Aerial Vehicles (UAVs) has shown a drastic increase in interest in the past few years. Current applications mainly depend on single UAV operations, which face critical limitations such as mission range or resilience. Using several autonomous UAVs as a swarm is a promising approach to overcome these. However, designing an efficient swarm is a challenging task, since its global behaviour emerges solely from local decisions and interactions. These properties make classical multirobot design techniques not applicable, while evolutionary swarm robotics is typically limited to a single use case. This work, thus, proposes an automated swarming algorithm design approach, and more precisely, a generative hyper-heuristic relying on multi-objective reinforcement learning, that permits us to obtain not only efficient but also reusable swarming behaviours. Experimental results on a three-objective variant of the Coverage of a Connected UAV Swarm problem demonstrate that it not only permits one to generate swarming heuristics that outperform the state-of-the-art in terms of coverage by a swarm of UAVs but also provides high stability. Indeed, it is empirically demonstrated that the model trained on a certain class of instances generates heuristics and is capable of performing well on instances with a different size or swarm density. [less ▲]

Detailed reference viewed: 70 (8 UL)
Full Text
Peer Reviewed
See detailMORL/D: Multi-Objective Reinforcement Learning based on Decomposition
Felten, Florian UL; Talbi, El-Ghazali UL; Danoy, Grégoire UL

in International Conference in Optimization and Learning (OLA2022) (2022)

Detailed reference viewed: 126 (23 UL)
Full Text
Peer Reviewed
See detailA Framework of Hyper-Heuristics based on Q-Learning
Duflo, Gabriel UL; Danoy, Grégoire UL; Talbi, El-Ghazali UL et al

in International Conference in Optimization and Learning (OLA2022) (2022)

Detailed reference viewed: 120 (14 UL)
Full Text
Peer Reviewed
See detailMetaheuristics-based Exploration Strategies for Multi-Objective Reinforcement Learning
Felten, Florian UL; Danoy, Grégoire; Talbi, El-Ghazali UL et al

in Proceedings of the 14th International Conference on Agents and Artificial Intelligence (2022)

The fields of Reinforcement Learning (RL) and Optimization aim at finding an optimal solution to a problem, characterized by an objective function. The exploration-exploitation dilemma (EED) is a well ... [more ▼]

The fields of Reinforcement Learning (RL) and Optimization aim at finding an optimal solution to a problem, characterized by an objective function. The exploration-exploitation dilemma (EED) is a well known subject in those fields. Indeed, a consequent amount of literature has already been proposed on the subject and shown it is a non-negligible topic to consider to achieve good performances. Yet, many problems in real life involve the optimization of multiple objectives. Multi-Policy Multi-Objective Reinforcement Learning (MPMORL) offers a way to learn various optimised behaviours for the agent in such problems. This work introduces a modular framework for the learning phase of such algorithms, allowing to ease the study of the EED in Inner- Loop MPMORL algorithms. We present three new exploration strategies inspired from the metaheuristics domain. To assess the performance of our methods on various environments, we use a classical benchmark - the Deep Sea Treasure (DST) - as well as propose a harder version of it. Our experiments show all of the proposed strategies outperform the current state-of-the-art ε-greedy based methods on the studied benchmarks. [less ▲]

Detailed reference viewed: 243 (66 UL)
Full Text
Peer Reviewed
See detailA Q-Learning Based Hyper-Heuristic for Generating Efficient UAV Swarming Behaviours
Duflo, Gabriel UL; Danoy, Grégoire UL; Talbi, El-Ghazali UL et al

in Intelligent Information and Database Systems - 13th Asian Conference ACIIDS 2021, Phuket, Thailand, April 7-10, 2021, Proceedings (2021)

Detailed reference viewed: 79 (13 UL)
Full Text
Peer Reviewed
See detailAutomating the Design of Efficient Distributed Behaviours for a Swarm of UAVs
Duflo, Gabriel UL; Danoy, Grégoire UL; Talbi, El-Ghazali UL et al

in IEEE Symposium Series on Computational Intelligence, Canberra 1-4 December 2020 (2020, December)

The usage of Unmanned Aerial Vehicles (UAVs) is gradually gaining momentum for commercial applications. The vast majority considers a single UAV, which comes with several constraints such as its range of ... [more ▼]

The usage of Unmanned Aerial Vehicles (UAVs) is gradually gaining momentum for commercial applications. The vast majority considers a single UAV, which comes with several constraints such as its range of operations or the number of sensors it can carry. Using multiple autonomous UAVs simultaneously as a swarm makes it possible to overcome these limitations. However, manually designing complex emerging behaviours like swarming is a difficult and tedious task especially for such distributed systems which performance is hardly predictable. This article therefore proposes to automate the design of UAV swarming behaviours by defining a multi-objective optimisation problem, so called Coverage of a Connected-UAV Swarm (CCUS), and designing a Q-Learning based Hyper-Heuristic (QLHH) for generating distributed CCUS heuristics. Experimental results demonstrate the capacity of QLHH to generate efficient heuristics for any instance from a given class. [less ▲]

Detailed reference viewed: 152 (39 UL)
Full Text
Peer Reviewed
See detailA Q-Learning Hyper-Heuristic for UAV Swarming
Duflo, Gabriel UL; Danoy, Grégoire UL; Talbi, El-Ghazali UL et al

Scientific Conference (2020)

Detailed reference viewed: 145 (33 UL)
Full Text
Peer Reviewed
See detailAutomated design of efficient swarming behaviours: a Q-learning hyper-heuristic approach
Duflo, Gabriel UL; Danoy, Grégoire UL; Talbi, El-Ghazali UL et al

in GECCO '20: Genetic and Evolutionary Computation Conference, Companion Volume, Cancún, Mexico, July 8-12, 2020 (2020)

Detailed reference viewed: 129 (22 UL)