Paper published in a book (Scientific congresses, symposiums and conference proceedings)
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning
Grooten, Bram; Tomilin, Tristan; Vasan, Gautham et al.
2024In AAMAS '24: Proceedings of the 2024 International Conference on Autonomous Agents and Multiagent Systems
Peer reviewed
 

Files


Full Text
2312.15339.pdf
Author postprint (16.29 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Computer Science - Learning; Computer Science - Artificial Intelligence; Computer Science - Computer Vision and Pattern Recognition; Computer Science - Robotics; Deep Reinforcement Learning
Abstract :
[en] The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabling them to generalize to unseen environments with new distractions. Existing works approach this problem using data augmentation or large auxiliary networks with additional loss functions. We introduce MaDi, a novel algorithm that learns to mask distractions by the reward signal only. In MaDi, the conventional actor-critic structure of deep reinforcement learning agents is complemented by a small third sibling, the Masker. This lightweight neural network generates a mask to determine what the actor and critic will receive, such that they can focus on learning the task. The masks are created dynamically, depending on the current input. We run experiments on the DeepMind Control Generalization Benchmark, the Distracting Control Suite, and a real UR5 Robotic Arm. Our algorithm improves the agent's focus with useful masks, while its efficient Masker network only adds 0.2% more parameters to the original structure, in contrast to previous work. MaDi consistently achieves generalization results better than or competitive to state-of-the-art methods.
Disciplines :
Computer science
Author, co-author :
Grooten, Bram;  Eindhoven University of Technology [NL]
Tomilin, Tristan;  Eindhoven University of Technology [NL]
Vasan, Gautham;  UAlberta - University of Alberta [CA]
Taylor, Matthew E.;  UAlberta - University of Alberta [CA] ; Alberta Machine Intelligence Institute (Amii)
Mahmood, Rupam A.;  UAlberta - University of Alberta [CA] ; Alberta Machine Intelligence Institute (Amii)
Fang, Meng;  University of Liverpool [GB]
Pechenizkiy, Mykola;  Eindhoven University of Technology [NL]
MOCANU, Decebal Constantin  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; Eindhoven University of Technology [NL]
External co-authors :
yes
Language :
English
Title :
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning
Publication date :
06 May 2024
Event name :
AAMAS '24: 2024 International Conference on Autonomous Agents and Multiagent Systems
Event date :
from 6 to 10 May 2024
Audience :
International
Main work title :
AAMAS '24: Proceedings of the 2024 International Conference on Autonomous Agents and Multiagent Systems
Publisher :
International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Development Goals :
9. Industry, innovation and infrastructure
Commentary :
Accepted as full-paper (oral) at AAMAS 2024. Code is available at https://github.com/bramgrooten/mask-distractions and see our 40-second video at https://youtu.be/2oImF0h1k48
Available on ORBilu :
since 15 January 2024

Statistics


Number of views
262 (1 by Unilu)
Number of downloads
84 (0 by Unilu)

Scopus citations®
 
8
Scopus citations®
without self-citations
7

Bibliography


Similar publications



Contact ORBilu