Paper published on a website (Scientific congresses, symposiums and conference proceedings)
MultiMAE Meets Earth Observation: Pre-training Multi-modal Multi-task Masked Autoencoders for Earth Observation Tasks
SOSA MARTINEZ, Jose Angel; RUKHOVICH, Danila; KACEM, Anis et al.
20252025 IEEE International Conference on Image Processing
Peer reviewed
 

Files


Full Text
2505.14951v1.pdf
Author preprint (4.26 MB) Creative Commons License - Attribution
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Multi-modal; Multi-task; Earth Observation; Masked Autoencoders
Abstract :
[en] Multi-modal data in Earth Observation (EO) presents a huge opportunity for improving transfer learning capabilities when pre-training deep learning models. Unlike prior work that often overlooks multi-modal EO data, recent methods have started to include it, resulting in more effective pre-training strategies. However, existing approaches commonly face challenges in effectively transferring learning to downstream tasks where the structure of available data differs from that used during pre-training. This paper addresses this limitation by exploring a more flexible multi-modal, multi-task pre-training strategy for EO data. Specifically, we adopt a Multi-modal Multi-task Masked Autoencoder (MultiMAE) that we pre-train by reconstructing diverse input modalities, including spectral, elevation, and segmentation data. The pre-trained model demonstrates robust transfer learning capabilities, outperforming state-of-the-art methods on various EO datasets for classification and segmentation tasks. Our approach exhibits significant flexibility, handling diverse input configurations without requiring modality-specific pre-trained models. Code will be available at: https://github.com/josesosajs/multimae-meets-eo.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > CVI² - Computer Vision Imaging & Machine Intelligence
Disciplines :
Computer science
Author, co-author :
SOSA MARTINEZ, Jose Angel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > CVI2
RUKHOVICH, Danila ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > CVI2
KACEM, Anis  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > CVI2
AOUADA, Djamila  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > CVI2
External co-authors :
yes
Language :
English
Title :
MultiMAE Meets Earth Observation: Pre-training Multi-modal Multi-task Masked Autoencoders for Earth Observation Tasks
Publication date :
20 May 2025
Event name :
2025 IEEE International Conference on Image Processing
Event organizer :
IEEE
Event place :
Anchorage, United States
Event date :
14 - 17 September 2025
Audience :
International
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
FnR Project :
FNR17978225 - AI4CC - Hpc And Advanced Ai For Crop Classification Using High Resolution Satellites, 2023 (01/07/2024-30/06/2026) - Djamila Aouada
Available on ORBilu :
since 08 July 2025

Statistics


Number of views
165 (38 by Unilu)
Number of downloads
152 (5 by Unilu)

Bibliography


Similar publications



Contact ORBilu