Article (Scientific journals)
Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach
WANG, Yiqun; LI, Lujun; YUE, Meiru et al.
2025In ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Peer Reviewed verified by ORBi
 

Files


Full Text
2512.09471v1 (1).pdf
Author postprint (5 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Computer Science - Computer Vision and Pattern Recognition; Computer Science - Artificial Intelligence
Abstract :
[en] Cloud cover in multispectral imagery (MSI) significantly hinders early-season crop mapping by corrupting spectral information. Existing Vision Transformer(ViT)-based time-series reconstruction methods, like SMTS-ViT, often employ coarse temporal embeddings that aggregate entire sequences, causing substantial information loss and reducing reconstruction accuracy. To address these limitations, a Video Vision Transformer (ViViT)-based framework with temporal-spatial fusion embedding for MSI reconstruction in cloud-covered regions is proposed in this study. Non-overlapping tubelets are extracted via 3D convolution with constrained temporal span $(t=2)$, ensuring local temporal coherence while reducing cross-day information degradation. Both MSI-only and SAR-MSI fusion scenarios are considered during the experiments. Comprehensive experiments on 2020 Traill County data demonstrate notable performance improvements: MTS-ViViT achieves a 2.23\% reduction in MSE compared to the MTS-ViT baseline, while SMTS-ViViT achieves a 10.33\% improvement with SAR integration over the SMTS-ViT baseline. The proposed framework effectively enhances spectral reconstruction quality for robust agricultural monitoring.
Disciplines :
Computer science
Author, co-author :
WANG, Yiqun  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > SEDAN > Team Radu STATE
LI, Lujun  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
YUE, Meiru ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
STATE, Radu  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
External co-authors :
no
Language :
English
Title :
Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach
Publication date :
2025
Journal title :
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
ISSN :
2194-9042
eISSN :
2194-9050
Publisher :
Copernicus, Goettingen, Germany
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBilu :
since 05 March 2026

Statistics


Number of views
31 (2 by Unilu)
Number of downloads
15 (1 by Unilu)

Bibliography


Similar publications



Contact ORBilu