Article (Scientific journals)
Robust shortcut and disordered robustness: Improving adversarial training through adaptive smoothing
Li, Lin; SPRATLING, Michael
2025In Pattern Recognition, 163, p. 111474
Peer Reviewed verified by ORBi
 

Files


Full Text
reprint.pdf
Publisher postprint (2.34 MB) Creative Commons License - Attribution, Non-Commercial
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Adversarial robustness; Adversarial training; Instance adaptive; Loss smoothing; Neural-networks; Overfitting; Training methods; Computer Vision and Pattern Recognition; Artificial Intelligence
Abstract :
[en] Deep neural networks are highly susceptible to adversarial perturbations: artificial noise that corrupts input data in ways imperceptible to humans but causes incorrect predictions. Among the various defenses against these attacks, adversarial training has emerged as the most effective. In this work, we aim to enhance adversarial training to improve robustness against adversarial attacks. We begin by analyzing how adversarial vulnerability evolves during training from an instance-wise perspective. This analysis reveals two previously unrecognized phenomena: robust shortcut and disordered robustness. We then demonstrate that these phenomena are related to robust overfitting, a well-known issue in adversarial training. Building on these insights, we propose a novel adversarial training method: Instance-adaptive Smoothness Enhanced Adversarial Training (ISEAT). This method jointly smooths the input and weight loss landscapes in an instance-adaptive manner, preventing the exploitation of robust shortcut and thereby mitigating robust overfitting. Extensive experiments demonstrate the efficacy of ISEAT and its superiority over existing adversarial training methods. Code is available at https://github.com/TreeLLi/ISEAT.
Disciplines :
Computer science
Author, co-author :
Li, Lin ;  Department of Informatics, King's College London, London, United Kingdom
SPRATLING, Michael  ;  University of Luxembourg > Faculty of Humanities, Education and Social Sciences (FHSE) > Department of Behavioural and Cognitive Sciences (DBCS) > Cognitive Science and Assessment ; Department of Informatics, King's College London, London, United Kingdom
External co-authors :
yes
Language :
English
Title :
Robust shortcut and disordered robustness: Improving adversarial training through adaptive smoothing
Publication date :
July 2025
Journal title :
Pattern Recognition
ISSN :
0031-3203
eISSN :
1873-5142
Publisher :
Elsevier Ltd
Volume :
163
Pages :
111474
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
China Scholarship Council
Funding text :
The authors gratefully acknowledge use of the King's Computational Research, Engineering and Technology Environment (CREATE) and the Joint Academic Data science Endeavour (JADE) facility for carrying out the experiments described in this paper. This work was funded by a scholarship from the King's - China Scholarship Council (K-CSC).The authors gratefully acknowledge use of the King\u2019s Computational Research, Engineering and Technology Environment (CREATE) and the Joint Academic Data science Endeavour (JADE) facility for carrying out the experiments described in this paper. This work was funded by a scholarship from the King\u2019s - China Scholarship Council (K-CSC) .
Available on ORBilu :
since 21 August 2025

Statistics


Number of views
31 (0 by Unilu)
Number of downloads
22 (0 by Unilu)

Scopus citations®
 
2
Scopus citations®
without self-citations
2
OpenCitations
 
0
OpenAlex citations
 
1
WoS citations
 
1

Bibliography


Similar publications



Contact ORBilu