Eprint already available on another site (E-prints, Working papers and Research blog)
NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling
Grooten, Bram; Hasanov, Farid; ZHANG, Chenxiang et al.
2025
 

Files


Full Text
2505.17909v1.pdf
Author postprint (861.24 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Model ensembles; Sparse multi-head architecture; Model-agnostic training; Sparse training
Abstract :
[en] Model ensembles have long been a cornerstone for improving generalization and robustness in deep learning. However, their effectiveness often comes at the cost of substantial computational overhead. To address this issue, state-of-the-art methods aim to replicate ensemble-class performance without requiring multiple independently trained networks. Unfortunately, these algorithms often still demand considerable compute at inference. In response to these limitations, we introduce NeuroTrails, a sparse multi-head architecture with dynamically evolving topology. This unexplored model-agnostic training paradigm improves ensemble performance while reducing the required resources. We analyze the underlying reason for its effectiveness and observe that the various neural trails induced by dynamic sparsity attain a Goldilocks zone of prediction diversity. NeuroTrails displays efficacy with convolutional and transformer-based architectures on computer vision and language tasks. Experiments on ResNet-50/ImageNet, LLaMA-350M/C4, among many others, demonstrate increased accuracy and stronger robustness in zero-shot generalization, while requiring significantly fewer parameters.
Disciplines :
Computer science
Author, co-author :
Grooten, Bram ;  Eindhoven University of Technology, Netherlands
Hasanov, Farid ;  Eindhoven University of Technology, Netherlands ; Unilu - University of Luxembourg
ZHANG, Chenxiang ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
Xiao, Qiao;  Eindhoven University of Technology, Netherlands
WU, Boqian ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; University of Twente, Netherlands
Atashgahi, Zahra;  IKEA Netherlands
Sokar, Ghada;  Google DeepMind ; Eindhoven University of Technology, Netherlands
Liu, Shiwei;  University of Oxford ; Eindhoven University of Technology, Netherlands
Yin, Lu;  University of Surrey ; Eindhoven University of Technology, Netherlands
Mocanu, Elena;  University of Twente, Netherlands
Pechenizkiy, Mykola;  Eindhoven University of Technology, Netherlands
MOCANU, Decebal Constantin  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; Eindhoven University of Technology, Netherlands
 These authors have contributed equally to this work.
Language :
English
Title :
NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling
Publication date :
23 May 2025
Focus Area :
Computational Sciences
Development Goals :
9. Industry, innovation and infrastructure
Available on ORBilu :
since 05 March 2026

Statistics


Number of views
45 (0 by Unilu)
Number of downloads
19 (0 by Unilu)

Bibliography


Similar publications



Contact ORBilu