Article (Scientific journals)
Decision support model for time series data augmentation method selection
JOUBAUD, Dorian; KUBLER, Sylvain; Lourenço, Raoni et al.
2024In IEEE Access, p. 1-1
Peer Reviewed verified by ORBi
 

Files


Full Text
BALANCER.pdf
Author preprint (2.58 MB) Creative Commons License - Attribution
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] Data augmentation (DA) plays a crucial role in machine learning by improving model generalization and tackling data scarcity issues, particularly prevalent in domains with limited access to sensitive information or rare events. Despite the availability of various DA techniques for handling imbalanced time-series classification (ITSC) problems, there is a lack of comprehensive guidelines for selecting the most appropriate technique based on input data features and the chosen classifier. This paper empirically demonstrates the limitations of conventional data balancing practices through experiments conducted on 720 ITSC datasets, using 7 classifier architectures and 6 DA techniques (TimeGAN, SMOTE, ADASYN, Random Oversampling, Jittering, Time Warping). Our study not only explores the relationship between DA techniques and the inherent characteristics of ITSC datasets and classifiers but also introduces a novel ML-based decision support system, BALANCER (imBALanced AugmeNtation reCommendER), which has been trained based on empirical data to offer an automated approach for ML practitioners to select the most appropriate DA method for their own/specific application. BALANCER’s recommendation model comes with a prediction of the performance enhancement that is expected from data balancing using the recommended method. Evaluation of BALANCER against traditional mean rank recommendations reveals significant improvements, with BALANCER achieving an average Kendall’s tau of 0.36 (compared to −0.01 for traditional mean rank recommendations) and a root mean square error of 1.5 * 10 −2 on individual predictions. The reasons behind the notable disparity in results between the mean rank recommendation strategy and BALANCER are analyzed using eXplainable AI (XAI), demonstrating that BALANCER can uncover deeper and more complex feature interactions compared to a mean rank recommendation-like strategy.
Disciplines :
Computer science
Author, co-author :
JOUBAUD, Dorian  ;  University of Luxembourg
KUBLER, Sylvain  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
Lourenço, Raoni;  Interdisciplinary Centre for Security, Reliability & Trust (SnT), University of Luxembourg, 29 Av. John F. Kennedy, Kirchberg, Luxembourg
CORDY, Maxime  ;  University of Luxembourg
Traon, Yves Le ;  Interdisciplinary Centre for Security, Reliability & Trust (SnT), University of Luxembourg, 29 Av. John F. Kennedy, Kirchberg, Luxembourg
External co-authors :
no
Language :
English
Title :
Decision support model for time series data augmentation method selection
Publication date :
2024
Journal title :
IEEE Access
ISSN :
2169-3536
Publisher :
Institute of Electrical and Electronics Engineers (IEEE)
Pages :
1-1
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
Fonds National de la Recherche Luxembourg
Available on ORBilu :
since 18 December 2024

Statistics


Number of views
69 (1 by Unilu)
Number of downloads
39 (0 by Unilu)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBilu