Paper published in a book (Scientific congresses, symposiums and conference proceedings)
On the Impact of Industrial Delays when Mitigating Distribution Drifts: an Empirical Study on Real-world Financial Systems
SIMONETTO, Thibault Jean Angel; CORDY, Maxime; GHAMIZI, Salah et al.
2024, in KDD Workshop on Discovering Drift Phenomena in Evolving Data Landscape
Peer reviewed
 

Files


Full Text
on_the_impact_of_industrial_delays_when_mitigating_distribution_drifts_an_empirical_study_on_real_world_financial_systems.pdf
Author preprint (558.94 kB)





Details



Keywords :
ML; distribution-drift; real-world system; AI in finance
Abstract :
[en] An increasing number of financial software systems rely on machine learning (ML) models to support human decision-makers. Although these models have shown satisfactory performance in classifying financial transactions, maintaining such ML systems remains a challenge. After deployment in production, model performance tends to degrade over time due to concept drift. Methods have been proposed to detect concept drift and retrain new models upon detection to mitigate the drop in performance. However, little is known about the effectiveness of such methods in an industrial context. In particular, their evaluation fails to consider the delay between the detection of the drift and the deployment of a new model. This delay is inherent to the strict quality assurance and manual validation processes that financial (and other critical) institutions impose on their software systems. To address this limitation, we formalize the problem of retraining ML models against distribution drift in the presence of delay and propose a novel protocol to evaluate drift detectors. We report on an empirical study conducted on the transaction system of our industrial partner, BGL BNP Paribas, and two publicly available datasets: Lending Club Loan Data and Electricity. We release our tool and benchmark on GitHub. We demonstrate for the first time how ignoring the delays in the evaluation of the drift detectors overestimates their ability to mitigate performance drift, by up to 39.86% for our industrial application.
Research center :
NCER-FT - FinTech National Centre of Excellence in Research
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SerVal - Security, Reasoning & Validation
Disciplines :
Computer science
Author, co-author :
SIMONETTO, Thibault Jean Angel ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
CORDY, Maxime ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal
GHAMIZI, Salah ; LIST - Luxembourg Institute of Science and Technology [LU] > Intelligent Clean Energy Systems
LE TRAON, Yves ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Lefebvre, Clément
Boystov, Andrey ; BGL BNP Paribas
Goujon, Anne ; BGL BNP Paribas
External co-authors :
no
Language :
English
Title :
On the Impact of Industrial Delays when Mitigating Distribution Drifts: an Empirical Study on Real-world Financial Systems
Publication date :
2024
Event name :
Discovering Drift Phenomena in Evolving Landscape (DELTA 2024)
Event date :
2024
Main work title :
KDD Workshop on Discovering Drift Phenomena in Evolving Data Landscape
Publisher :
Springer
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Name of the research project :
U-AGR-7180 - BRIDGES2022-1/17437536/TIMELESS BGL Cont - CORDY Maxime
Available on ORBilu :
since 08 December 2024

