Abstract:
An increasing number of financial software systems rely on machine learning (ML) models to support human decision-makers.
Although these models have shown satisfactory performance in classifying financial transactions, maintaining such ML systems remains a challenge.
After deployment in production, the performance of the models tends to degrade over time due to concept drift.
Methods have been proposed to detect concept drift and to retrain new models upon detection, thereby mitigating the drop in performance.
However, little is known about the effectiveness of such methods in an industrial context.
In particular, their evaluation fails to account for the delay between the detection of a drift and the deployment of a new model.
This delay is inherent to the strict quality assurance and manual validation processes that financial (and other critical) institutions impose on their software systems.
To address this limitation, we formalize the problem of retraining ML models against distribution drift in the presence of delay and propose a novel protocol to evaluate drift detectors.
We report on an empirical study conducted on the transaction system of our industrial partner, BGL BNP Paribas, and two publicly available datasets: Lending Club Loan Data and Electricity.
We release our tool and benchmark on GitHub.
We demonstrate, for the first time, that ignoring deployment delays when evaluating drift detectors overestimates their ability to mitigate performance degradation, by up to 39.86% in our industrial application.
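The core idea can be illustrated with a minimal sketch: a prequential (predict-then-learn) evaluation over a synthetic binary stream with one abrupt concept drift, a simple windowed error-rate detector, and a deployment delay between detection and model replacement. All names, thresholds, and the detector itself are illustrative assumptions, not the detectors or protocol from the paper.

```python
import random

def evaluate(delay, n=2000, drift_at=1000, window=50, threshold=0.4, seed=0):
    """Prequential evaluation of a drift detector under deployment delay.
    The detector fires when the recent error rate exceeds `threshold`;
    the retrained model only goes live `delay` steps after detection.
    (Hypothetical toy setup, not the paper's protocol.)"""
    rng = random.Random(seed)
    concept = 0          # ground truth: label = x XOR concept
    model = 0            # deployed model's concept assumption
    pending = None       # step at which the retrained model goes live
    errors, recent = 0, []
    for t in range(n):
        if t == drift_at:
            concept = 1                      # abrupt concept drift
        x = rng.randint(0, 1)
        y = x ^ concept
        errors_now = int((x ^ model) != y)   # predict before learning
        errors += errors_now
        recent.append(errors_now)
        if len(recent) > window:
            recent.pop(0)
        if pending is not None and t >= pending:
            model = concept                  # deploy retrained model
            pending, recent = None, []
        elif pending is None and len(recent) == window \
                and sum(recent) / window > threshold:
            pending = t + delay              # detection: schedule deployment
    return 1 - errors / n                    # overall accuracy

# Evaluating with zero delay overstates how well the detector
# mitigates the performance drop compared to a delayed deployment.
print(evaluate(delay=0), evaluate(delay=300))
```

Comparing the two accuracies shows the overestimation effect the abstract describes: the same detector looks substantially better when the deployment delay is ignored.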
Research center:
NCER-FT - FinTech National Centre of Excellence in Research
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SerVal - Security, Reasoning & Validation