Eprint already available on another site (E-prints, Working papers and Research blog)
Datasets for Advanced Bankruptcy Prediction: A survey and Taxonomy
WANG, Xin Lin; Kräussl, Zsófia; BRORSSON, Mats Håkan
2024
 

Files


Full Text
2411.01928v1.pdf
Author postprint (1.86 MB) Creative Commons License - Attribution, Non-Commercial





Details



Abstract :
[en] Bankruptcy prediction is an important research area that relies heavily on data science. It aims to help investors, managers, and regulators better understand the operational status of corporations and predict potential financial risks in advance. To improve prediction, researchers and practitioners have begun to use a variety of data types, ranging from traditional financial indicators to unstructured data, to construct and optimize bankruptcy forecasting models. Over time, both the data used and the methodology for structuring, cleaning, and analyzing it have improved. With the aid of advanced analytical techniques that deploy machine learning and deep learning algorithms, bankruptcy assessment has become more accurate. However, due to the sensitivity of financial data, the scarcity of valid public datasets remains a key bottleneck for the rapid modeling and evaluation of machine learning algorithms for targeted tasks. This study therefore introduces a taxonomy of datasets for bankruptcy research and summarizes their characteristics. It also proposes a set of metrics to measure the quality and informativeness of public datasets. The taxonomy, coupled with the informativeness measure, aims to provide insights that help researchers and practitioners develop applications for various aspects of credit assessment and decision making by pointing them to appropriate datasets for their studies.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN - Service and Data Management in Distributed Systems
NCER-FT - FinTech National Centre of Excellence in Research
Disciplines :
Computer science
Author, co-author :
WANG, Xin Lin; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN
Kräussl, Zsófia
BRORSSON, Mats Håkan; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN
Language :
English
Title :
Datasets for Advanced Bankruptcy Prediction: A survey and Taxonomy
Publication date :
November 2024
FnR Project :
FNR15403349 - SCRiPT - Sme Credit Risk Platform, 2020 (01/04/2021-31/03/2024) - Radu State
Name of the research project :
SCRIPT
Available on ORBilu :
since 06 January 2025

