Automatic feature engineering, Bankruptcy prediction, Credit risk, Imbalanced data
Abstract :
[en] Feature engineering on financial records for
bankruptcy prediction has traditionally relied significantly on
domain knowledge and typically results in a range of financial
ratios but with limited complexity and feature utilization due
to manual design. It is often a time-consuming and error-prone
procedure, confined to the domain experts’ experience, without
taking into account the characteristics of different data sets.
In this paper, we propose an automated feature engineering
approach to generate effective, explainable, and extensible model
training features. The experiments have been conducted using
a publicly available record of financial statements submitted
to the Luxembourg Business Registers. This approach aims to
improve bankruptcy prediction for professionals who may not
possess the necessary engineering expertise or efficient data.
The experimental results suggest that the proposed approach
can provide valuable features for model training and in most
of the cases, the model’s outcomes outperforms predominantly
as compared to the traditional approaches and the well-known
approaches the models, thus can provide valuable features for
model training.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN - Service and Data Management in Distributed Systems NCER-FT - FinTech National Centre of Excellence in Research
Disciplines :
Computer science
Author, co-author :
WANG, Xin Lin ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
Kraussl, Zsofia; Unilu - Université du Luxembourg [LU] > Faculty of Law, Economics and Finance > Department of Finance
ZURAD, Maciej; Yoba S.A.
BRORSSON, Mats Håkan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
External co-authors :
yes
Language :
English
Title :
Effective Automatic Feature Engineering on Financial Statements for Bankruptcy Prediction
Publication date :
19 July 2023
Event name :
2023 3rd International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)
Event date :
19-21 July 2023
By request :
Yes
Main work title :
Effective Automatic Feature Engineering on Financial Statements for Bankruptcy Prediction
Author, co-author :
WANG, Xin Lin ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
R. A. Howard, "Information value theory, " IEEE Transactions on systems science and cybernetics, vol. 2, no. 1, pp. 22-26, 1966.
J. A. Ohlson, "Financial ratios and the probabilistic prediction of bankruptcy, " Journal of accounting research, pp. 109-131, 1980.
A. Finkelstein, J. Kramer, B. Nuseibeh, L. Finkelstein, and M. Goedicke, "Viewpoints: A framework for integrating multiple perspectives in system development, " International Journal of Software Engineering and Knowledge Engineering, vol. 2, no. 01, pp. 31-57, 1992.
P. P. Pompe and A. Feelders, "Using machine learning, neural networks, and statistics to predict corporate bankruptcy, " Computer-Aided Civil and Infrastructure Engineering, vol. 12, no. 4, pp. 267-276, 1997.
S. A. Ross, R. Westerfield, and J. F. Jaffe, Corporate finance. Irwin/McGraw-Hill, 1999.
A. F. Atiya, "Bankruptcy prediction for credit risk using neural networks: A survey and new results, " IEEE Transactions on neural networks, vol. 12, no. 4, pp. 929-935, 2001.
J. Gordijn and J. Akkermans, "Value-based requirements engineering: Exploring innovative e-commerce ideas, " Requirements engineering, vol. 8, no. 2, pp. 114-134, 2003.
Z. Derzsi and J. Gordijn, "A framework for Business/IT alignment in networked value constellations., " in BUSITAL, 2006.
C.-F. Tsai and J.-W. Wu, "Using neural network ensembles for bankruptcy prediction and credit scoring, " Expert systems with applications, vol. 34, no. 4, pp. 2639-2649, 2008.
L. Nanni and A. Lumini, "An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring, " Expert systems with applications, vol. 36, no. 2, pp. 3028-3033, 2009.
A. Altmann, L. Tolosi, O. Sander, and T. Lengauer, "Permutation importance: A corrected feature importance measure, " Bioinformatics, vol. 26, no. 10, pp. 1340-1347, 2010.
S. Rendle, "Factorization machines, " in 2010 IEEE International conference on data mining, IEEE, 2010, pp. 995-1000.
V. Boguslauskas, R. Mileris, and R. Adlyte, "The selection of financial ratios as independent variables for credit risk assessment, " Economics and management, vol. 16, no. 4, pp. 1032-1040, 2011.
G. Wang, J. Hao, J. Ma, and H. Jiang, "A comparative assessment of ensemble learning for credit scoring, " Expert systems with applications, vol. 38, no. 1, pp. 223-230, 2011.
M. Cowling, W. Liu, and A. Ledger, "Small business financing in the uk before and during the current financial crisis, " International Small Business Journal, vol. 30, no. 7, pp. 778-800, 2012.
N. Siddiqi, Credit risk scorecards: Developing and implementing intelligent credit scoring. John Wiley & Sons, 2012, vol. 3.
J. Kruppa, A. Schwarz, G. Arminger, and A. Ziegler, "Consumer credit risk: Individual probability estimates using machine learning, " Expert Systems with Applications, vol. 40, no. 13, pp. 5125-5131, 2013.
N. Gordini, "A genetic algorithm approach for smes bankruptcy prediction: Empirical evidence from italy, " Expert systems with applications, vol. 41, no. 14, pp. 6433-6445, 2014.
Q. Yu, Y. Miche, E. Severin, and A. Lendasse, "Bankruptcy prediction using extreme learning machine and financial expertise, " Neurocomputing, vol. 128, pp. 296-302, 2014.
J. M. Kanter and K. Veeramachaneni, "Deep feature synthesis: Towards automating data science endeavors, " in 2015 IEEE international conference on data science and advanced analytics (DSAA), IEEE, 2015, pp. 1-10.
M. Zieba, S. K. Tomczak, and J. M. Tomczak, "Ensemble boosted trees with synthetic features generation in application to bankruptcy prediction, " Expert systems with applications, vol. 58, pp. 93-101, 2016.
F. Barboza, H. Kimura, and E. Altman, "Machine learning models and bankruptcy prediction, " Expert Systems with Applications, vol. 83, pp. 405-417, 2017.
A. Beque, K. Coussement, R. Gayler, and S. Lessmann, "Approaches for credit scorecard calibration: An empirical analysis, " Knowledge-Based Systems, vol. 134, pp. 213-227, 2017.
G. Ke, Q. Meng, T. Finley, et al., "Lightgbm: A highly efficient gradient boosting decision tree, " Advances in neural information processing systems, vol. 30, 2017.
T. Hosaka, "Bankruptcy prediction using imaged financial ratios and convolutional neural networks, " Expert systems with applications, vol. 117, pp. 287-299, 2019.
Y. Luo, M. Wang, H. Zhou, et al., "Autocross: Automatic feature crossing for tabular data in real-world applications, " in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019, pp. 1936-1945.
Y. Shi and X. Li, "An overview of bankruptcy prediction models for corporate firms: A systematic literature review, " Intangible Capital, vol. 15, no. 2, pp. 114-127, 2019.
H. Son, C. Hyun, D. Phan, and H. J. Hwang, "Data analytic approach for bankruptcy prediction, " Expert Systems with Applications, vol. 138, p. 112 816, 2019.
W. Song, C. Shi, Z. Xiao, et al., "Autoint: Automatic feature interaction learning via self-attentive neural networks, " in Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, pp. 1161-1170.
Y. Zhu, L. Zhou, C. Xie, G.-J. Wang, and T. V. Nguyen, "Forecasting smes' credit risk in supply chain finance with an enhanced hybrid ensemble machine learning approach, " International Journal of Production Economics, vol. 211, pp. 22-33, 2019.
B. Liu, C. Zhu, G. Li, et al., "Autofis: Automatic feature interaction selection in factorization models for clickthrough rate prediction, " in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2020, pp. 2636-2645.
B. Prasetiyo, M. Muslim, N. Baroroh, et al., "Artificial neural network model for banckrupty prediction, " in Journal of Physics: Conference Series, IOP Publishing, vol. 1567, 2020, p. 032 022.
G. Lombardo, M. Pellegrino, G. Adosoglou, S. Cagnoni, P. M. Pardalos, and A. Poggi, "Machine learning for bankruptcy prediction in the american stock market: Dataset and benchmarks, " Future Internet, vol. 14, no. 8, p. 244, 2022