Benchmarking Machine Learning Models for Corporate Bankruptcy Prediction using Financial Ratios

Authors

  • Sigit Mulyanto* Department of Management, Faculty of Business, Universitas Darwan Ali, Sampit, Indonesia https://orcid.org/0009-0002-4917-8718
  • Dwika Lovitasari Yonia Department of Information Systems, Faculty of Computer Science, Universitas Darwan Ali, Sampit, Indonesia https://orcid.org/0009-0003-3694-189X
  • Muhammad Arif Department of Islamic Business Management, Faculty of Islamic Economics and Business, Universitas Islam Negeri Syekh Ali Hasan Ahmad Addary Padangsidimpuan, Indonesia
  • Bambang Sutejo Department of Management, Faculty of Business, Universitas Darwan Ali, Sampit, Indonesia

DOI:

https://doi.org/10.55047/jekombital.v4i2.1071

Keywords:

Bankruptcy Prediction, Ensemble Learning, Machine Learning, Model Evaluation, Voting Classifier

Abstract

Corporate bankruptcy prediction is a critical task in financial risk management, particularly under conditions of economic uncertainty and highly imbalanced datasets. This study presents a comprehensive benchmarking framework that evaluates multiple supervised learning models and a voting ensemble approach for corporate bankruptcy prediction. Using a publicly available dataset comprising 78,682 financial records from US-listed companies on NYSE and NASDAQ (1999-2018), we compare the performance of Random Forest, XGBoost, Gradient Boosting, Support Vector Machine, Decision Tree, and a Voting Classifier. Extensive preprocessing, including outlier removal, normalization, and feature selection, and cost-sensitive learning to mitigate severe class imbalance was conducted to ensure data quality. Model performance was assessed using multiple evaluation metrics such as accuracy, F1-score, and ROC AUC to account for class imbalance. Results demonstrate that the Voting Classifier, integrating Random Forest, XGBoost, and Gradient Boosting via hard voting, achieves superior overall performance with an accuracy of 93.6%, F1-score of 96.5%, and ROC AUC of 82.6%, outperforming individual models. The findings underscore the value of ensemble approaches in improving prediction robustness while addressing class imbalance challenges in financial distress forecasting. This study contributes a reproducible experimental design that can guide future research and practical implementation of learning models in corporate bankruptcy risk assessment.

References

Abir, M. I. H., & Salam, T. (2024). Comparative Analysis and Prediction of Machine Learning Algorithms for MRI-Based Alzheimer’s Detection Using Multi-modal Data. 2024 IEEE International Conference on Computing, Applications and Systems (COMPAS), 1–5. https://doi.org/10.1109/COMPAS60761.2024.10797119

Ainan, U. H., Por, L. Y., Chen, Y.-L., Yang, J., & Ku, C. S. (2024). Advancing Bankruptcy Forecasting With Hybrid Machine Learning Techniques: Insights From an Unbalanced Polish Dataset. IEEE Access, 12, 9369–9381. https://doi.org/10.1109/ACCESS.2024.3354173

Akinjole, A., Shobayo, O., Popoola, J., Okoyeigbo, O., & Ogunleye, B. (2024). Ensemble-Based Machine Learning Algorithm for Loan Default Risk Prediction. Mathematics, 12(21), 3423. https://doi.org/10.3390/math12213423

Alam, T. M., Shaukat, K., Mushtaq, M., Ali, Y., Khushi, M., Luo, S., & Wahab, A. (2021). Corporate Bankruptcy Prediction: An Approach Towards Better Corporate World. The Computer Journal, 64(11), 1731–1746. https://doi.org/10.1093/comjnl/bxaa056

Amirshahi, B., & Lahmiri, S. (2024). Bankruptcy prediction using optimal ensemble models under balanced and imbalanced data. Expert Systems, 41(8), e13599. https://doi.org/10.1111/exsy.13599

Arora, I., & Singh, N. (2020). Prediction of Corporate Bankruptcy using Financial Ratios and News. International Journal of Engineering and Management Research, 10(5), 82–87. https://doi.org/10.31033/ijemr.10.5.15

Brygała, M. (2022). Consumer Bankruptcy Prediction Using Balanced and Imbalanced Data. Risks, 10(2), 24. https://doi.org/10.3390/risks10020024

Brygała, M., & Korol, T. (2024). Personal bankruptcy prediction using machine learning techniques. Economics and Business Review, 10(2), 118–142. https://doi.org/10.18559/ebr.2024.2.1149

Chaising, S., & Srimaharaj, W. (2024). Ensemble Framework for Bankruptcy Prediction. 2024 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT & NCON), 545–550. https://doi.org/10.1109/ECTIDAMTNCON60518.2024.10480004

Chakraborty, D. B., & Ranjan, R. (2024). Missing Data Imputation With Contextual Granules and AI-driven Bankruptcy Prediction. 2024 14th International Conference on Pattern Recognition Systems (ICPRS), 1–8. https://doi.org/10.1109/ICPRS62101.2024.10677843

Chen, M., Fan, W., Tang, W., Liu, T., Li, D., & Dib, O. (2024). Review of Machine Learning Algorithms for Breast Cancer Diagnosis (pp. 229–243). https://doi.org/10.1007/978-981-97-0844-4_17

da Silva Mattos, E., & Shasha, D. (2024). Bankruptcy prediction with low-quality financial information. Expert Systems with Applications, 237, 121418. https://doi.org/10.1016/j.eswa.2023.121418

Dasilas, A., & Rigani, A. (2024). Machine learning techniques in bankruptcy prediction: A systematic literature review. Expert Systems with Applications, 255, 124761. https://doi.org/10.1016/j.eswa.2024.124761

Doroshenko, A. V., & Savchuk, D. Y. (2024). Research of data mining methods for classification of imbalanced data sets. Ukrainian Journal of Information Technology, 6(1), 48–57. https://doi.org/10.23939/ujit2024.01.048

Gabrielli, G., Melioli, A., & Bertini, F. (2023). High-dimensional Data from Financial Statements for a Bankruptcy Prediction Model. 2023 IEEE 39th International Conference on Data Engineering Workshops (ICDEW), 1–7. https://doi.org/10.1109/ICDEW58674.2023.00005

Gholampoor, H., & Asadi, M. (2024). Risk Analysis of Bankruptcy in the U.S. Healthcare Industries Based on Financial Ratios: A Machine Learning Analysis. Journal of Theoretical and Applied Electronic Commerce Research, 19(2), 1303–1320. https://doi.org/10.3390/jtaer19020066

Giordani, P., Jacobson, T., Schedvin, E. von, & Villani, M. (2014). Taking the Twists into Account: Predicting Firm Bankruptcy Risk with Splines of Financial Ratios. Journal of Financial and Quantitative Analysis, 49(4), 1071–1099. https://doi.org/10.1017/S0022109014000623

Gnip, P., Kanász, R., Zoričak, M., & Drotár, P. (2025). An experimental survey of imbalanced learning algorithms for bankruptcy prediction. Artificial Intelligence Review, 58(4), 104. https://doi.org/10.1007/s10462-025-11107-y

Gohil, R., S, D., M, V., & J, J. (2023). Election Forecasting with Machine Learning and Sentiment Analysis: Karnataka 2023. 2023 International Conference on Ambient Intelligence, Knowledge Informatics and Industrial Electronics (AIKIIE), 1–6. https://doi.org/10.1109/AIKIIE60097.2023.10390333

Hassan, A., & Yousaf, N. (2022). Bankruptcy Prediction using Diverse Machine Learning Algorithms. 2022 International Conference on Frontiers of Information Technology (FIT), 106–111. https://doi.org/10.1109/FIT57066.2022.00029

Imani, M., Beikmohammadi, A., & Arabnia, H. R. (2025). Comprehensive Analysis of Random Forest and XGBoost Performance with SMOTE, ADASYN, and GNUS Under Varying Imbalance Levels. Technologies, 13(3), 88. https://doi.org/10.3390/technologies13030088

Irmalasari, I., & Dwiyanti, L. (2023). Algorithm Analysis of Decision Tree, Gradient Boosting Decision Tree, and Random Forest for Classification (Case Study: West Java House of Representatives Election 2019). 2023 International Conference on Electrical Engineering and Informatics (ICEEI), 1–5. https://doi.org/10.1109/ICEEI59426.2023.10346727

Islam, J., Saha, S., Hasan, M., Mahmud, A., & Jannat, M. (2024). Cognitive Modelling of Bankruptcy Risk: A Comparative Analysis of Machine Learning Models to Predict the Bankruptcy. 2024 12th International Symposium on Digital Forensics and Security (ISDFS), 1–6. https://doi.org/10.1109/ISDFS60797.2024.10527269

Jandaghi, G., Saranj, A., Rajaei, R., Ghasemi, A., & Tehrani, R. (2021). Identification of the Most Critical Factors in Bankruptcy Prediction and Credit Classification of Companies. Interdisciplinary Journal of Management Studies, 14(4), 817–834. https://doi.org/10.22059/ijms.2021.285398.673712

Liu, X., & He, W. (2022). Adaptive kernel scaling support vector machine with application to a prostate cancer image study. Journal of Applied Statistics, 49(6), 1465–1484. https://doi.org/10.1080/02664763.2020.1870669

Liu, Z., Zhang, H., Xiong, F., Huang, X., Yu, S., Sun, Q., Diao, L., Li, Z., Wu, Y., Zeng, Y., & Huang, C. (2024). Prediction of clinical pregnancy outcome after single fresh blastocyst transfer during in vitro fertilization: an ensemble learning perspective. Human Fertility, 27(1), 1–12. https://doi.org/10.1080/14647273.2024.2422918

Mulyanto, S., Yonia, D. L., & Sutejo, B. (2025). Finance Loan Risk Assessment Using Machine Learning for Credit Eligibility Prediction and Model Optimization. IJISTECH (International Journal of Information System and Technology), 8(5), 303–311. https://doi.org/10.30645/ijistech.v8i5.376

Narvekar, A., & Guha, D. (2021). Bankruptcy prediction using machine learning and an application to the case of the COVID-19 recession. Data Science in Finance and Economics, 1(2), 180–195. https://doi.org/10.3934/DSFE.2021010

Nguyen, H. H., Viviani, J.-L., & Ben Jabeur, S. (2025). Bankruptcy prediction using machine learning and Shapley additive explanations. Review of Quantitative Finance and Accounting, 65(1), 107–148. https://doi.org/10.1007/s11156-023-01192-x

Noh, S.-H. (2023). Comparing the Performance of Corporate Bankruptcy Prediction Models Based on Imbalanced Financial Data. Sustainability, 15(6), 4794. https://doi.org/10.3390/su15064794

Pawełek, B., & Pociecha, J. (2020). Corporate Bankruptcy Prediction with the Use of the Logit Leaf Model. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 129–146). Springer. https://doi.org/10.1007/978-3-030-52348-0_9

Premalatha, G., Priyanka, R., & Chaitya, K. (2023). Feature selection for predicting bankruptcy: Comparative analysis. 2023 Fifth International Conference on Electrical, Computer and Communication Technologies (ICECCT), 1–5. https://doi.org/10.1109/ICECCT56650.2023.10179633

Pretnar Žagar, A., & Demšar, J. (2022). Model Evaluation. In Tourism on the Verge (pp. 253–274). Springer. https://doi.org/10.1007/978-3-030-88389-8_13

Rahman, M. M., Kobir, K. H., Akther, S., & Kallol, M. A. H. (2024). Ensemble Machine Learning for Enhanced Breast Cancer Prediction: A Comparative Study. International Journal of Advanced Computer Science and Applications, 15(7). https://doi.org/10.14569/IJACSA.2024.0150792

Sharmily, R. R., Karthik, B., & Vijayan, T. (2024). Improved MRI based Automatic Brain Tumor Categorization Employing Deep Learning Techniques. 2024 4th Asian Conference on Innovation in Technology (ASIANCON), 1–5. https://doi.org/10.1109/ASIANCON62057.2024.10837709

Shetty, S., Musa, M., & Brédart, X. (2022). Bankruptcy Prediction Using Machine Learning Techniques. Journal of Risk and Financial Management, 15(1), 35. https://doi.org/10.3390/jrfm15010035

Smiti, S., Soui, M., & Ghedira, K. (2024). Tri-XGBoost model improved by BLSmote-ENN: an interpretable semi-supervised approach for addressing bankruptcy prediction. Knowledge and Information Systems, 66(7), 3883–3920. https://doi.org/10.1007/s10115-024-02067-w

Tien, H. L. Q., Quang Tran, L., & Hop Do, T. (2022). An Empirical Study on Bankruptcy Prediction using Ensemble Learning. 2022 RIVF International Conference on Computing and Communication Technologies (RIVF), 173–178. https://doi.org/10.1109/RIVF55975.2022.10013848

Yotsawat, W., Phodong, K., Promrat, T., & Wattuya, P. (2023). Bankruptcy prediction model using cost-sensitive extreme gradient boosting in the context of imbalanced datasets. International Journal of Electrical and Computer Engineering (IJECE), 13(4), 4683. https://doi.org/10.11591/ijece.v13i4.pp4683-4691

Downloads

Published

2025-11-29

Issue

Section

Articles

How to Cite

Benchmarking Machine Learning Models for Corporate Bankruptcy Prediction using Financial Ratios. (2025). JURNAL EKONOMI KREATIF DAN MANAJEMEN BISNIS DIGITAL, 4(2), 357-374. https://doi.org/10.55047/jekombital.v4i2.1071