Academic Journal

Predicting financial distress in high-dimensional imbalanced datasets: a multi-heterogeneous self-paced ensemble learning framework

التفاصيل البيبلوغرافية
العنوان: Predicting financial distress in high-dimensional imbalanced datasets: a multi-heterogeneous self-paced ensemble learning framework
المؤلفون: Ruize Gao, Shaoze Cui, Yu Wang, Wei Xu
المصدر: Financial Innovation, Vol 11, Iss 1, Pp 1-34 (2025)
بيانات النشر: SpringerOpen, 2025.
سنة النشر: 2025
المجموعة: LCC:Public finance
LCC:Finance
مصطلحات موضوعية: Financial distress prediction, Feature selection, Imbalanced data, Ensemble learning, Particle swarm optimization, Public finance, K4430-4675, Finance, HG1-9999
الوصف: Abstract Financial distress prediction (FDP) is a critical area of study for researchers, industry stakeholders, and regulatory authorities. However, FDP tasks present several challenges, including high-dimensional datasets, class imbalances, and the complexity of parameter optimization. These issues often hinder the predictive model’s ability to accurately identify companies at high risk of financial distress. To mitigate these challenges, we introduce FinMHSPE—a novel multi-heterogeneous self-paced ensemble (MHSPE) FDP learning framework. The proposed model uses pairwise comparisons of data from multiple time frames combined with the maximum relevance and minimum redundancy method to select an optimal subset of features, effectively resolving the high dimensionality issue. Furthermore, the proposed framework incorporates the MHSPE model to iteratively identify the most informative majority class data samples, effectively addressing the class imbalance issue. To optimize the model’s parameters, we leverage the particle swarm optimization algorithm. The robustness of our proposed model is validated through extensive experiments performed on a financial dataset of Chinese listed companies. The empirical results demonstrate that the proposed model outperforms existing competing models in the field of FDP. Specifically, our FinMHSPE framework achieves the highest performance, achieving an area under the curve (AUC) value of 0.9574, considerably surpassing all existing methods. A comparative analysis of AUC values further reveals that FinMHSPE outperforms state-of-the-art approaches that rely on financial features as inputs. Furthermore, our investigation identifies several valuable features for enhancing FDP model performance, notably those associated with a company’s information and growth potential.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2199-4730
Relation: https://doaj.org/toc/2199-4730
DOI: 10.1186/s40854-024-00745-w
URL الوصول: https://doaj.org/article/0bfa2dc32ff94535ab5a4167fdf4f7ee
رقم الانضمام: edsdoj.0bfa2dc32ff94535ab5a4167fdf4f7ee
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:21994730
DOI:10.1186/s40854-024-00745-w