Academic Journal

Enhanced Machine Learning Framework for Autonomous Depression Detection Using Modwave Cepstral Fusion and Stochastic Embedding

التفاصيل البيبلوغرافية
العنوان: Enhanced Machine Learning Framework for Autonomous Depression Detection Using Modwave Cepstral Fusion and Stochastic Embedding
المؤلفون: Jithin Jacob, K.S. Kannan
المصدر: Информатика и автоматизация, Vol 23, Iss 6, Pp 1754-1783 (2024)
بيانات النشر: Russian Academy of Sciences, St. Petersburg Federal Research Center, 2024.
سنة النشر: 2024
المجموعة: LCC:Electronic computers. Computer science
مصطلحات موضوعية: depression detection, machine learning, modwave cepstral fusion, background noise, xgboost classifier, daic-woz dataset, autonomous detection system, accuracy, Electronic computers. Computer science, QA75.5-76.95
الوصف: Depression is a prevalent mental illness that requires autonomous detection systems due to its complexity. Existing machine learning techniques face challenges such as background noise sensitivity, slow adaptation speed, and imbalanced data. To address these limitations, this study proposes a novel ModWave Cepstral Fusion and Stochastic Embedding Framework for depression prediction. Then, the Gain Modulated Wavelet Technique removes background noise and normalises audio signals. Difficulties with generalisation, which results in a lack of interpretability, hinder extracting relevant characteristics from speech. To address these issues, an Auto Cepstral Fusion extracts relevant features from speech, capturing temporal and spectral characteristics caused by background voice. Feature selection becomes imperative when choosing relevant features for classification. Selecting irrelevant features can result in overfitting, the curse of dimensionality, and less robustness to noise. Hence, the Principal Stochastic Embedding technique handles high-dimensional data, minimising noise influence and dimensionality. Furthermore, the XGBoost classifier differentiates between depressed and non-depressed individuals. As a result, the proposed method uses the DAIC-WOZ dataset from USC for detecting depressions, achieving an accuracy of 97.02%, precision of 97.02%, recall of 97.02%, F1-score of 97.02%, RMSE of 2.00, and MAE of 0.9, making it a promising tool for autonomous depression detection.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
Russian
تدمد: 2713-3192
2713-3206
Relation: https://ia.spcras.ru/index.php/sp/article/view/16527; https://doaj.org/toc/2713-3192; https://doaj.org/toc/2713-3206
DOI: 10.15622/ia.23.6.7
URL الوصول: https://doaj.org/article/848eb638dfeb463388e4b6361ce3a18b
رقم الانضمام: edsdoj.848eb638dfeb463388e4b6361ce3a18b
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:27133192
27133206
DOI:10.15622/ia.23.6.7