التفاصيل البيبلوغرافية
العنوان: |
ArabRecognizer: modern standard Arabic speech recognition inspired by DeepSpeech2 utilizing Franco-Arabic. |
المؤلفون: |
Nasef, Mohammed M., Elshall, Amr A., Sauber, Amr M. |
المصدر: |
International Journal of Speech Technology; Sep2024, Vol. 27 Issue 3, p673-686, 14p |
مصطلحات موضوعية: |
ARABIC language, SPEECH perception, ORAL communication, ENGLISH language, NEUROLINGUISTICS |
مستخلص: |
Speech recognition is a critical task in spoken language applications. Globally known models such as DeepSpeech2 are effective for English speech recognition, however, they are not well-suited for languages like Arabic. This paper is interested in recognizing the Arabic language, especially Modern Standard Arabic (MSA). This paper proposed two models that utilize "Franco-Arabic" as an encoding mechanism and additional enhancements to recognize MSA. The first model uses Mel-Frequency Cepstral Coefficients (MFCCs) as input features, while the second employs six sequential Gated Recurrent Unit (GRU) layers. Each model is then followed by a fully connected layer with a dropout layer which helped reduce overfitting. The Connectionist Temporal Classification (CTC) loss is used to calculate the prediction error and to maximize the correct transcription likelihood. Two experiments were conducted for each model. The first experiment involved 41 h of continuous speech over 15 epochs. Whereas, the second one utilized 69 h over 30 epochs. The experiments showed that the first model excels in speed while the second excels in accuracy, and both outperformed the well-known DeepSpeech2. [ABSTRACT FROM AUTHOR] |
|
Copyright of International Journal of Speech Technology is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) |
قاعدة البيانات: |
Complementary Index |