Academic Journal
Emotion Recognition in Kurdish Speech from the Sorani Dialect Corpus
العنوان: | Emotion Recognition in Kurdish Speech from the Sorani Dialect Corpus |
---|---|
المؤلفون: | Omar Nematullah, Shavan Askar, Shahab Wahhab, Bzar Khidir |
المصدر: | Zanco Journal of Pure and Applied Sciences, Vol 36, Iss 5 (2024) |
بيانات النشر: | Salahaddin University-Erbil, 2024. |
سنة النشر: | 2024 |
المجموعة: | LCC:Technology LCC:Science |
مصطلحات موضوعية: | Emotion recognition, Speech analysis, Sorani Kurdish, Technology, Science |
الوصف: | Given the increasing need for interactive human-computer applications, the field of employing machine learning algorithms to discern emotions from speech has seen a substantial surge in interest. While emotion recognition systems have made substantial progress in languages like German, English, Spanish, Dutch, and Danish, the availability of comprehensive datasets for the Kurdish language remains notably limited. This paper addresses this gap by focusing on emotion recognition in Sorani Kurdish dialect speech data, which was carefully gathered from openly available videos from the YouTube platform and categorized into four clear supposed emotions: neutral, sadness, happiness, and anger. The study applied both natural Mel Spectrogram and Mel-Frequency Cepstral Coefficient (MFCC) features for various spectrals, followed by the classification models K-Nearest Neighbor (KNN), Multi-Layer Perceptron (MLP), and Support Vector Machine (SVM) to evaluate the results. By closely examining and contrasting the results of using several methods for feature extraction, it was found that SVM obtained a higher accuracy, reaching as much as 85.57%. This is so much more than the first Kurdish emotion classification technique for the recognition of the emotion of the words. |
نوع الوثيقة: | article |
وصف الملف: | electronic resource |
اللغة: | English |
تدمد: | 2218-0230 2412-3986 |
Relation: | https://zancojournal.su.edu.krd/index.php/JPAS/article/view/2402; https://doaj.org/toc/2218-0230; https://doaj.org/toc/2412-3986 |
DOI: | 10.21271/ZJPAS.36.5.10 |
URL الوصول: | https://doaj.org/article/7c5f05e43a004f20a8b50a2e6527bd98 |
رقم الانضمام: | edsdoj.7c5f05e43a004f20a8b50a2e6527bd98 |
قاعدة البيانات: | Directory of Open Access Journals |
ResultId |
1 |
---|---|
Header |
edsdoj Directory of Open Access Journals edsdoj.7c5f05e43a004f20a8b50a2e6527bd98 1038 3 Academic Journal academicJournal 1038.01672363281 |
PLink |
https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsdoj&AN=edsdoj.7c5f05e43a004f20a8b50a2e6527bd98&custid=s6537998&authtype=sso |
FullText |
Array
(
[Availability] => 0
)
Array ( [0] => Array ( [Url] => https://doaj.org/article/7c5f05e43a004f20a8b50a2e6527bd98 [Name] => EDS - DOAJ [Category] => fullText [Text] => View record in DOAJ [MouseOverText] => View record in DOAJ ) ) |
Items |
Array
(
[Name] => Title
[Label] => Title
[Group] => Ti
[Data] => Emotion Recognition in Kurdish Speech from the Sorani Dialect Corpus
)
Array ( [Name] => Author [Label] => Authors [Group] => Au [Data] => <searchLink fieldCode="AR" term="%22Omar+Nematullah%22">Omar Nematullah</searchLink><br /><searchLink fieldCode="AR" term="%22Shavan+Askar%22">Shavan Askar</searchLink><br /><searchLink fieldCode="AR" term="%22Shahab+Wahhab%22">Shahab Wahhab</searchLink><br /><searchLink fieldCode="AR" term="%22Bzar+Khidir%22">Bzar Khidir</searchLink> ) Array ( [Name] => TitleSource [Label] => Source [Group] => Src [Data] => Zanco Journal of Pure and Applied Sciences, Vol 36, Iss 5 (2024) ) Array ( [Name] => Publisher [Label] => Publisher Information [Group] => PubInfo [Data] => Salahaddin University-Erbil, 2024. ) Array ( [Name] => DatePubCY [Label] => Publication Year [Group] => Date [Data] => 2024 ) Array ( [Name] => Subset [Label] => Collection [Group] => HoldingsInfo [Data] => LCC:Technology<br />LCC:Science ) Array ( [Name] => Subject [Label] => Subject Terms [Group] => Su [Data] => <searchLink fieldCode="DE" term="%22Emotion+recognition%22">Emotion recognition</searchLink><br /><searchLink fieldCode="DE" term="%22Speech+analysis%22">Speech analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Sorani+Kurdish%22">Sorani Kurdish</searchLink><br /><searchLink fieldCode="DE" term="%22Technology%22">Technology</searchLink><br /><searchLink fieldCode="DE" term="%22Science%22">Science</searchLink> ) Array ( [Name] => Abstract [Label] => Description [Group] => Ab [Data] => Given the increasing need for interactive human-computer applications, the field of employing machine learning algorithms to discern emotions from speech has seen a substantial surge in interest. While emotion recognition systems have made substantial progress in languages like German, English, Spanish, Dutch, and Danish, the availability of comprehensive datasets for the Kurdish language remains notably limited. This paper addresses this gap by focusing on emotion recognition in Sorani Kurdish dialect speech data, which was carefully gathered from openly available videos from the YouTube platform and categorized into four clear supposed emotions: neutral, sadness, happiness, and anger. The study applied both natural Mel Spectrogram and Mel-Frequency Cepstral Coefficient (MFCC) features for various spectrals, followed by the classification models K-Nearest Neighbor (KNN), Multi-Layer Perceptron (MLP), and Support Vector Machine (SVM) to evaluate the results. By closely examining and contrasting the results of using several methods for feature extraction, it was found that SVM obtained a higher accuracy, reaching as much as 85.57%. This is so much more than the first Kurdish emotion classification technique for the recognition of the emotion of the words. ) Array ( [Name] => TypeDocument [Label] => Document Type [Group] => TypDoc [Data] => article ) Array ( [Name] => Format [Label] => File Description [Group] => SrcInfo [Data] => electronic resource ) Array ( [Name] => Language [Label] => Language [Group] => Lang [Data] => English ) Array ( [Name] => ISSN [Label] => ISSN [Group] => ISSN [Data] => 2218-0230<br />2412-3986 ) Array ( [Name] => NoteTitleSource [Label] => Relation [Group] => SrcInfo [Data] => https://zancojournal.su.edu.krd/index.php/JPAS/article/view/2402; https://doaj.org/toc/2218-0230; https://doaj.org/toc/2412-3986 ) Array ( [Name] => DOI [Label] => DOI [Group] => ID [Data] => 10.21271/ZJPAS.36.5.10 ) Array ( [Name] => URL [Label] => Access URL [Group] => URL [Data] => <link linkTarget="URL" linkTerm="https://doaj.org/article/7c5f05e43a004f20a8b50a2e6527bd98" linkWindow="_blank">https://doaj.org/article/7c5f05e43a004f20a8b50a2e6527bd98</link> ) Array ( [Name] => AN [Label] => Accession Number [Group] => ID [Data] => edsdoj.7c5f05e43a004f20a8b50a2e6527bd98 ) |
RecordInfo |
Array
(
[BibEntity] => Array
(
[Identifiers] => Array
(
[0] => Array
(
[Type] => doi
[Value] => 10.21271/ZJPAS.36.5.10
)
)
[Languages] => Array
(
[0] => Array
(
[Text] => English
)
)
[Subjects] => Array
(
[0] => Array
(
[SubjectFull] => Emotion recognition
[Type] => general
)
[1] => Array
(
[SubjectFull] => Speech analysis
[Type] => general
)
[2] => Array
(
[SubjectFull] => Sorani Kurdish
[Type] => general
)
[3] => Array
(
[SubjectFull] => Technology
[Type] => general
)
[4] => Array
(
[SubjectFull] => Science
[Type] => general
)
)
[Titles] => Array
(
[0] => Array
(
[TitleFull] => Emotion Recognition in Kurdish Speech from the Sorani Dialect Corpus
[Type] => main
)
)
)
[BibRelationships] => Array
(
[HasContributorRelationships] => Array
(
[0] => Array
(
[PersonEntity] => Array
(
[Name] => Array
(
[NameFull] => Omar Nematullah
)
)
)
[1] => Array
(
[PersonEntity] => Array
(
[Name] => Array
(
[NameFull] => Shavan Askar
)
)
)
[2] => Array
(
[PersonEntity] => Array
(
[Name] => Array
(
[NameFull] => Shahab Wahhab
)
)
)
[3] => Array
(
[PersonEntity] => Array
(
[Name] => Array
(
[NameFull] => Bzar Khidir
)
)
)
)
[IsPartOfRelationships] => Array
(
[0] => Array
(
[BibEntity] => Array
(
[Dates] => Array
(
[0] => Array
(
[D] => 01
[M] => 10
[Type] => published
[Y] => 2024
)
)
[Identifiers] => Array
(
[0] => Array
(
[Type] => issn-print
[Value] => 22180230
)
[1] => Array
(
[Type] => issn-print
[Value] => 24123986
)
)
[Numbering] => Array
(
[0] => Array
(
[Type] => volume
[Value] => 36
)
[1] => Array
(
[Type] => issue
[Value] => 5
)
)
[Titles] => Array
(
[0] => Array
(
[TitleFull] => Zanco Journal of Pure and Applied Sciences
[Type] => main
)
)
)
)
)
)
)
|
IllustrationInfo |