Multiple-Instance Multiple-Label Learning for the Classification of Frog Calls with Acoustic Event Detection

التفاصيل البيبلوغرافية
العنوان: Multiple-Instance Multiple-Label Learning for the Classification of Frog Calls with Acoustic Event Detection
المؤلفون: Jie Xie, Paul Roe, Lin Schwarzkopf, Liang Zhang, Kiyomi Yasumiba, Jinglan Zhang, Michael Towsey
المصدر: Lecture Notes in Computer Science ISBN: 9783319336176
ICISP
بيانات النشر: Springer International Publishing, 2016.
سنة النشر: 2016
مصطلحات موضوعية: Learning problem, Computer science, business.industry, Speech recognition, Pattern recognition, 02 engineering and technology, 01 natural sciences, ComputingMethodologies_PATTERNRECOGNITION, Acoustic event detection, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, 010301 acoustics, Classifier (UML)
الوصف: Frog call classification has received increasing attention due to its importance for ecosystem. Traditionally, the classification of frog calls is solved by means of the single-instance single-label classification classifier. However, since different frog species tend to call simultaneously, classifying frog calls becomes a multiple-instance multiple-label learning problem. In this paper, we propose a novel method for the classification of frog species using multiple-instance multiple-label (MIML) classifiers. To be specific, continuous recordings are first segmented into audio clips (10 s). For each audio clip, acoustic event detection is used to segment frog syllables. Then, three feature sets are extracted from each syllable: mask descriptor, profile statistics, and the combination of mask descriptor and profile statistics. Next, a bag generator is applied to those extracted features. Finally, three MIML classifiers, MIML-SVM, MIML-RBF, and MIML-kNN, are employed for tagging each audio clip with different frog species. Experimental results show that our proposed method can achieve high accuracy (81.8 % true positive/negatives) for frog call classification.
ردمك: 978-3-319-33617-6
DOI: 10.1007/978-3-319-33618-3_23
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::fd4bb02812b773f3d92ea5b33649b621
https://doi.org/10.1007/978-3-319-33618-3_23
Rights: CLOSED
رقم الانضمام: edsair.doi...........fd4bb02812b773f3d92ea5b33649b621
قاعدة البيانات: OpenAIRE
الوصف
ردمك:9783319336176
DOI:10.1007/978-3-319-33618-3_23