التفاصيل البيبلوغرافية
العنوان: |
Bangla language textual image description by hybrid neural network model |
المؤلفون: |
Jishan, Md. Asifuzzaman, Mahmud, Khan Raqib, Azad, Abul Kalam Al, Rashid, Mohammad Rifat Ahmmad, Paul, Bijan, Alam, Md. Shahabub |
المصدر: |
Indonesian Journal of Electrical Engineering and Computer Science, 21(2), 757-767, (2021-02-01) |
بيانات النشر: |
Zenodo |
سنة النشر: |
2021 |
المجموعة: |
Zenodo |
مصطلحات موضوعية: |
Bangla natural language descriptors, Convolutional neural network, Hybrid recurrent neural network, Long short-term memory bidirectional recurrent neural network |
الوصف: |
Automatic image captioning task in different language is a challenging task which has not been well investigated yet due to the lack of dataset and effective models. It also requires good understanding of scene and contextual embedding for robust semantic interpretation of images for natural language image descriptor. To generate image descriptor in Bangla, we created a new Bangla dataset of images paired with target language label, named as Bangla natural language image to text (BNLIT) dataset. To deal with the image understanding, we propose a hybrid encoder-decoder model based on encoder-decoder architecture and the model is evaluated on our newly created dataset. This proposed approach achieves significance performance improvement on task of semantic retrieval of images. Our hybrid model uses the convolutional neural network as an encoder whereas the bidirectional long short term memory is used for the sentence representation that decreases the computational complexities without trading off the exactness of the descriptor. The model yielded benchmark accuracy in recovering Bangla natural language and we also conducted a thorough numerical analysis of the model performance on the BNLIT dataset. |
نوع الوثيقة: |
article in journal/newspaper |
اللغة: |
English |
Relation: |
oai:zenodo.org:7069274 |
DOI: |
10.11591/ijeecs.v21.i2.pp757-767 |
الاتاحة: |
https://doi.org/10.11591/ijeecs.v21.i2.pp757-767 |
Rights: |
info:eu-repo/semantics/openAccess ; Creative Commons Attribution 4.0 International ; https://creativecommons.org/licenses/by/4.0/legalcode |
رقم الانضمام: |
edsbas.CF30201 |
قاعدة البيانات: |
BASE |