-
1Academic Journal
المؤلفون: Wei Bi, Qingzhen Xiong, Xingyi Chen, Qingkun Du, Jun Wu, Zhaoyu Zhuang
المصدر: Alexandria Engineering Journal, Vol 118, Iss , Pp 325-336 (2025)
مصطلحات موضوعية: Internet of Things (IoT), TCM education, Visual question answering (VQA), VisualBERT, Multimodal fusion, Deep learning, Engineering (General). Civil engineering (General), TA1-2040
وصف الملف: electronic resource
-
2Academic Journal
المؤلفون: Faheem Shehzad, Aniello Minutolo, Massimo Esposito
المصدر: IEEE Access, Vol 12, Pp 195561-195574 (2024)
مصطلحات موضوعية: Visual question answering (VQA), transformer models, natural language processing, dual-stream architecture, multimodal question answering, attention mechanisms, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
3Academic Journal
المؤلفون: Jinlong He, Gang Liu, Pengfei Li, Xiaonan Su, Wenhua Jiang, Dongze Zhang, Shenjun Zhong
المصدر: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol 17, Pp 14823-14835 (2024)
مصطلحات موضوعية: Multimodal representation learning, parameter-efficient transfer learning, remote sensing (RS) visual question answering (VQA), Ocean engineering, TC1501-1800, Geophysics. Cosmic physics, QC801-809
وصف الملف: electronic resource
-
4Conference
المؤلفون: Dayou Chen, Long Chen, Yiheng Zeng, Craig Hancock, Russell Lock, Simon Solvsten
مصطلحات موضوعية: Fire Safety Compliance, Automated Compliance Checking (ACC), Vision Large Language Models (vLLM), Visual Question Answering (VQA), Computer Vision, Operational Phase Monitoring
Relation: 2134/28023164.v1
-
5Academic Journal
المؤلفون: WANG Yu, SUN Haichun
المصدر: Jisuanji kexue yu tansuo, Vol 17, Iss 7, Pp 1487-1505 (2023)
مصطلحات موضوعية: visual question answering (vqa), modal fusion, visual dialogue, intelligent question answering, cross-modal technology, Electronic computers. Computer science, QA75.5-76.95
وصف الملف: electronic resource
-
6Academic Journal
المؤلفون: Jiangfan Feng, Hui Wang
المصدر: International Journal of Applied Earth Observations and Geoinformation, Vol 126, Iss , Pp 103641- (2024)
مصطلحات موضوعية: Remote sensing, Visual question answering (VQA), Cross-modal, Attention, Multi-scales, Physical geography, GB3-5030, Environmental sciences, GE1-350
وصف الملف: electronic resource
-
7Academic Journal
المؤلفون: Thapa, Surendrabikram, Naseem, Usman, Zhou, Luping, Kim, Jinman
المصدر: Thapa , S , Naseem , U , Zhou , L & Kim , J 2024 , Vision-language models for biomedical applications . in VLM4Bio '24 : proceedings of the First International Workshop on Vision-Language Models for Biomedical Applications . Association for Computing Machinery , New York , pp. 1-2 , First International Workshop on Vision-Language Models for Biomedical Applications (1st : 2024) , Melbourne , Victoria , Australia , 28/10/24 . https://doi.org/10.1145/3689096.3690770
مصطلحات موضوعية: Vision-Language Models (VLMs), Multimodal Biomedical AI, Visual Question Answering (VQA), Clinical Decision Support Systems, Healthcare Applications
وصف الملف: application/pdf
Relation: urn:ISBN:9798400712074
-
8Academic Journal
المؤلفون: Ricci, Riccardo, Bazi, Yakoub, Melgani, Farid
المساهمون: Ricci, Riccardo, Bazi, Yakoub, Melgani, Farid
مصطلحات موضوعية: ChatGPT, image captioning, visual dialoguing, visual question answering (VQA), visual question generation (VQG)
Relation: info:eu-repo/semantics/altIdentifier/wos/WOS:001160070300001; volume:16; issue:3; firstpage:44101; lastpage:44118; numberofpages:18; journal:REMOTE SENSING; https://hdl.handle.net/11572/437939
-
9Academic Journal
المساهمون: Bazi, Yakoub, Bashmal, Laila, Al Rahhal, Mohamad Mahmoud, Ricci, Riccardo, Melgani, Farid
مصطلحات موضوعية: captioning, instruction tuning, Large Language and Vision Assistant Model (LLaVA), large language models (LLMs), remote sensing (RS), visual question answering (VQA)
Relation: info:eu-repo/semantics/altIdentifier/wos/WOS:001219825600001; volume:2024, 16; issue:9; firstpage:147701; lastpage:147718; numberofpages:18; journal:REMOTE SENSING; https://hdl.handle.net/11572/437938
-
10Academic Journal
المؤلفون: Faris Alasmary, Saad Al-Ahmadi
المصدر: IEEE Access, Vol 11, Pp 140967-140980 (2023)
مصطلحات موضوعية: Speech-based visual question answering (SBVQA), question answering, visual question answering (VQA), machine learning, multimodal, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
11Report
المؤلفون: Dheeraj Pai, Deigant Yadava, João Monteiro, Vinay Nair
مصطلحات موضوعية: Knowledge Representation and Machine Learning, Natural Language Processing, multimodal machine learning (ML), Machine Learning Methods, alignment, Reasoning in Machine Learning, Reasoning, Visual Question Answering (VQA)
-
12Academic Journal
المؤلفون: Jiangfan Feng, Etao Tang, Maimai Zeng, Zhujun Gu, Pinglang Kou, Wei Zheng
المصدر: International Journal of Applied Earth Observations and Geoinformation, Vol 122, Iss , Pp 103427- (2023)
مصطلحات موضوعية: Remote sensing, Visual question answering (VQA), Cross-modal, Transformer, Physical geography, GB3-5030, Environmental sciences, GE1-350
وصف الملف: electronic resource
-
13Academic Journal
المؤلفون: Lin, Fang
المصدر: International Journal of Emerging Technologies in Learning (iJET); Vol. 18 No. 22 (2023); pp. 167-182 ; 1863-0383
مصطلحات موضوعية: visual question answering (VQA), education and teaching, college students
وصف الملف: application/pdf
-
14Academic Journal
المؤلفون: Chin-Chen CHANG, Chongqing CHEN, Dezhi HAN, Dun LI, Huimin LI, Kuan-Ching LI
المصدر: IEICE Transactions on Information and Systems. 2023, E106.D(5):581
-
15Academic Journal
المؤلفون: William Ferguson, Dhruv Batra, Raymond Mooney, Devi Parikh, Antonio Torralba, David Bau, David Diller, Josh Fasching, Jaden Fiotto‐Kaufman, Yash Goyal, Jeff Miller, Kerry Moffitt, Alex Montes de Oca, Ramprasaath R. Selvaraju, Ayush Shrivastava, Jialin Wu, Stefan Lee
المصدر: Applied AI Letters, Vol 2, Iss 4, Pp n/a-n/a (2021)
مصطلحات موضوعية: explainable artificial intelligence (XAI), human/computer interaction (HCI), tasking and adapting agents, visual question answering (VQA), Electronic computers. Computer science, QA75.5-76.95
وصف الملف: electronic resource
Relation: https://doaj.org/toc/2689-5595
-
16Conference
المؤلفون: Kervadec, Corentin, Jaunet, Theo, Antipov, Grigory, Baccouche, Moez, Vuillemot, Romain, Wolf, Christian
المساهمون: Laboratoire d'InfoRmatique en Image et Systèmes d'information (LIRIS), Université Lumière - Lyon 2 (UL2)-École Centrale de Lyon (ECL), Université de Lyon-Université de Lyon-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS), Orange Labs, 35512 Cesson-Sévigné, France, Orange Labs R&D Rennes, France Télécom-France Télécom, Situated Interaction, Collaboration, Adaptation and Learning (SICAL), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS)-Université Lumière - Lyon 2 (UL2)-École Centrale de Lyon (ECL), Extraction de Caractéristiques et Identification (imagine), ANR-20-CHIA-0018,REMEMBER,Apprendre Raisonnement, Mémoire et Contrôle(2020)
المصدر: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ; https://hal.science/hal-03192949 ; IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021, Nashville, Tennessee, United States. ⟨10.1109/CVPR46437.2021.00419⟩ ; http://cvpr2021.thecvf.com/
مصطلحات موضوعية: Visual Question Answering (VQA), Deep Learning, Visual Reasoning, [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
جغرافية الموضوع: Nashville, Tennessee, United States
-
17Academic Journal
المصدر: Proceedings of the AAAI Symposium Series; Vol. 3 No. 1: Proceedings of the 2024 AAAI Spring Symposium Series; 234-242 ; 2994-4317
مصطلحات موضوعية: Large Multi-Modal Models (LMMs), Visual Question Answering (VQA), Vision-Language Instruction Tuning (VLIT), Parameter Efficient Fine-Tuning (PEFTs), Semiconductor Science
وصف الملف: application/pdf
-
18Dissertation/ Thesis
المساهمون: Sánchez Ruiz-Granados, Antonio Alejandro, Díaz Agudo, María Belén
مصطلحات موضوعية: 004(043.3), Visual Question Answering (VQA), Case-Based Reasoning (CBR), Razonamiento basado en experiencia, Inteligencia Artificial (IA), Similitud entre imágenes, Detección de objetos, COCO, Descripción de imágenes, Embeddings, IA Explicable (XAI), Experience based reasoning, Artificial Intelligence (AI), Image similarity, Object detection, Image description, Explainable AI (XAI), Informática (Informática), 33 Ciencias Tecnológicas
وصف الملف: application/pdf
Relation: https://hdl.handle.net/20.500.14352/106900; XXXX-XXXX
-
19
المؤلفون: Mobarack Islam, Matt Clarkson, Sophia Bano, Danail Stoyanov, Hani Marcus
مصطلحات موضوعية: Biomedical imaging, Intelligent robotics, Natural language processing, Computer vision, Image processing, Multimodal analysis and synthesis, Visual Question Answering (VQA), large language models in medicine, Large language models (LLMs) in healthcare, Vision Language Models, Pituitary surgery, artificial intelligence analysis, surgical data science
-
20Book
المساهمون: Kondi, L P, Boulgouris, N
المصدر: Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP)
مصطلحات موضوعية: Hierarchical relational attention, Visual Question Answering (VQA), scene understanding
وصف الملف: application/pdf
Relation: https://eprints.qut.edu.au/122718/1/icip_2018_eprint.pdf; Chowdhury, Muhammad Iqbal Hasan, Sridharan, Sridha, Fookes, Clinton, & Nguyen Thanh, Kien (2018) Hierarchical relational attention for video question answering. In Kondi, L P & Boulgouris, N (Eds.) Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP). Institute of Electrical and Electronics Engineers Inc., United States of America, pp. 599-603.; https://eprints.qut.edu.au/122718/; Institute for Future Environments; Science & Engineering Faculty
الاتاحة: https://eprints.qut.edu.au/122718/