Academic Journal

Military Image Captioning for Low-Altitude UAV or UGV Perspectives

التفاصيل البيبلوغرافية
العنوان: Military Image Captioning for Low-Altitude UAV or UGV Perspectives
المؤلفون: Lizhi Pan, Chengtian Song, Xiaozheng Gan, Keyu Xu, Yue Xie
المصدر: Drones, Vol 8, Iss 9, p 421 (2024)
بيانات النشر: MDPI AG, 2024.
سنة النشر: 2024
المجموعة: LCC:Motor vehicles. Aeronautics. Astronautics
مصطلحات موضوعية: unmanned aerial vehicle (UAV), military image captioning, unmanned ground vehicle (UGV), image understanding, visual-language model, Motor vehicles. Aeronautics. Astronautics, TL1-4050
الوصف: Low-altitude unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs), which boast high-resolution imaging and agile maneuvering capabilities, are widely utilized in military scenarios and generate a vast amount of image data that can be leveraged for textual intelligence generation to support military decision making. Military image captioning (MilitIC), as a visual-language learning task, provides innovative solutions for military image understanding and intelligence generation. However, the scarcity of military image datasets hinders the advancement of MilitIC methods, especially those based on deep learning. To overcome this limitation, we introduce an open-access benchmark dataset, which was termed the Military Objects in Real Combat (MOCO) dataset. It features real combat images captured from the perspective of low-altitude UAVs or UGVs, along with a comprehensive set of captions. Furthermore, we propose a novel encoder–augmentation–decoder image-captioning architecture with a map augmentation embedding (MAE) mechanism, MAE-MilitIC, which leverages both image and text modalities as a guiding prefix for caption generation and bridges the semantic gap between visual and textual data. The MAE mechanism maps both image and text embeddings onto a semantic subspace constructed by relevant military prompts, and augments the military semantics of the image embeddings with attribute-explicit text embeddings. Finally, we demonstrate through extensive experiments that MAE-MilitIC surpasses existing models in performance on two challenging datasets, which provides strong support for intelligence warfare based on military UAVs and UGVs.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2504-446X
Relation: https://www.mdpi.com/2504-446X/8/9/421; https://doaj.org/toc/2504-446X
DOI: 10.3390/drones8090421
URL الوصول: https://doaj.org/article/80df56e7615547e2b5976a7b3c83bb8b
رقم الانضمام: edsdoj.80df56e7615547e2b5976a7b3c83bb8b
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:2504446X
DOI:10.3390/drones8090421