Connectivity-Inspired Network for Context-Aware Recognition

التفاصيل البيبلوغرافية
العنوان: Connectivity-Inspired Network for Context-Aware Recognition
المؤلفون: Carloni, Gianluca, Colantonio, Sara
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Image and Video Processing, I.2, I.4, I.5, J.3, J.6
الوصف: The aim of this paper is threefold. We inform the AI practitioner about the human visual system with an extensive literature review; we propose a novel biologically motivated neural network for image classification; and, finally, we present a new plug-and-play module to model context awareness. We focus on the effect of incorporating circuit motifs found in biological brains to address visual recognition. Our convolutional architecture is inspired by the connectivity of human cortical and subcortical streams, and we implement bottom-up and top-down modulations that mimic the extensive afferent and efferent connections between visual and cognitive areas. Our Contextual Attention Block is simple and effective and can be integrated with any feed-forward neural network. It infers weights that multiply the feature maps according to their causal influence on the scene, modeling the co-occurrence of different objects in the image. We place our module at different bottlenecks to infuse a hierarchical context awareness into the model. We validated our proposals through image classification experiments on benchmark data and found a consistent improvement in performance and the robustness of the produced explanations via class activation. Our code is available at https://github.com/gianlucarloni/CoCoReco.
Comment: ECCV 2024 - HCV Workshop, Accepted for presentation, Submitted Manuscript Version (adapted to include author names, Acknowledgements, and reference DOIs): the version of the manuscript improved after peer review will appear in the Proceedings later
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2409.04360
رقم الانضمام: edsarx.2409.04360
قاعدة البيانات: arXiv
ResultId 1
Header edsarx
arXiv
edsarx.2409.04360
1128
3
Report
report
1128.00402832031
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2409.04360&custid=s6537998&authtype=sso
FullText Array ( [Availability] => 0 )
Array ( [0] => Array ( [Url] => http://arxiv.org/abs/2409.04360 [Name] => EDS - Arxiv [Category] => fullText [Text] => View record in Arxiv [MouseOverText] => View record in Arxiv ) )
Items Array ( [Name] => Title [Label] => Title [Group] => Ti [Data] => Connectivity-Inspired Network for Context-Aware Recognition )
Array ( [Name] => Author [Label] => Authors [Group] => Au [Data] => <searchLink fieldCode="AR" term="%22Carloni%2C+Gianluca%22">Carloni, Gianluca</searchLink><br /><searchLink fieldCode="AR" term="%22Colantonio%2C+Sara%22">Colantonio, Sara</searchLink> )
Array ( [Name] => DatePubCY [Label] => Publication Year [Group] => Date [Data] => 2024 )
Array ( [Name] => Subset [Label] => Collection [Group] => HoldingsInfo [Data] => Computer Science )
Array ( [Name] => Subject [Label] => Subject Terms [Group] => Su [Data] => <searchLink fieldCode="DE" term="%22Computer+Science+-+Computer+Vision+and+Pattern+Recognition%22">Computer Science - Computer Vision and Pattern Recognition</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+Science+-+Artificial+Intelligence%22">Computer Science - Artificial Intelligence</searchLink><br /><searchLink fieldCode="DE" term="%22Electrical+Engineering+and+Systems+Science+-+Image+and+Video+Processing%22">Electrical Engineering and Systems Science - Image and Video Processing</searchLink><br /><searchLink fieldCode="DE" term="%22I%2E2%22">I.2</searchLink><br /><searchLink fieldCode="DE" term="%22I%2E4%22">I.4</searchLink><br /><searchLink fieldCode="DE" term="%22I%2E5%22">I.5</searchLink><br /><searchLink fieldCode="DE" term="%22J%2E3%22">J.3</searchLink><br /><searchLink fieldCode="DE" term="%22J%2E6%22">J.6</searchLink> )
Array ( [Name] => Abstract [Label] => Description [Group] => Ab [Data] => The aim of this paper is threefold. We inform the AI practitioner about the human visual system with an extensive literature review; we propose a novel biologically motivated neural network for image classification; and, finally, we present a new plug-and-play module to model context awareness. We focus on the effect of incorporating circuit motifs found in biological brains to address visual recognition. Our convolutional architecture is inspired by the connectivity of human cortical and subcortical streams, and we implement bottom-up and top-down modulations that mimic the extensive afferent and efferent connections between visual and cognitive areas. Our Contextual Attention Block is simple and effective and can be integrated with any feed-forward neural network. It infers weights that multiply the feature maps according to their causal influence on the scene, modeling the co-occurrence of different objects in the image. We place our module at different bottlenecks to infuse a hierarchical context awareness into the model. We validated our proposals through image classification experiments on benchmark data and found a consistent improvement in performance and the robustness of the produced explanations via class activation. Our code is available at https://github.com/gianlucarloni/CoCoReco.<br />Comment: ECCV 2024 - HCV Workshop, Accepted for presentation, Submitted Manuscript Version (adapted to include author names, Acknowledgements, and reference DOIs): the version of the manuscript improved after peer review will appear in the Proceedings later )
Array ( [Name] => TypeDocument [Label] => Document Type [Group] => TypDoc [Data] => Working Paper )
Array ( [Name] => URL [Label] => Access URL [Group] => URL [Data] => <link linkTarget="URL" linkTerm="http://arxiv.org/abs/2409.04360" linkWindow="_blank">http://arxiv.org/abs/2409.04360</link> )
Array ( [Name] => AN [Label] => Accession Number [Group] => ID [Data] => edsarx.2409.04360 )
RecordInfo Array ( [BibEntity] => Array ( [Subjects] => Array ( [0] => Array ( [SubjectFull] => Computer Science - Computer Vision and Pattern Recognition [Type] => general ) [1] => Array ( [SubjectFull] => Computer Science - Artificial Intelligence [Type] => general ) [2] => Array ( [SubjectFull] => Electrical Engineering and Systems Science - Image and Video Processing [Type] => general ) [3] => Array ( [SubjectFull] => I.2 [Type] => general ) [4] => Array ( [SubjectFull] => I.4 [Type] => general ) [5] => Array ( [SubjectFull] => I.5 [Type] => general ) [6] => Array ( [SubjectFull] => J.3 [Type] => general ) [7] => Array ( [SubjectFull] => J.6 [Type] => general ) ) [Titles] => Array ( [0] => Array ( [TitleFull] => Connectivity-Inspired Network for Context-Aware Recognition [Type] => main ) ) ) [BibRelationships] => Array ( [HasContributorRelationships] => Array ( [0] => Array ( [PersonEntity] => Array ( [Name] => Array ( [NameFull] => Carloni, Gianluca ) ) ) [1] => Array ( [PersonEntity] => Array ( [Name] => Array ( [NameFull] => Colantonio, Sara ) ) ) ) [IsPartOfRelationships] => Array ( [0] => Array ( [BibEntity] => Array ( [Dates] => Array ( [0] => Array ( [D] => 06 [M] => 09 [Type] => published [Y] => 2024 ) ) ) ) ) ) )
IllustrationInfo