Attentional Feature Fusion

Dai, Yimian; Gieseke, Fabian; Oehmcke, Stefan; Wu, Yiquan; Barnard, Kobus

Computer Science > Computer Vision and Pattern Recognition

arXiv:2009.14082 (cs)

[Submitted on 29 Sep 2020 (v1), last revised 9 Nov 2020 (this version, v2)]

Title:Attentional Feature Fusion

Authors:Yimian Dai, Fabian Gieseke, Stefan Oehmcke, Yiquan Wu, Kobus Barnard

View PDF

Abstract:Feature fusion, the combination of features from different layers or branches, is an omnipresent part of modern network architectures. It is often implemented via simple operations, such as summation or concatenation, but this might not be the best choice. In this work, we propose a uniform and general scheme, namely attentional feature fusion, which is applicable for most common scenarios, including feature fusion induced by short and long skip connections as well as within Inception layers. To better fuse features of inconsistent semantics and scales, we propose a multi-scale channel attention module, which addresses issues that arise when fusing features given at different scales. We also demonstrate that the initial integration of feature maps can become a bottleneck and that this issue can be alleviated by adding another level of attention, which we refer to as iterative attentional feature fusion. With fewer layers or parameters, our models outperform state-of-the-art networks on both CIFAR-100 and ImageNet datasets, which suggests that more sophisticated attention mechanisms for feature fusion hold great potential to consistently yield better results compared to their direct counterparts. Our codes and trained models are available online.

Comments:	Accepted by WACV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2009.14082 [cs.CV]
	(or arXiv:2009.14082v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2009.14082

Submission history

From: Yimian Dai [view email]
[v1] Tue, 29 Sep 2020 15:10:18 UTC (11,200 KB)
[v2] Mon, 9 Nov 2020 17:41:20 UTC (3,588 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Attentional Feature Fusion

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Attentional Feature Fusion

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators