Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping

Xu, Xinyi; Wang, Zhengyang; Deng, Cheng; Yuan, Hao; Ji, Shuiwang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2011.08877 (cs)

[Submitted on 17 Nov 2020 (v1), last revised 25 Aug 2021 (this version, v3)]

Title:Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping

Authors:Xinyi Xu, Zhengyang Wang, Cheng Deng, Hao Yuan, Shuiwang Ji

View PDF

Abstract:Grouping has been commonly used in deep metric learning for computing diverse features. However, current methods are prone to overfitting and lack interpretability. In this work, we propose an improved and interpretable grouping method to be integrated flexibly with any metric learning framework. Our method is based on the attention mechanism with a learnable query for each group. The query is fully trainable and can capture group-specific information when combined with the diversity loss. An appealing property of our method is that it naturally lends itself interpretability. The attention scores between the learnable query and each spatial position can be interpreted as the importance of that position. We formally show that our proposed grouping method is invariant to spatial permutations of features. When used as a module in convolutional neural networks, our method leads to translational invariance. We conduct comprehensive experiments to evaluate our method. Our quantitative results indicate that the proposed method outperforms prior methods consistently and significantly across different datasets, evaluation metrics, base models, and loss functions. For the first time to the best of our knowledge, our interpretation results clearly demonstrate that the proposed method enables the learning of distinct and diverse features across groups. The code is available on this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2011.08877 [cs.CV]
	(or arXiv:2011.08877v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.08877

Submission history

From: Xinyi Xu [view email]
[v1] Tue, 17 Nov 2020 19:08:24 UTC (21,511 KB)
[v2] Thu, 19 Nov 2020 03:19:55 UTC (21,511 KB)
[v3] Wed, 25 Aug 2021 05:42:12 UTC (4,138 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators