Generative Multi-Label Zero-Shot Learning

Gupta, Akshita; Narayan, Sanath; Khan, Salman; Khan, Fahad Shahbaz; Shao, Ling; van de Weijer, Joost

arXiv:2101.11606 (cs)

[Submitted on 27 Jan 2021 (v1), last revised 31 Jul 2023 (this version, v3)]

Abstract: Multi-label zero-shot learning strives to classify images into multiple unseen categories for which no data is available during training. The test samples can additionally contain seen categories in the generalized variant. Existing approaches rely on learning either shared or label-specific attention from the seen classes. Nevertheless, computing reliable attention maps for unseen classes during inference in a multi-label setting is still a challenge. In contrast, state-of-the-art single-label generative adversarial network (GAN) based approaches learn to directly synthesize the class-specific visual features from the corresponding class attribute embeddings. However, synthesizing multi-label features with GANs is still unexplored in the context of the zero-shot setting. In this work, we introduce different fusion approaches at the attribute-level, feature-level and cross-level (across attribute and feature-levels) for synthesizing multi-label features from their corresponding multi-label class embedding. To the best of our knowledge, our work is the first to tackle the problem of multi-label feature synthesis in the (generalized) zero-shot setting. Comprehensive experiments are performed on three zero-shot image classification benchmarks: NUS-WIDE, Open Images and MS COCO. Our cross-level fusion-based generative approach outperforms the state-of-the-art on all three datasets. Furthermore, we show the generalization capabilities of our fusion approach in the zero-shot detection task on MS COCO, achieving favorable performance against existing methods. The source code is available at this https URL.
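
The difference between the fusion levels named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how fusion is computed, so the sketch assumes that attribute-level fusion averages the class embeddings before generation and that feature-level fusion averages per-class generated features. The cross-level step here is a fixed convex combination used only as a placeholder, and the "generator" is a single random tanh layer rather than a trained GAN. All function names and parameters (`make_generator`, `alpha`, and so on) are hypothetical.

```python
import math
import random


def make_generator(embed_dim, feat_dim, seed=0):
    """Toy stand-in for a conditional generator: one random tanh layer."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(embed_dim)]
               for _ in range(feat_dim)]

    def generate(embedding, noise):
        # feature_j = tanh(w_j . embedding + noise_j)
        return [math.tanh(sum(w * e for w, e in zip(row, embedding)) + n)
                for row, n in zip(weights, noise)]

    return generate


def mean_vectors(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def attribute_level_fusion(generate, class_embeddings, noise):
    # Assumption: fuse the label embeddings first, then generate once.
    return generate(mean_vectors(class_embeddings), noise)


def feature_level_fusion(generate, class_embeddings, noise):
    # Assumption: generate one feature per label, then fuse the features.
    return mean_vectors([generate(e, noise) for e in class_embeddings])


def cross_level_fusion(generate, class_embeddings, noise, alpha=0.5):
    # Placeholder only: a fixed convex combination of the two fused features.
    attr = attribute_level_fusion(generate, class_embeddings, noise)
    feat = feature_level_fusion(generate, class_embeddings, noise)
    return [alpha * a + (1.0 - alpha) * f for a, f in zip(attr, feat)]


# Two hypothetical labels present in one image, with 3-d embeddings.
generate = make_generator(embed_dim=3, feat_dim=4)
embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
noise = [0.0] * 4

alf = attribute_level_fusion(generate, embeddings, noise)
flf = feature_level_fusion(generate, embeddings, noise)
clf = cross_level_fusion(generate, embeddings, noise)
```

Because the toy generator is nonlinear, fusing before generation and fusing after generation generally give different features, which is why combining the two levels can carry information that neither has alone. With a single label, the two assumed fusion rules coincide.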

Comments:	Accepted by TPAMI: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2101.11606 [cs.CV]
	(or arXiv:2101.11606v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2101.11606

Submission history

From: Akshita Gupta
[v1] Wed, 27 Jan 2021 18:56:46 UTC (394 KB)
[v2] Thu, 28 Jan 2021 16:14:42 UTC (394 KB)
[v3] Mon, 31 Jul 2023 14:08:22 UTC (25,373 KB)
