Adding New Tasks to a Single Network with Weight Transformations using Binary Masks

Mancini, Massimiliano; Ricci, Elisa; Caputo, Barbara; Bulò, Samuel Rota

Computer Science > Computer Vision and Pattern Recognition

arXiv:1805.11119 (cs)

[Submitted on 28 May 2018 (v1), last revised 14 Jun 2018 (this version, v2)]

Title:Adding New Tasks to a Single Network with Weight Transformations using Binary Masks

Authors:Massimiliano Mancini, Elisa Ricci, Barbara Caputo, Samuel Rota Bulò

View PDF

Abstract:Visual recognition algorithms are required today to exhibit adaptive abilities. Given a deep model trained on a specific, given task, it would be highly desirable to be able to adapt incrementally to new tasks, preserving scalability as the number of new tasks increases, while at the same time avoiding catastrophic forgetting issues. Recent work has shown that masking the internal weights of a given original conv-net through learned binary variables is a promising strategy. We build upon this intuition and take into account more elaborated affine transformations of the convolutional weights that include learned binary masks. We show that with our generalization it is possible to achieve significantly higher levels of adaptation to new tasks, enabling the approach to compete with fine tuning strategies by requiring slightly more than 1 bit per network parameter per additional task. Experiments on two popular benchmarks showcase the power of our approach, that achieves the new state of the art on the Visual Decathlon Challenge.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1805.11119 [cs.CV]
	(or arXiv:1805.11119v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1805.11119

Submission history

From: Massimiliano Mancini [view email]
[v1] Mon, 28 May 2018 18:22:42 UTC (446 KB)
[v2] Thu, 14 Jun 2018 17:26:08 UTC (446 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2018-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Massimiliano Mancini
Elisa Ricci
Barbara Caputo
Samuel Rota Bulò

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Adding New Tasks to a Single Network with Weight Transformations using Binary Masks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Adding New Tasks to a Single Network with Weight Transformations using Binary Masks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators