Compressing deep quaternion neural networks with targeted regularization

Vecchi, Riccardo; Scardapane, Simone; Comminiello, Danilo; Uncini, Aurelio

doi:10.1049/trit.2020.0020

Computer Science > Machine Learning

arXiv:1907.11546 (cs)

[Submitted on 26 Jul 2019 (v1), last revised 13 Jul 2020 (this version, v3)]

Title:Compressing deep quaternion neural networks with targeted regularization

Authors:Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

View PDF

Abstract:In recent years, hyper-complex deep networks (such as complex-valued and quaternion-valued neural networks) have received a renewed interest in the literature. They find applications in multiple fields, ranging from image reconstruction to 3D audio processing. Similar to their real-valued counterparts, quaternion neural networks (QVNNs) require custom regularization strategies to avoid overfitting. In addition, for many real-world applications and embedded implementations, there is the need of designing sufficiently compact networks, with few weights and neurons. However, the problem of regularizing and/or sparsifying QVNNs has not been properly addressed in the literature as of now. In this paper, we show how to address both problems by designing targeted regularization strategies, which are able to minimize the number of connections and neurons of the network during training. To this end, we investigate two extensions of l1 and structured regularization to the quaternion domain. In our experimental evaluation, we show that these tailored strategies significantly outperform classical (real-valued) regularization approaches, resulting in small networks especially suitable for low-power and real-time applications.

Comments:	Published on CAAI Transactions on Intelligence Technology, this https URL
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1907.11546 [cs.LG]
	(or arXiv:1907.11546v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1907.11546
Related DOI:	https://doi.org/10.1049/trit.2020.0020

Submission history

From: Simone Scardapane [view email]
[v1] Fri, 26 Jul 2019 12:55:55 UTC (7,891 KB)
[v2] Tue, 28 Jan 2020 14:50:23 UTC (8,145 KB)
[v3] Mon, 13 Jul 2020 08:23:37 UTC (7,889 KB)

Computer Science > Machine Learning

Title:Compressing deep quaternion neural networks with targeted regularization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Compressing deep quaternion neural networks with targeted regularization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators