Dual Discriminator Adversarial Distillation for Data-free Model Compression

Zhao, Haoran; Sun, Xin; Dong, Junyu; Yu, Hui; Zhou, Huiyu


arXiv:2104.05382 (cs)

[Submitted on 12 Apr 2021 (v1), last revised 5 Oct 2021 (this version, v2)]


Abstract: Knowledge distillation is widely used to produce compact, efficient neural networks that can be deployed on edge devices for computer vision tasks. However, almost all top-performing knowledge distillation methods require access to the original training data, which is usually very large and often unavailable. To tackle this problem, we propose a novel data-free approach, named Dual Discriminator Adversarial Distillation (DDAD), to distill a neural network without any training data or meta-data. Specifically, we use a generator to create samples, through dual discriminator adversarial distillation, that mimic the original training data. The generator not only uses the pre-trained teacher's intrinsic statistics stored in its batch normalization layers but also maximizes the discrepancy from the student model. The generated samples are then used to train the compact student network under the supervision of the teacher. The proposed method obtains an efficient student network that closely approximates its teacher network despite using no original training data. Extensive experiments are conducted to demonstrate the effectiveness of the proposed approach on the CIFAR-10, CIFAR-100 and Caltech101 datasets for classification tasks. Moreover, we extend our method to semantic segmentation tasks on several public datasets such as CamVid and NYUv2. All experiments show that our method outperforms all baselines for data-free knowledge distillation.
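The two opposing objectives described in the abstract can be illustrated with a small numeric sketch. The abstract does not give the loss functions, so everything below is an assumption made for illustration: the discrepancy is taken to be the mean absolute difference between teacher and student class probabilities, the batch normalization term is taken to be a squared mismatch between batch statistics and stored running statistics, and the names `discrepancy`, `bn_statistics_loss`, `generator_loss`, `student_loss`, and `bn_weight` are hypothetical. It is not the authors' implementation.

```python
import math


def softmax(logits):
    """Convert a list of logits into class probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def discrepancy(teacher_logits, student_logits):
    """Assumed measure: mean absolute difference of class probabilities."""
    p = softmax(teacher_logits)
    q = softmax(student_logits)
    return sum(abs(a - b) for a, b in zip(p, q)) / len(p)


def bn_statistics_loss(batch_features, running_mean, running_var):
    """Assumed form: squared mismatch between the batch mean/variance of one
    feature channel and the statistics stored in the teacher's BN layer."""
    n = len(batch_features)
    mean = sum(batch_features) / n
    var = sum((x - mean) ** 2 for x in batch_features) / n
    return (mean - running_mean) ** 2 + (var - running_var) ** 2


def generator_loss(teacher_logits, student_logits, batch_features,
                   running_mean, running_var, bn_weight=1.0):
    """Generator: match the teacher's BN statistics while *maximizing*
    teacher-student discrepancy (hence the minus sign)."""
    stats = bn_statistics_loss(batch_features, running_mean, running_var)
    return bn_weight * stats - discrepancy(teacher_logits, student_logits)


def student_loss(teacher_logits, student_logits):
    """Student: *minimize* the same discrepancy on the generated samples."""
    return discrepancy(teacher_logits, student_logits)


if __name__ == "__main__":
    teacher = [5.0, 0.0]
    student = [0.0, 5.0]
    features = [1.0, 3.0]  # one channel of generated-sample activations
    print("generator loss:", generator_loss(teacher, student, features, 2.0, 1.0))
    print("student loss:", student_loss(teacher, student))
```

In this sketch the generator and the student pull the discrepancy in opposite directions, which is the adversarial element; in an actual training loop the two losses would be minimized in alternation with a deep learning framework rather than evaluated on hand-written lists.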

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2104.05382 [cs.CV]
	(or arXiv:2104.05382v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.05382

Submission history

From: Haoran Zhao
[v1] Mon, 12 Apr 2021 12:01:45 UTC (18,247 KB)
[v2] Tue, 5 Oct 2021 01:06:21 UTC (10,371 KB)
