Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem

Wang, Zheng; Ruan, Wenjie

Computer Science > Computer Vision and Pattern Recognition

arXiv:2208.00906 (cs)

[Submitted on 1 Aug 2022]

Title:Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem

Authors:Zheng Wang, Wenjie Ruan

View PDF

Abstract:Recent research on the robustness of deep learning has shown that Vision Transformers (ViTs) surpass the Convolutional Neural Networks (CNNs) under some perturbations, e.g., natural corruption, adversarial attacks, etc. Some papers argue that the superior robustness of ViT comes from the segmentation of its input images; others say that the Multi-head Self-Attention (MSA) is the key to preserving the robustness. In this paper, we aim to introduce a principled and unified theoretical framework to investigate such an argument on ViT's robustness. We first theoretically prove that, unlike Transformers in Natural Language Processing, ViTs are Lipschitz continuous. Then we theoretically analyze the adversarial robustness of ViTs from the perspective of the Cauchy Problem, via which we can quantify how the robustness propagates through layers. We demonstrate that the first and last layers are the critical factors to affect the robustness of ViTs. Furthermore, based on our theory, we empirically show that unlike the claims from existing research, MSA only contributes to the adversarial robustness of ViTs under weak adversarial attacks, e.g., FGSM, and surprisingly, MSA actually comprises the model's adversarial robustness under stronger attacks, e.g., PGD attacks.

Comments:	Accepted by ECML-PKDD 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2208.00906 [cs.CV]
	(or arXiv:2208.00906v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2208.00906

Submission history

From: Zheng Wang [view email]
[v1] Mon, 1 Aug 2022 14:50:29 UTC (730 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators