The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models

Caldarella, Simone; Mancini, Massimiliano; Ricci, Elisa; Aljundi, Rahaf

arXiv:2408.01228 (cs)

[Submitted on 2 Aug 2024 (v1), last revised 19 Aug 2024 (this version, v2)]

Abstract: Vision-Language Models (VLMs) combine visual and textual understanding, making them well suited to diverse tasks such as generating image captions and answering visual questions across various domains. However, these capabilities are built upon training on large amounts of uncurated data crawled from the web. Such data may include sensitive information that VLMs could memorize and leak, raising significant privacy concerns. In this paper, we assess whether these vulnerabilities exist, focusing on identity leakage. Our study leads to three key findings: (i) VLMs leak identity information, even when the vision-language alignment and the fine-tuning use anonymized data; (ii) context has little influence on identity leakage; (iii) simple, widely used anonymization techniques, like blurring, are not sufficient to address the problem. These findings underscore the urgent need for robust privacy protection strategies when deploying VLMs. Ethical awareness and responsible development practices are essential to mitigate these risks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.01228 [cs.CV]
	(or arXiv:2408.01228v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.01228

Submission history

From: Simone Caldarella
[v1] Fri, 2 Aug 2024 12:36:13 UTC (4,584 KB)
[v2] Mon, 19 Aug 2024 13:35:05 UTC (4,584 KB)
