Watermarking LLMs with Weight Quantization

Li, Linyang; Jiang, Botian; Wang, Pengyu; Ren, Ke; Yan, Hang; Qiu, Xipeng

arXiv:2310.11237 (cs)

[Submitted on 17 Oct 2023]

Abstract: The abuse of large language models poses high risks as these models are deployed at an astonishing speed. It is important to protect model weights to prevent malicious usage that violates the licenses of open-source large language models. This paper proposes a novel watermarking strategy that plants watermarks during the quantization process of large language models, without pre-defined triggers during inference. The watermark works when the model is used in fp32 mode and remains hidden when the model is quantized to int8; in this way, users can only run inference on the model, without further supervised fine-tuning. We successfully plant the watermark into open-source large language model weights, including GPT-Neo and LLaMA. We hope the proposed method can provide a potential direction for protecting model weights in the era of large language model applications.
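
The abstract states that the watermark is active in fp32 and hidden in int8, but does not spell out a mechanism. The following is only a minimal sketch of one property that makes such a gap possible, not the authors' procedure: with round-to-nearest quantization, an fp32 weight can move anywhere inside its rounding bin without changing its int8 code. The absmax per-tensor scale, the frozen scale, the `margin` value, and the helper names `quantize_int8` and `perturb_within_bins` are all assumptions made for illustration.

```python
import random

def quantize_int8(weights, scale):
    """Round-to-nearest int8 quantization with a fixed scale (assumed scheme)."""
    return [max(-127, min(127, round(w / scale))) for w in weights]

def perturb_within_bins(weights, scale, rng, margin=0.45):
    """Move each weight inside its own rounding bin so its int8 code is unchanged.

    Hypothetical helper: random shifts stand in for whatever training signal
    would actually carry a watermark.
    """
    out = []
    for w in weights:
        q = max(-127, min(127, round(w / scale)))
        # Any value strictly inside (q - 0.5, q + 0.5) * scale rounds back to q.
        out.append((q + rng.uniform(-margin, margin)) * scale)
    return out

rng = random.Random(0)
original = [rng.gauss(0.0, 0.02) for _ in range(1000)]
# Absmax scale, computed once and frozen before the weights are edited.
scale = max(abs(w) for w in original) / 127

marked = perturb_within_bins(original, scale, rng)

same_int8 = quantize_int8(original, scale) == quantize_int8(marked, scale)
max_shift = max(abs(a - b) for a, b in zip(original, marked))
print(same_int8, max_shift < scale)
```

The two weight lists differ in fp32 yet quantize to identical int8 codes, so any behavior encoded only in those sub-bin differences would disappear after quantization. The sketch depends on the scale staying fixed: if the scale were recomputed from the edited weights, the bins could shift and the codes could change.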

Comments:	Accepted by Findings of EMNLP 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2310.11237 [cs.CL]
	(or arXiv:2310.11237v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.11237

Submission history

From: Pengyu Wang
[v1] Tue, 17 Oct 2023 13:06:59 UTC (309 KB)
