DOI: 10.1145/3460120.3484756
Research Article, Public Access

When Machine Unlearning Jeopardizes Privacy

Published: 13 November 2021

Abstract

The right to be forgotten states that a data owner has the right to erase their data from an entity storing it. In the context of machine learning (ML), the right to be forgotten requires an ML model owner to remove the data owner's data from the training set used to build the ML model, a process known as machine unlearning. While originally designed to protect the privacy of the data owner, we argue that machine unlearning may leave some imprint of the data in the ML model and thus create unintended privacy risks. In this paper, we perform the first study of the unintended information leakage caused by machine unlearning. We propose a novel membership inference attack that leverages the different outputs of an ML model's two versions to infer whether a target sample is part of the training set of the original model but not of the training set of the corresponding unlearned model. Our experiments demonstrate that the proposed membership inference attack achieves strong performance. More importantly, we show that in multiple cases our attack outperforms the classical membership inference attack on the original ML model, which indicates that machine unlearning can have counterproductive effects on privacy. We notice that the privacy degradation is especially significant for well-generalized ML models where classical membership inference does not perform well. We further investigate four mechanisms to mitigate the newly discovered privacy risks and show that releasing the predicted label only, temperature scaling, and differential privacy are effective. We believe that our results can help improve privacy protection in practical implementations of machine unlearning. Our code is available at https://github.com/MinChen00/UnlearningLeaks.
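To make the attack intuition concrete, the following is a minimal, self-contained Python sketch of the idea described above, not the authors' exact pipeline: the adversary queries the target sample on both the original and the unlearned model and feeds the resulting pair of posteriors to a binary attack classifier. The feature construction (concatenation plus element-wise difference), the logistic-regression attack model, and the synthetic posteriors are all illustrative assumptions made so the sketch runs on its own; in the actual threat model the attack classifier would be trained on shadow original/unlearned model pairs.

# Illustrative sketch only; posteriors are simulated, not produced by real models.
import numpy as np
from sklearn.linear_model import LogisticRegression

def attack_feature(p_original: np.ndarray, p_unlearned: np.ndarray) -> np.ndarray:
    """Combine the two posterior vectors into a single attack feature vector."""
    return np.concatenate([p_original, p_unlearned, p_original - p_unlearned])

rng = np.random.default_rng(0)

def simulate_pair(deleted: bool):
    """Toy stand-in for querying the original and unlearned models (3-class task)."""
    p_orig = rng.dirichlet(np.ones(3) * (0.3 if deleted else 3.0))
    # If the sample was deleted, the unlearned model's output tends to drift away.
    shift = rng.dirichlet(np.ones(3)) if deleted else p_orig
    p_unl = 0.5 * p_orig + 0.5 * shift
    return p_orig, p_unl

# The adversary trains the attack model on samples whose deletion status it controls.
X, y = [], []
for label in (0, 1):  # 1 = "in the original training set, later deleted"
    for _ in range(200):
        p_o, p_u = simulate_pair(deleted=bool(label))
        X.append(attack_feature(p_o, p_u))
        y.append(label)
attack_clf = LogisticRegression(max_iter=1000).fit(X, y)

# At attack time: query the target sample on both model versions and score it.
p_o, p_u = simulate_pair(deleted=True)
print("P[target was deleted] ~", attack_clf.predict_proba([attack_feature(p_o, p_u)])[0, 1])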

Supplementary Material

MP4 File (CCS21-fp212.mp4)
The right to be forgotten states that a data owner has the right to erase their data from an entity storing it. Under its protection, a data owner whose data was used to train an ML model can require the model provider to erase that data and its corresponding influence, a process known as machine unlearning. While initially designed to protect the privacy of the data owner, we found that machine unlearning may leave an imprint of the data in the ML model and create unintended privacy risks. This video introduces the unintended information leakage caused by machine unlearning through a novel membership inference attack, which infers whether a target sample was part of the original model's training set but was later revoked. In multiple cases, our attack outperforms the classical membership inference attack on the original ML model. We further investigate four mechanisms to mitigate the newly discovered privacy risks. We believe that our findings can help improve privacy protection in practical implementations of machine unlearning.
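As a small illustration of one of the defenses the abstract reports as effective, the sketch below shows temperature scaling: the model owner divides the logits by a temperature T greater than 1 before applying the softmax, which smooths the released posteriors and reduces the membership signal available to the attacker. The specific temperature value and the example logits are assumptions chosen purely for illustration.

import numpy as np

def softmax_with_temperature(logits: np.ndarray, T: float = 4.0) -> np.ndarray:
    # Divide logits by the temperature, then apply a numerically stable softmax.
    z = logits / T
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

logits = np.array([6.0, 1.0, 0.5])
print("T=1:", softmax_with_temperature(logits, T=1.0))  # sharp, more revealing posterior
print("T=4:", softmax_with_temperature(logits, T=4.0))  # smoothed posterior released to users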




      Published In

      CCS '21: Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security
      November 2021
      3558 pages
      ISBN: 9781450384544
      DOI: 10.1145/3460120
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]


      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 13 November 2021


      Author Tags

      1. machine learning security and privacy
      2. machine unlearning
      3. membership inference

      Qualifiers

      • Research-article


      Conference

      CCS '21: 2021 ACM SIGSAC Conference on Computer and Communications Security
      November 15 - 19, 2021
      Virtual Event, Republic of Korea

      Acceptance Rates

      Overall Acceptance Rate 1,261 of 6,999 submissions, 18%


