DOI: 10.1145/3583780.3614957
Research Article · Public Access

Manipulating Out-Domain Uncertainty Estimation in Deep Neural Networks via Targeted Clean-Label Poisoning

Published: 21 October 2023

Abstract

Robust out-domain uncertainty estimation has attracted growing attention for its capacity to provide adversary-resistant uncertainty estimates on out-domain samples. However, existing work on robust uncertainty estimation focuses mainly on evasion attacks, which occur at test time; the threat of poisoning attacks against uncertainty models remains largely unexplored. Compared to evasion attacks, poisoning attacks do not need to modify test data and are therefore more practical in real-world applications. In this work, we systematically investigate the robustness of state-of-the-art uncertainty estimation algorithms against data poisoning attacks, with the ultimate objective of developing robust uncertainty training methods. In particular, we focus on attacking out-domain uncertainty estimation. The proposed attack corrupts the model's training process so that a fake high-confidence region is established around the targeted out-domain sample, which the model would otherwise have rejected due to low confidence. More critically, our attack is clean-label and targeted: it leaves the poisoned data with clean labels and attacks a specific targeted test sample without degrading the overall model performance. We evaluate the proposed attack on several image benchmark datasets and a real-world application of COVID-19 misinformation detection. Extensive experimental results across different tasks suggest that state-of-the-art uncertainty estimation methods can be extremely vulnerable and easily corrupted by the proposed attack.
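The attack described above (clean-label poisons that carve out a fake high-confidence region around a chosen out-domain sample) follows the general recipe of feature-collision-style clean-label poisoning. The sketch below is a minimal illustration of that recipe, not the paper's exact optimization: `feature_extractor`, `target_ood_x`, and `base_xs` are hypothetical placeholders, and the feature-matching objective, optimizer, and l_inf budget are assumptions.

```python
import torch

def craft_clean_label_poisons(feature_extractor, target_ood_x, base_xs,
                              eps=8 / 255, steps=200, lr=0.01):
    """Hedged sketch of feature-collision-style clean-label poisoning.

    Perturbs clean in-domain base images (their labels are left untouched)
    so that their features move toward the targeted out-domain sample.
    A model later trained on these poisons may then assign high confidence
    to the target, i.e., a fake high-confidence region forms around it.
    All names and hyperparameters here are illustrative assumptions.
    """
    with torch.no_grad():
        target_feat = feature_extractor(target_ood_x)

    delta = torch.zeros_like(base_xs, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)

    for _ in range(steps):
        poison_feat = feature_extractor(base_xs + delta)
        # Pull the poisons' features toward the out-domain target's features.
        loss = (poison_feat - target_feat).pow(2).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
        with torch.no_grad():
            delta.clamp_(-eps, eps)  # keep the perturbation small and hard to notice
            delta.copy_((base_xs + delta).clamp(0.0, 1.0) - base_xs)  # stay in valid pixel range

    return (base_xs + delta).detach()
```

In the threat model sketched in the abstract, such poisons would be injected into the victim's training set with their original clean labels; after training on the poisoned data, the targeted out-domain sample is no longer rejected as low-confidence.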

Supplementary Material

MP4 File (full1241-video.mp4)
Presentation video for the CIKM 2023 paper "Manipulating Out-Domain Uncertainty Estimation in Deep Neural Networks via Targeted Clean-Label Poisoning".




    Published In

    CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
    October 2023
    5508 pages
    ISBN:9798400701245
    DOI:10.1145/3583780

    Publisher

    Association for Computing Machinery

    New York, NY, United States



    Author Tags

    1. out-domain detection
    2. uncertainty estimation


    Conference

    CIKM '23

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%


