๐ฎ
๐ฎ
The Ethereal
How to Craft Backdoors with Unlabeled Data Alone?
April 10, 2024 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: .gitignore, README.md, assets, main_linear.py, main_poison.py, main_pretrain.py, misc, poisoning_utils.py, requirements.txt, script, solo
Authors
Yifei Wang, Wenhan Ma, Stefanie Jegelka, Yisen Wang
arXiv ID
2404.06694
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.CR
Citations
0
Venue
arXiv.org
Repository
https://github.com/PKU-ML/nlb
โญ 5
Last Checked
3 months ago
Abstract
Relying only on unlabeled data, Self-supervised learning (SSL) can learn rich features in an economical and scalable way. As the drive-horse for building foundation models, SSL has received a lot of attention recently with wide applications, which also raises security concerns where backdoor attack is a major type of threat: if the released dataset is maliciously poisoned, backdoored SSL models can behave badly when triggers are injected to test samples. The goal of this work is to investigate this potential risk. We notice that existing backdoors all require a considerable amount of \emph{labeled} data that may not be available for SSL. To circumvent this limitation, we explore a more restrictive setting called no-label backdoors, where we only have access to the unlabeled data alone, where the key challenge is how to select the proper poison set without using label information. We propose two strategies for poison selection: clustering-based selection using pseudolabels, and contrastive selection derived from the mutual information principle. Experiments on CIFAR-10 and ImageNet-100 show that both no-label backdoors are effective on many SSL methods and outperform random poisoning by a large margin. Code will be available at https://github.com/PKU-ML/nlb.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal