Membership inference attacks (MIAs) are widely used to empirically assess the privacy risks of samples used to train a target machine learning model. State-of-the-art methods, however, require training hundreds of shadow models with the same size and architecture as the target model, solely to evaluate the privacy risk. While one might be able to afford this for small models, the cost often becomes prohibitive for medium and large models. We here instead propose a novel approach to identify the at-risk samples using only artifacts available during training, with little to no additional computational overhead. Our method analyzes individual per-sample loss traces and uses them to identify the vulnerable data samples. We demonstrate the effectiveness of our artifact-based approach through experiments on the CIFAR10 dataset, showing high precision in identifying vulnerable samples as determined by a SOTA shadow model-based MIA (LiRA[^1]). Impressively, our method matches the precision that SOTA MIAs achieve with one another, despite being orders of magnitude cheaper. We then show LT-IQR to outperform alternative loss aggregation methods, perform ablation studies on hyperparameters, and validate the robustness of our method to the choice of target metric. Finally, we study the evolution of the vulnerability score distribution throughout training as a metric for model-level risk assessment.
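For intuition, the sketch below shows one way a per-sample vulnerability score could be computed from recorded loss traces, assuming LT-IQR denotes the interquartile range of each sample's per-epoch training loss; the array layout and the function names (`lt_iqr`, `top_k_vulnerable`) are illustrative, not the exact implementation in this repository.

```python
import numpy as np

def lt_iqr(loss_traces: np.ndarray) -> np.ndarray:
    """Interquartile range of each sample's loss trace.

    loss_traces: shape (n_samples, n_epochs), the per-sample training
    loss recorded at each epoch. Returns one score per sample.
    """
    q75 = np.percentile(loss_traces, 75, axis=1)
    q25 = np.percentile(loss_traces, 25, axis=1)
    return q75 - q25

def top_k_vulnerable(scores: np.ndarray, k: int) -> np.ndarray:
    """Indices of the k samples with the highest vulnerability score."""
    return np.argsort(scores)[::-1][:k]
```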
To install dependencies, run:

```bash
pip install -r requirements.txt
```
Next, create a `config.py` file with the following and fill in the relevant paths:

```python
LOCAL_DIR = ""        # path to this folder
# paths used to store artifacts
STORAGE_DIR = ""
MY_STORAGE_DIR = ""
MODEL_DIR = ""
DATA_DIR = ""
```
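A filled-in `config.py` could look like the example below; the paths are purely illustrative placeholders, not locations the code expects.

```python
# config.py -- example values, adjust to your environment
LOCAL_DIR = "/home/alice/loss-traces"             # path to this folder
STORAGE_DIR = "/mnt/storage/loss-traces"          # general storage
MY_STORAGE_DIR = "/mnt/storage/loss-traces/alice"
MODEL_DIR = "/mnt/storage/loss-traces/models"     # trained models
DATA_DIR = "/mnt/storage/loss-traces/data"        # datasets (e.g. CIFAR10)
```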
The code blocks below, along with the hyperparameters used in our paper to train target models and shadow models and to run LiRA, are also available in scripts/walkthrough.sh.
To run training on the target model:

```bash
python $norm_tracker_path/main.py --track_grad_norms --gpu $gpu --dataset $dataset --seed 34568 --arch $arch --batchsize $bs --lr $lr --epochs $epochs --exp_id $exp_id
```
If averaging over duplicates (multiple "dual" models trained with different seeds), run a bash script with the following lines:

```bash
for ((i=0; i<$dual_count; i++))
do
    echo "Training dual model $i"
    start_time=$(date +%s)  # record wall-clock start
    taskset -c $cpus python $norm_tracker_path/main.py --track_grad_norms --gpu $gpu --dataset $dataset --seed 8574${i} --dual track_both_$i --arch $arch --batchsize $bs --lr $lr --epochs $epochs --exp_id $exp_id
    end_time=$(date +%s)
    elapsed_time=$((end_time - start_time))
    echo "Took $elapsed_time seconds"
done
```
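Once the dual runs have finished, their per-sample loss traces can be averaged. The exact artifact format is not shown here; assuming each run saved its traces as an (n_samples, n_epochs) array with np.save, the averaging step could look like this sketch (the file paths and helper name are hypothetical):

```python
import numpy as np

def average_dual_traces(trace_files):
    """Average per-sample loss traces over duplicate (dual) runs.

    Each file is assumed to hold an array of shape (n_samples, n_epochs);
    samples must be indexed identically across runs.
    """
    traces = np.stack([np.load(path) for path in trace_files], axis=0)
    return traces.mean(axis=0)  # shape (n_samples, n_epochs)
```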
To run attackR[^2], run the script scripts/attack_r.sh, which contains the following:
echo "Took $elapsed_time seconds"
```bash
for ((i=0; i<$dual_count; i++))
do
    echo "Starting attack-r dual ${i}"
    taskset -c $cpus python $norm_tracker_path/attacks/attack_r.py --gpu $gpu --exp_id $exp_id --target_id dual_track_both_$i --batchsize $bs
done
```
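With attack scores in hand, the artifact-based scores can be compared against the attack's view of which samples are most at risk. A minimal, illustrative precision-at-k check (assuming both methods produce one score per sample, higher meaning more vulnerable) might look like:

```python
import numpy as np

def precision_at_k(artifact_scores, mia_scores, k):
    """Fraction of the top-k samples under the artifact-based score that
    are also among the top-k most vulnerable samples under the MIA."""
    top_artifact = set(np.argsort(artifact_scores)[::-1][:k])
    top_mia = set(np.argsort(mia_scores)[::-1][:k])
    return len(top_artifact & top_mia) / k
```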
Footnotes

[^1]: N. Carlini, S. Chien, M. Nasr, S. Song, A. Terzis, and F. Tramer, "Membership inference attacks from first principles," in 2022 IEEE Symposium on Security and Privacy (SP). IEEE, 2022, pp. 1897–1914.

[^2]: J. Ye, A. Maddi, S. K. Murakonda, V. Bindschaedler, and R. Shokri, "Enhanced membership inference attacks against machine learning models," in Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, 2022, pp. 3093–3106.