In this repository, we propose a modular framework to run Property Inference Attacks on Machine Learning models.
You can get this package directly from pip:
python -m pip install propinfer
Please note that PyTorch is required to run this framework. Please find the installation instructions corresponding to your setup here.
This framework is designed to be modular for your experiments: you simply need to define subclasses of `Generator` and `Model` to represent your data source and your evaluated model, respectively.
From these, you can create a specific experiment configuration file. We suggest using hydra for your configurations, but parameters can also be passed in a standard `dict`.
Alternatively, you can extend the `Experiment` class.
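As an illustration, a toy data source and model could look like the sketch below. The method names and signatures shown (`sample`, `fit`, `predict_proba`) are assumptions made for illustration only; the actual abstract methods to override are documented in the `Generator` and `Model` base classes.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

from propinfer import Generator, Model


class GaussianGenerator(Generator):
    """Toy data source: the sensitive property shifts the mean of one feature."""

    def sample(self, label):
        # Hypothetical signature: `label` indicates whether the sampled
        # dataset should exhibit the property under attack.
        shift = 1.0 if label else 0.0
        x = np.random.normal(loc=(shift, 0.0), size=(256, 2))
        y = (x[:, 0] + x[:, 1] > shift).astype(int)
        return pd.DataFrame({'f0': x[:, 0], 'f1': x[:, 1], 'label': y})


class LogReg(Model):
    """Toy evaluated model wrapping scikit-learn's LogisticRegression."""

    def fit(self, data):
        # Hypothetical convention: the 'label' column is the prediction target.
        self.clf = LogisticRegression().fit(data.drop(columns=['label']), data['label'])
        return self

    def predict_proba(self, data):
        return self.clf.predict_proba(data.drop(columns=['label']))
```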
In the White-Box threat model, we have direct access to the model's parameters. For this case, [1] defines three different attacks:
- Simple meta-classifier attack
- Simple meta-classifier attack, with layer weights' sorting
- DeepSets attack
They are respectively designated by the keywords `Naive`, `Sort` and `DeepSets`.
In the Grey-Box and Black-Box threat models, we only have query access to the model (we do not know its parameters). In the Grey-Box setting we additionally know the model's architecture and hyperparameters; in the Black-Box setting we do not.
For the Grey-Box case, [2] describes two simple attacks:
- The Loss Test (represented by the `LossTest` keyword)
- The Threshold Test (represented by the `ThresholdTest` keyword)
[3] also proposes a meta-classifier-based attack, which we use for both the Grey-Box and Black-Box cases: these are respectively represented by the `GreyBox` and `BlackBox` keywords. In the latter case, we simply default to a pre-defined model architecture.
The framework is provided with a few simple unit tests. Run them with:
python -m unittest discover
to check the correctness of your installation.
To run a simple experiment, simply use the provided `run.py`. You can change any experiment parameter via the yaml config files inside the `config` folder.
To run an experiment with a specific config file, e.g. `my_experiments.yaml`, place it in `/config/experiments` and then run:
python run.py experiments=my_experiments
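For reference, a minimal hydra entry point in the spirit of `run.py` could look like the sketch below; this is a generic hydra pattern (with an assumed `config.yaml` root config), not a copy of the repository's actual `run.py`.

```python
import hydra
from omegaconf import DictConfig, OmegaConf


@hydra.main(config_path="config", config_name="config", version_base=None)
def main(cfg: DictConfig) -> None:
    # `experiments=my_experiments` on the command line selects which file
    # from config/experiments gets merged into the composed configuration.
    print(OmegaConf.to_yaml(cfg))
    # ... build the Generator, Model and Experiment from cfg here ...


if __name__ == "__main__":
    main()
```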
Alternatively, you can instantiate an `Experiment` object with a specific `Generator` and `Model`, and then run both targets and shadows before performing an attack.
It is possible to provide a list as a model hyperparameter: in that case, the framework will automatically optimise between the given options.
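Reusing the toy `GaussianGenerator` and `LogReg` classes from the sketch above, such a programmatic run could look roughly as follows. The constructor arguments and method names (`run_targets`, `run_shadows`, `run_whitebox`) are assumptions that illustrate the described workflow; the actual interface is documented in the `Experiment` class.

```python
from propinfer import Experiment

# Hypothetical constructor arguments and method names, shown only to
# illustrate the target/shadow workflow described above.
experiment = Experiment(
    generator=GaussianGenerator(),
    model=LogReg,
    n_targets=64,
    n_shadows=256,
    hyperparams={'max_iter': [100, 500]},  # a list triggers automatic optimisation
)
experiment.run_targets()   # train the target models
experiment.run_shadows()   # train the shadow models for the meta-classifier
accuracy = experiment.run_whitebox('DeepSets')  # attack keyword from the sections above
print(f'DeepSets attack accuracy: {accuracy:.3f}')
```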
If you use this library for your work, please cite our paper as follows:
V. Hartmann, L. Meynent, M. Peyrard, D. Dimitriadis, S. Tople and R. West, "Distribution Inference Risks: Identifying and Mitigating Sources of Leakage," 2023 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), Raleigh, NC, USA, 2023, pp. 136-149, doi: 10.1109/SaTML54575.2023.00018.
@INPROCEEDINGS{10136150,
author={Hartmann, Valentin and Meynent, Léo and Peyrard, Maxime and Dimitriadis, Dimitrios and Tople, Shruti and West, Robert},
booktitle={2023 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML)},
title={Distribution Inference Risks: Identifying and Mitigating Sources of Leakage},
year={2023},
volume={},
number={},
pages={136-149},
doi={10.1109/SaTML54575.2023.00018}
}
[1] Karan Ganju, Qi Wang, Wei Yang, Carl A. Gunter, and Nikita Borisov. 2018. Property Inference Attacks on Fully Connected Neural Networks using Permutation Invariant Representations. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security (CCS '18). Association for Computing Machinery, New York, NY, USA, 619–633. DOI:https://doi.org/10.1145/3243734.3243834
[2] Anshuman Suri, David Evans. 2021. Formalizing Distribution Inference Risks. 2021 Workshop on Theory and Practice of Differential Privacy, ICML. https://arxiv.org/abs/2106.03699
[3] Wanrong Zhang, Shruti Tople, Olga Ohrimenko. 2021. Leakage of Dataset Properties in Multi-Party Machine Learning. https://arxiv.org/abs/2006.07267