Model Stealing Attacks Against Inductive Graph Neural Networks

This is a modified PyTorch implementation of Model Stealing Attacks Against Inductive Graph Neural Networks, as described in our paper:

Yun Shen, Xinlei He, Yufei Han, Yang Zhang, Model Stealing Attacks Against Inductive Graph Neural Networks (IEEE S&P 2022)

The project provided by the author of the paper encounters some errors when configuring it, and I modified parts of it to run smoothly on my computer. I also provide a configured docker image if you need:

链接: https://pan.baidu.com/s/1fSIKJMCSoNIeY4aFCJWkcw?pwd=usmu 提取码: usmu

To import this image:

docker import - new_yixin_project < yixin_project.tar

Step 0: Setup the environments

conda env create --file environment.yaml
conda activate gnn_model_stealing_backend
# Install GraphGallery
# wget https://github.com/EdisonLeeeee/GraphGallery/archive/refs/tags/1.0.0.tar.gz
tar -zxvf 1.0.0.tar.gz
cd GraphGallery-1.0.0/
pip install -e . --verbose
cd ..

Step 1: Train the target models:

cd code;
python train_target_model.py --dataset citeseer_full --target-model gat  --num-hidden 256

# You can also run it with a specified gpu (e.g., gpu02):
python train_target_model.py --dataset citeseer_full --target-model gat  --num-hidden 256 --gpu 2

Error 1: TypeError: object of type 'DGLHeteroGraph' has no len()

When you start training, you may encounter the error:

TypeError: object of type 'DGLHeteroGraph' has no len()

This might be a hidden tiny bug of DGL=0.7. To solve this problem, you may consider degrading DGL to a lower version like 0.6x or consider the tiny change in file 'dgl/data/utils.py':(note that the path of file 'dgl/data/utils.py' is shown in your error message)

# See also https://github.com/dmlc/dgl/blob/master/python/dgl/data/utils.py#L78
num_data = len(dataset) => num_data = len(dataset.nodes())

Error 2: RuntimeError: cuda runtime error (10) : invalid device ordinal

If you are using the project provided by the author of the paper, you may encounter this error:

RuntimeError: cuda runtime error (10) : invalid device ordinal

This is because there is only one GPU in your computer, consider the tiny change in line 12 of train_target_model.py:

default=1 => default=0

Do the same thing in attack.py.

Note that we use the following datasets, target model architectures, and numbers of hidden neurons in our paper:

--dataset:      ['dblp', 'pubmed', 'citeseer_full', 'coauthor_phy', 'acm', 'amazon_photo']
--target-model: ['gat', 'gin', 'sage']
--num-hidden:   [64, 128, 256]

Step 2: Conduct the model stealing attacks

# Type I attack:
python3 attack.py --dataset citeseer_full --target-model-dim 256 --num-hidden 256 --target-model gat --surrogate-model gin --recovery-from prediction --query_ratio 1.0 --structure original

# Type II attack:
python3 attack.py --dataset citeseer_full --target-model-dim 256 --num-hidden 256 --target-model gat --surrogate-model gin --recovery-from prediction --query_ratio 1.0 --structure idgl

Explainations:

--dataset:          ['dblp', 'pubmed', 'citeseer_full', 'coauthor_phy', 'acm', 'amazon_photo']  # Datasets used to train the surrogate model
--target-model-dim: [64, 128, 256]                                                              # Numbers of hidden neurons for the target model
--num-hidden:       [64, 128, 256]                                                              # Numbers of hidden neurons for the surrogate model
--target-model:     ['gat', 'gin', 'sage']                                                      # Target model's architecuture
--surrogate-model:  ['gat', 'gin', 'sage']                                                      # Surrogate model's architecuture
--recovery-from:    ['prediction', 'embedding', 'projection']                                   # Target model's response
--query_ratio:      [0.1, 0.2, ..., 1.0]                                                        # Ratio of query graph used to train the surrogate model, e.g., 1.0 means we use the whole query graph (30% of the whole dataset); 0.5 means we use half of the query graph (15% of the whole dataset);
--structure:        ['original', 'idgl']                                                        # Type I/II attacks, 'original' means we use the original graph structure and 'idgl' means we use idgl to reconstruct the graph structure.

Notes

To train the target model, we randomly sample 60% of the nodes to construct the training graph;
To train the surrogate model, for each dataset, we split them into three parts.
- The first part consists of 20% randomly sampled nodes that are left;
- The second part consists of 30% randomly sampled nodes, forming our query graph $\mathbf{G}_Q$.
- The third part consists of the rest 50% of the nodes, functioning as the testing data for both $\mathcal{M}_T$ and $\mathcal{M}_S$.
We follow the official IDGL implementation from IDGL.

Cite

If you use this code, please consider citing the following papers:

@inproceedings{SHHZ22,
author = {Yun Shen and Xinlei He and Yufei Han and Yang Zhang},
title = {{Model Stealing Attacks Against Inductive Graph Neural Networks}},
booktitle = {{IEEE Symposium on Security and Privacy (S\&P)}},
publisher = {IEEE},
year = {2022}
}

@inproceedings{CWZ20,
author = {Yu Chen and Lingfei Wu and Mohammed J. Zaki},
title = {{Iterative Deep Graph Learning for Graph Neural Networks: Better and Robust Node Embeddings}},
booktitle = {{Annual Conference on Neural Information Processing Systems (NeurIPS)}},
publisher = {NeurIPS},
year = {2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
code		code
.gitignore		.gitignore
README.md		README.md
environment.yaml		environment.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Model Stealing Attacks Against Inductive Graph Neural Networks

Step 0: Setup the environments

Step 1: Train the target models:

Error 1: TypeError: object of type 'DGLHeteroGraph' has no len()

Error 2: RuntimeError: cuda runtime error (10) : invalid device ordinal

Step 2: Conduct the model stealing attacks

Notes

Cite

About

Releases

Packages

Languages

01WarpDrive/GNNStealing

Folders and files

Latest commit

History

Repository files navigation

Model Stealing Attacks Against Inductive Graph Neural Networks

Step 0: Setup the environments

Step 1: Train the target models:

Error 1: TypeError: object of type 'DGLHeteroGraph' has no len()

Error 2: RuntimeError: cuda runtime error (10) : invalid device ordinal

Step 2: Conduct the model stealing attacks

Notes

Cite

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages