Pipeline for "Benchmarking Graph Pretraining Methods" (BGPM)

Instructions

Why we build this pipeline?

There is a lot of parameters to set. It's not practical to set all of them manually in your main function.
There is a lot of results to record. It's necessary to record the results with corresponding settings.

How can this pipeline help us?

The ConfigParser will read the config file of your settings(e.g., learning rate,hidden_size) or default settings and set them for datasets or the corresponding executor of the model chosed by you.
The ./libgptb/log folder will restore the config settings and the ./libgptb/cache folder will restore the evaluate results with the exam_id. Then you can easily collect them.
The execution is something like the screeshots.

How is this pipeline organised?

You can find the whole process in libgptb/pipeline/pipeline.py

In brief the whole process will be like

ConfigParser will load the config file and default config file for the Dataset and Executor
Dataset will load the data and get some features(e.g., input feature dims)
Executor will load the chosen model and the model will be trained and evaluated with stored evaluation results.

How to use this pipeline to execute my model?

You will need to create you own Dataset and Executor.You can refer to my reference for DGI. We will talk more about this in the following sections.

Anything else I need to know about this pipeline?

The execution environment

I am inspired with the PyGCL repo and my CIKM23-HIEST repo.

[PyGCL Examples](PyGCL/examples at main · PyGCL/PyGCL (github.com)) may not contains all our target models, so the singularity image I prepared for it may not suitable for those excluded models.

In this case yo may need to build a new one.If you are using singularity for HPC then you can find a [guide](VAN-QIAN/CIKM23-HIEST: CIKM23-HIEST (github.com)) in my repo or ask for help.

Quick Start to execute your model.

Taking my implementation for DGI as an example. Let's see what you need to implement to execute your model.

1. Add default config

You may need to add default configs for your Executor and Model. Check libgptb/config/executors/DGIExecutor.json and libgptb/config/model/GCL/DGI.json (Here GCL is the task name) and create yours.

So far for the implementation for dataset we use the pytorch_geometric.datasets. If there is a need for default config maybe we will need to use our implemtation in the futhure.

The config parameters specified by the user have the highest priority. In other words the default config will be overwrittern.

2. Add your Dataset implementation

You may need to implemetn two methods get_data and get_data_feature . Then put it under the folder libgptb/data/dataset/

Then you will need to complete the import in libgptb/data/dataset/__init__.py

You can refer to my implementation libgptb/data/dataset/pyg_dataset.py

3. Add your Model implementation

You may need to define your model structure and put it under the folder libgptb/model.

Then you will need to complete the import in libgptb/model/__init__.py

You can refer to my implementation in libgptb/model/DGI.py

4. Add you executor implementation

You may need to define how to train and evaluate your model and put it under the folder libgptb/executors/.

Then you will need to complete the import in libgptb/executors/__init__.py

You can refer to my implementation in libgptb/executors/DGI_executor.py

5. Add your task configuration

You may beed to add your task configuration in libgptb/config/task_config.json

You can refer to my implementation in libgptb/config/task_config.json for DGI

"$YourModelName": {
            "dataset_class": "$YourDatasetName",
            "executor": "$YourExecutorName",
            "evaluator": "$YourEvaluatorName"
        }

These variables are what you imported in the above steps. Now everything should be ready for execution.

The execution environment

Docerfile_x86

FROM pytorch/pytorch:1.13.1-cuda11.6-cudnn8-devel
COPY requirements.txt /opt
RUN pip install -r /opt/requirements.txt \
    && pip install tensorboard \
    && pip install pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-1.13.0+cu116.html \
    && pip install torch_geometric\
    && pip install  dgl -f https://data.dgl.ai/wheels/cu116/repo.html \
    && pip install  dglgo -f https://data.dgl.ai/wheels-test/repo.html
ENV PATH=$PATH:/usr/local/cuda-11.6/bin LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda-11.6/lib64:/usr/lib/x86_64-linux-gnu

Dockerfile_ppc64le(For RPI DCS cluster)

FROM nvidia/cuda:11.2.2-cudnn8-devel-centos8
COPY requirements.txt /opt
RUN sed -i 's/mirrorlist/#mirrorlist/g' /etc/yum.repos.d/CentOS-* && sed -i 's|#baseurl=http://mirror.centos.org|baseurl=http://vault.centos.org|g' /etc/yum.repos.d/CentOS-*

RUN mkdir -p ~/miniconda3\
    #&& yum update && 
    && yum -y install wget gcc-gfortran cmake blas lapack git
RUN wget https://repo.anaconda.com/miniconda/Miniconda3-py39_23.5.2-0-Linux-ppc64le.sh -O ~/miniconda3/miniconda.sh\
    && bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3\
    &&rm -rf ~/miniconda3/miniconda.sh
RUN ~/miniconda3/bin/conda init bash 
ENV PATH=/root/miniconda3/bin:$PATH
RUN git clone https://github.com/pytorch/pytorch.git
RUN conda update -n base conda && conda install pandas astunparse numpy ninja pyyaml setuptools cmake cffi typing_extensions future six requests dataclasses\
    && conda install -c conda-forge liblapack && conda install -c anaconda nomkl
RUN cd pytorch && git checkout tags/v1.12.1 && git submodule sync &&\
    git submodule update --init --recursive --jobs 0 
    
RUN yum -y install which && export CMAKE_PREFIX_PATH=${CONDA_PREFIX:-"$(dirname $(which conda))/../"} && cd pytorch && TORCH_CUDA_ARCH_LIST=7.0 USE_CUDA=1 MAX_JOBS=8 python setup.py develop
RUN conda install -c https://ftp.osuosl.org/pub/open-ce/1.7.2-p10/1.7.2-p10/  openblas-devel 
## pytorch_scatter==2.0.8 pytorch_sparse==0.6.10 pytorch_geometric==2.1.0 tensorboard=2.9.1 pandas
RUN pip install --no-cache-dir torch-scatter==2.0.9 torch-sparse==0.6.14  -f https://data.pyg.org/whl/torch-1.12.1+cu102.html && pip install --no-cache-dir torch-geometric
RUN pip install --no-cache-dir pandas && conda install -c https://ftp.osuosl.org/pub/open-ce/1.7.2-p10/1.7.2-p10/ tensorboard=2.9.1

Usage

You can build from source with command docker build -f $Dockerfile path -t $YourImageName .All images can be found in the Onedrive
1. X86 X86
2. PPC64 IBM POWER
Unzip the $IMAGE.tar.gz and get the $IMAGE.tar file

Execute the following command

docker import - $YourImageName < $IMAGE.tar
docker run -v ~/BGPM/:/BGPM -w /BGPM --gpus all $YourImageName /bin/bash /BGPM/command.sh
# -v $HOST_PATH:$Container_Path means mounting the localfile to the container. 
# -w means setting the working dir
# --gpus all means using all GPU
# /bin/bash /BGPM/command.sh means using the /bin/bash of Container to execute the command.sh

You can have your own command.sh

python3 ./run_model.py --task GCL --model DGI --dataset Cora --config_file random_config/config_1

Requirements.txt

torch_geometric 
tqdm
numpy
scikit-learn
networkx
PyGCL

Singularity.def file

#Bootstrap is used to specify the agent,where the base image from,here localimage means to build from a local image
Bootstrap: localimage
## This is something like 'From' in DOCKERFILE to indicate the base image
From: ./pytorch_1.13.1-cuda11.6-cudnn8-devel.sif

# %files can be used to copy files from host into the image
# like 'COPY' in DOCKERFILE
# Here we copy the requirements.txt into the image, then we can use it to install the required dependencies.
%files
    ../PyGCL/requirements.txt /opt

# %post is used to build the new image
# Usage is same to shell.Here we used pip to install dependencies.
%post
    pip install -r /opt/requirements.txt
    pip install tensorboard
    pip install pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-1.13.0+cu116.html
    pip install  dgl -f https://data.dgl.ai/wheels/cu116/repo.html 
    pip install  dglgo -f https://data.dgl.ai/wheels-test/repo.html 

 
#% environment is used to set env_variables once the image starts
# These lines are necessary to load cuda
%environment
    export PATH=$PATH:/usr/local/cuda-11.6/bin
    export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda-11.6/lib64:/usr/lib/x86_64-linux-gnu

# singularity exec --writable-tmpfs --nv --nvccli ../../SIF/HIEST.sif python3 ./DGI_inductive.py

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
README.assets		README.assets
libgptb		libgptb
random_config		random_config
split		split
.gitignore		.gitignore
README.md		README.md
command.sh		command.sh
config1.json		config1.json
config_generation.py		config_generation.py
hyper_example.json		hyper_example.json
multi-model.sh		multi-model.sh
run.sh		run.sh
run_model.py		run_model.py
temp.sh		temp.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pipeline for "Benchmarking Graph Pretraining Methods" (BGPM)

Instructions

Why we build this pipeline?

How can this pipeline help us?

How is this pipeline organised?

How to use this pipeline to execute my model?

Anything else I need to know about this pipeline?

Quick Start to execute your model.

1. Add default config

2. Add your Dataset implementation

3. Add your Model implementation

4. Add you executor implementation

5. Add your task configuration

The execution environment

Docerfile_x86

Dockerfile_ppc64le(For RPI DCS cluster)

Usage

Requirements.txt

Singularity.def file

About

Releases

Packages

Contributors 4

Languages

GraphTL-Bench/BGPM

Folders and files

Latest commit

History

Repository files navigation

Pipeline for "Benchmarking Graph Pretraining Methods" (BGPM)

Instructions

Why we build this pipeline?

How can this pipeline help us?

How is this pipeline organised?

How to use this pipeline to execute my model?

Anything else I need to know about this pipeline?

Quick Start to execute your model.

1. Add default config

2. Add your Dataset implementation

3. Add your Model implementation

4. Add you executor implementation

5. Add your task configuration

The execution environment

Docerfile_x86

Dockerfile_ppc64le(For RPI DCS cluster)

Usage

Requirements.txt

Singularity.def file

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages