Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
config.yaml		config.yaml
dataset.py		dataset.py
model.py		model.py
multi_class.py		multi_class.py
train.py		train.py
train_distributed_cpu.py		train_distributed_cpu.py
train_distributed_gpu.py		train_distributed_gpu.py

README.md

Distributed Deepwalk in PGL

Deepwalk is an algorithmic framework for representational learning on graphs. Given any graph, it can learn continuous feature representations for the nodes, which can then be used for various downstream machine learning tasks. Based on PGL, we reproduce distributed deepwalk algorithms and reach the same level of indicators as the paper.

Datasets

The datasets contain two networks: BlogCatalog.

Dependencies

paddlepaddle>=2.0rc
pgl>=2.0

How to run

We adopt PaddlePaddle Fleet as our distributed training frameworks config.yaml is config file for deepwalk hyperparameter. In distributed CPU mode, we have 2 pservers and 2 trainers. We can use fleetrun to help you startup the parameter servers and model trainers.

For examples, train deepwalk mode on BlogCataLog dataset.

# train deepwalk in CPU mode.
python train.py
# train deepwalk in single GPU mode.
CUDA_VISIBLE_DEVICES=0 python train.py --use_cuda
# train deepwalk in multiple GPU mode.
CUDA_VISIBLE_DEVICES=0,1 fleetrun train_distributed_gpu.py
# train deepwalk in distributed CPU mode, log can be found in ./log.
python -m paddle.distributed.launch --server_num=2 --trainer_num=2 train_distributed_cpu.py

# multiclass task example
python multi_class.py

Hyperparameters

dataset: The citation dataset "BlogCatalog".
conf: The model config file, default is ./config.yaml .
epoch: Number of training epoch.

Experiment results

Dataset	model	Task	Metric	PGL Result	Reported Result
BlogCatalog	distributed deepwalk	multi-label classification	MacroF1	0.233	0.211

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deepwalk

deepwalk

README.md

Distributed Deepwalk in PGL

Datasets

Dependencies

How to run

Hyperparameters

Experiment results

Files

deepwalk

Directory actions

More options

Directory actions

More options

Latest commit

History

deepwalk

Folders and files

parent directory

README.md

Distributed Deepwalk in PGL

Datasets

Dependencies

How to run

Hyperparameters

Experiment results