From 5d7cb930e97c8ecfdd20d637d91a725935dc706d Mon Sep 17 00:00:00 2001 From: zhouyu5 Date: Thu, 13 Jun 2024 06:46:58 +0000 Subject: [PATCH] update readme --- benchmark/multi_gpu/training/README.md | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/benchmark/multi_gpu/training/README.md b/benchmark/multi_gpu/training/README.md index ad706ac93ec4..f9e491ccda8c 100644 --- a/benchmark/multi_gpu/training/README.md +++ b/benchmark/multi_gpu/training/README.md @@ -23,13 +23,13 @@ If you want to run your scripts inside a docker image, you could refer to the [d ### bare-metal setup -If you prefer to run your scripts directly on the bare-metal server. We recommend this [guide](https://intel.github.io/intel-extension-for-pytorch/xpu/latest/tutorials/features/DDP.html) and summarize it as follows: +If you prefer to run your scripts directly on the bare-metal server. We recommend the installation guidance provided by [Intel® Extension for PyTorch](https://intel.github.io/intel-extension-for-pytorch/index.html#installation?platform=gpu&version=v2.1.30%2bxpu&os=linux%2fwsl2&package=pip). The following are some key steps: -- Install [Intel® oneCCL Bindings for PyTorch](https://github.com/intel/torch-ccl) and [Intel® oneAPI Collective Communications Library (oneCCL)](https://www.intel.com/content/www/us/en/developer/tools/oneapi/oneccl.html) +- Install [Intel® oneAPI Base Toolkit](https://www.intel.com/content/www/us/en/developer/tools/oneapi/base-toolkit.html), indluding [Intel® oneAPI DPC++ Compiler](https://www.intel.com/content/www/us/en/developer/tools/oneapi/dpc-compiler.html), [Intel® oneAPI Math Kernel Library (oneMKL)](https://www.intel.com/content/www/us/en/docs/oneapi/programming-guide/2024-1/intel-oneapi-math-kernel-library-onemkl.html), [Intel® oneAPI Collective Communications Library (oneCCL)](https://www.intel.com/content/www/us/en/developer/tools/oneapi/oneccl.html), and [Intel® oneCCL Bindings for PyTorch](https://github.com/intel/torch-ccl). ```bash -# Install oneCCL package -sudo apt install intel-oneapi-ccl-devel=2021.12.0-309 +# Install oneCCL package on Ubuntu +sudo apt install -y intel-oneapi-dpcpp-cpp-2024.1=2024.1.0-963 intel-oneapi-mkl-devel=2024.1.0-691 intel-oneapi-ccl-devel=2021.12.0-309 # Install oneccl_bindings_for_pytorch pip install oneccl_bind_pt==2.1.300+xpu --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/ # Runtime Dynamic Linking @@ -49,3 +49,5 @@ Run benchmark, e.g. assuming you have `n` XPUs: ``` mpirun -n python training_benchmark_xpu.py --dataset ogbn-products --model edge_cnn --num-epochs 3 ``` + +This [guide](https://intel.github.io/intel-extension-for-pytorch/xpu/latest/tutorials/features/DDP.html) is also helpful for you to lauch DDP training on intel GPU. \ No newline at end of file