GitHub - intel/intel-optimization-for-horovod

Intel® Optimization for Horovod* is a distributed training framework for TensorFlow*. Its goal is to make distributed deep learning workloads faster and easier to run on Intel GPU devices. It is based on v0.28.1, the latest release of public Horovod.

Install

Hardware Requirements

Intel® Data Center GPU Max Series, Driver Version: 803

| Software | Installation requirement |
|---|---|
| Intel® oneAPI Base Toolkit | Install Intel® oneAPI Base Toolkit |
| TensorFlow | Install tensorflow 2.15.1 |
| Intel® Extension for TensorFlow* | Install Intel® Extension for TensorFlow* |
| System | Ubuntu 22.04, SUSE Linux Enterprise Server (SLES) 15 SP3/SP4 |
| Python | 3.9-3.11 |
| Pip | 19.0 or later (requires manylinux2014 support) |
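The Python and pip requirements above can be checked before installing anything. The following is a minimal sketch using only the standard library; the helper names `python_supported` and `pip_supported` are hypothetical and not part of this project.

```python
import re
import sys
from importlib import metadata

# Version bounds copied from the requirements table (inclusive Python range).
SUPPORTED_PYTHON = ((3, 9), (3, 11))
MIN_PIP = (19, 0)


def python_supported(version=None):
    """Return True if the (major, minor) version is within 3.9-3.11."""
    if version is None:
        version = sys.version_info
    major_minor = tuple(version[:2])
    low, high = SUPPORTED_PYTHON
    return low <= major_minor <= high


def pip_supported(pip_version):
    """Return True if a pip version string such as '23.3.1' is 19.0 or later."""
    match = re.match(r"(\d+)\.(\d+)", pip_version)
    if match is None:
        raise ValueError(f"unrecognized pip version: {pip_version!r}")
    return (int(match.group(1)), int(match.group(2))) >= MIN_PIP


if __name__ == "__main__":
    print("Python version supported:", python_supported())
    try:
        print("pip version supported:", pip_supported(metadata.version("pip")))
    except metadata.PackageNotFoundError:
        print("pip is not installed in this environment")
```

This only checks version numbers; it does not verify the GPU driver or the oneAPI toolkit.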

Install GPU Drivers

| OS | Intel GPU | Install Intel GPU Driver |
|---|---|---|
| Ubuntu 22.04, RedHat 8.6, SLES 15 SP3/SP4 | Intel® Data Center GPU Max Series | Refer to the Installation Guides for the latest driver installation. To install the verified driver version 803 for Intel® Data Center GPU Max Series or Intel® Data Center GPU Flex Series, append the specific version to each component name. |

Installation Channel:

Intel® Optimization for Horovod* can be installed through the following channels:

| PyPI | Source |
|---|---|
| Install from pip | Build from source |

Install for GPU

Intel® Optimization for Horovod* can be installed alongside different frameworks. You can choose Intel® Extension for TensorFlow* as the dependency.

Install Intel® Extension for TensorFlow* and Intel® Optimization for Horovod* with the following commands:

```
pip install tensorflow==2.15.1
pip install --upgrade intel-extension-for-tensorflow[xpu]
pip install intel-optimization-for-horovod
```
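After running the install commands, it can help to confirm which versions ended up in the environment. The sketch below uses the standard `importlib.metadata` module and assumes the distribution names match the names passed to `pip install` above; the helper `installed_versions` is hypothetical.

```python
from importlib import metadata

# Distribution names taken from the pip commands above.
REQUIRED = (
    "tensorflow",
    "intel-extension-for-tensorflow",
    "intel-optimization-for-horovod",
)


def installed_versions(packages=REQUIRED):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

A `not installed` line for any of the three packages suggests the corresponding `pip install` step did not complete in the active environment.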

Running Intel® Optimization for Horovod*

The example commands below show how to run distributed training.

To run on a machine with 2 Intel GPUs (4 tiles in total):
```
horovodrun -np 4 python train.py
```

To run on 4 machines with 2 GPUs (4 tiles) each:

```
horovodrun -np 16 -H server1:4,server2:4,server3:4,server4:4 python train.py
```
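In both commands above, the `-np` value is the total number of tiles and each `-H` entry gives one host with its tile count. The arithmetic can be sketched as a small helper that assembles the same command lines. `build_horovodrun_command` is a hypothetical name, and the sketch assumes one process per tile on identically configured hosts.

```python
import shlex


def build_horovodrun_command(hostnames, gpus_per_host, tiles_per_gpu,
                             script="train.py"):
    """Build a horovodrun argument list with one process per GPU tile.

    An empty hostnames list produces the single-machine form without -H.
    """
    slots = gpus_per_host * tiles_per_gpu  # processes per machine
    if slots < 1:
        raise ValueError("each host needs at least one tile")
    if not hostnames:
        return ["horovodrun", "-np", str(slots), "python", script]
    host_arg = ",".join(f"{name}:{slots}" for name in hostnames)
    total = slots * len(hostnames)
    return ["horovodrun", "-np", str(total), "-H", host_arg, "python", script]


if __name__ == "__main__":
    # Single machine with 2 GPUs of 2 tiles each.
    print(shlex.join(build_horovodrun_command([], 2, 2)))
    # Four machines with the same layout.
    servers = ["server1", "server2", "server3", "server4"]
    print(shlex.join(build_horovodrun_command(servers, 2, 2)))
```

Returning a list rather than a string keeps the result usable with `subprocess.run` without shell quoting concerns.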

Running Intel® Optimization for Horovod* with TensorFlow on Intel GPU

Training models with Intel® Extension for TensorFlow* is straightforward. Refer to the TensorFlow examples for more details.
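For orientation, here is a minimal sketch of what a `train.py` could look like. It follows the usual Horovod Keras pattern and is not taken from this repository's examples. It assumes that Intel GPU tiles are exposed to TensorFlow as `XPU` devices, uses synthetic data, and the helper `scaled_learning_rate` is hypothetical.

```python
def scaled_learning_rate(base_lr, world_size):
    """Scale a single-worker learning rate linearly with the worker count."""
    if world_size < 1:
        raise ValueError("world_size must be at least 1")
    return base_lr * world_size


def main():
    # Imported here so the helper above stays usable without the GPU stack.
    try:
        import tensorflow as tf
        import horovod.tensorflow.keras as hvd
    except ImportError as exc:
        print(f"Skipping training, missing dependency: {exc}")
        return

    hvd.init()

    # Assumption: each tile shows up as an "XPU" device; pin one per process.
    xpus = tf.config.list_physical_devices("XPU")
    if xpus:
        tf.config.set_visible_devices(xpus[hvd.local_rank()], "XPU")

    # Synthetic data, sharded so each worker reads a different slice.
    features = tf.random.normal((1024, 32))
    labels = tf.random.uniform((1024,), maxval=10, dtype=tf.int32)
    dataset = (
        tf.data.Dataset.from_tensor_slices((features, labels))
        .shard(hvd.size(), hvd.rank())
        .repeat()
        .batch(64)
    )

    model = tf.keras.Sequential([
        tf.keras.layers.Dense(64, activation="relu", input_shape=(32,)),
        tf.keras.layers.Dense(10),
    ])

    # Wrap the optimizer so gradients are averaged across all workers.
    optimizer = tf.keras.optimizers.SGD(scaled_learning_rate(0.01, hvd.size()))
    optimizer = hvd.DistributedOptimizer(optimizer)
    model.compile(
        optimizer=optimizer,
        loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    )

    # Broadcast initial weights from rank 0 so all workers start identically.
    callbacks = [hvd.callbacks.BroadcastGlobalVariablesCallback(0)]
    model.fit(
        dataset,
        steps_per_epoch=16,  # fixed so every worker runs the same step count
        epochs=1,
        callbacks=callbacks,
        verbose=1 if hvd.rank() == 0 else 0,
    )


if __name__ == "__main__":
    main()
```

Linear learning-rate scaling is a common heuristic rather than a requirement; treat the value as a starting point for tuning.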
