
Introduce hermetic CUDA in Google ML projects. #10673

Closed

Conversation


copybara-service bot commented on Mar 18, 2024

Introduce hermetic CUDA in Google ML projects.

  1. Hermetic CUDA rules allow building wheels with GPU support on a machine without GPUs, as well as running Bazel GPU tests on a machine that has only GPUs and the NVIDIA driver installed. When --config=cuda is provided in the Bazel options, Bazel downloads the CUDA, CUDNN and NCCL redistributions into the cache and uses them during the build and test phases (a usage sketch follows the setup steps below).

    Default location of CUDNN redistributions

    Default location of CUDA redistributions

    Default location of NCCL redistributions

  2. To include hermetic CUDA rules in your project, add the following to the WORKSPACE file of the downstream project that depends on XLA.

    Note: use @local_tsl instead of @tsl in the TensorFlow project.

    # Fetch the JSON metadata that lists the available CUDA and CUDNN redistributions.
    load(
        "@tsl//third_party/gpus/cuda/hermetic:cuda_json_init_repository.bzl",
        "cuda_json_init_repository",
    )

    cuda_json_init_repository()

    # Dictionaries of redistributions parsed from the JSON metadata.
    load(
        "@cuda_redist_json//:distributions.bzl",
        "CUDA_REDISTRIBUTIONS",
        "CUDNN_REDISTRIBUTIONS",
    )
    load(
        "@tsl//third_party/gpus/cuda/hermetic:cuda_redist_init_repositories.bzl",
        "cuda_redist_init_repositories",
        "cudnn_redist_init_repository",
    )

    # Set up one repository per CUDA redistribution; the archives are downloaded
    # only when a --config=cuda build or test actually needs them.
    cuda_redist_init_repositories(
        cuda_redistributions = CUDA_REDISTRIBUTIONS,
    )

    # Set up the CUDNN redistribution repository in the same way.
    cudnn_redist_init_repository(
        cudnn_redistributions = CUDNN_REDISTRIBUTIONS,
    )

    # Generate @local_config_cuda, which the CUDA build rules consume.
    load(
        "@tsl//third_party/gpus/cuda/hermetic:cuda_configure.bzl",
        "cuda_configure",
    )

    cuda_configure(name = "local_config_cuda")

    # Same pattern for NCCL: set up the redistribution repository and generate
    # @local_config_nccl.
    load(
        "@tsl//third_party/nccl/hermetic:nccl_redist_init_repository.bzl",
        "nccl_redist_init_repository",
    )

    nccl_redist_init_repository()

    load(
        "@tsl//third_party/nccl/hermetic:nccl_configure.bzl",
        "nccl_configure",
    )

    nccl_configure(name = "local_config_nccl")
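
A minimal usage sketch for step 1 above, not part of this PR: the target labels //app:gpu_wheel and //app:gpu_tests are hypothetical placeholders, and the commands assume the WORKSPACE setup from step 2 is already in place.

    # Build a GPU-enabled wheel on a machine with no GPU and no locally installed
    # CUDA toolkit; --config=cuda triggers the hermetic CUDA/CUDNN/NCCL downloads.
    bazel build --config=cuda //app:gpu_wheel

    # Run Bazel GPU tests on a machine that has only GPUs and the NVIDIA driver.
    bazel test --config=cuda //app:gpu_tests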
    

copybara-service bot force-pushed the test_616865795 branch 21 times, most recently from a70493b to b33ee0c on March 25, 2024 at 18:39
copybara-service bot force-pushed the test_616865795 branch 9 times, most recently from 3359d97 to d5243b0 on March 28, 2024 at 18:06
copybara-service bot force-pushed the test_616865795 branch 4 times, most recently from 7540bf5 to a64b2a7 on August 6, 2024 at 00:41

@cliffwoolley left a comment


As far as I can tell, all my feedback has been addressed now, thanks.
LGTM

copybara-service bot force-pushed the test_616865795 branch 4 times, most recently from cec9075 to 8bed0aa on August 7, 2024 at 01:26
copybara-service bot force-pushed the test_616865795 branch 15 times, most recently from 7436960 to 052ebcd on August 13, 2024 at 22:03
copybara-service bot closed this on Aug 14, 2024
copybara-service bot deleted the test_616865795 branch on August 14, 2024 at 18:11
copybara-service bot pushed a commit to tensorflow/serving that referenced this pull request on Aug 15, 2024
See instructions [here](https://github.com/openxla/xla/blob/main/docs/hermetic_cuda.md).

[XLA PR](openxla/xla#10673) introduced hermetic CUDA rules in ML OSS projects. Now all XLA downstream projects should call those rules in their workspaces. Also `.bazelrc` should use new environment variables for CUDA builds.

PiperOrigin-RevId: 663396732
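
As a hedged illustration of the `.bazelrc` change mentioned above: the variable names below follow the linked hermetic_cuda.md instructions, and the version strings are placeholders rather than values taken from this commit.

    # Sketch of a .bazelrc fragment pinning the hermetic CUDA and CUDNN versions.
    build:cuda --repo_env=HERMETIC_CUDA_VERSION="12.3.1"
    build:cuda --repo_env=HERMETIC_CUDNN_VERSION="9.1.1"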