Installing TE is so hard. #1284

Open
ZihaoZheng98 opened this issue Oct 23, 2024 · 5 comments
Labels
bug (Something isn't working), build (Build system)

Comments

@ZihaoZheng98

I am running into errors such as:

File "/TransformerEngine/build_tools/build_ext.py", line 91, in _build_cmake
    raise RuntimeError(f"Error when running CMake: {e}")
RuntimeError: Error when running CMake: Command '['/usr/local/lib/python3.10/dist-packages/cmake/data/bin/cmake', '-S', '/TransformerEngine/transformer_engine/common', '-B', '/TransformerEngine/build/cmake', '-DPython_EXECUTABLE=/usr/bin/python3.10', '-DPython_INCLUDE_DIR=/usr/include/python3.10', '-DCMAKE_BUILD_TYPE=Release', '-DCMAKE_INSTALL_PREFIX=/TransformerEngine/build/lib.linux-x86_64-3.10', '-DCMAKE_CUDA_ARCHITECTURES=70;80;89;90', '-Dpybind11_DIR=/usr/local/lib/python3.10/dist-packages/pybind11/share/cmake/pybind11', '-GNinja']' returned non-zero exit status 1.

and another from ninja -v.

@timmoon10
Collaborator

#355 (comment)

timmoon10 added the bug and build labels on Oct 23, 2024

@wplf
Contributor

wplf commented Oct 24, 2024

The real problem is shown above that message in the build output.
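
One way to locate that earlier message is to save the verbose build output to a file and search it from the top. The sketch below assumes the output was saved to a log file; the function name `first_error` and the search pattern are illustrative choices, not part of Transformer Engine, and the first line that mentions "error" is not guaranteed to be the root cause.

```shell
# Hypothetical helper: print the first line of a saved build log that
# mentions "error" (case-insensitive), prefixed with its line number.
first_error() {
  grep -n -i -m 1 'error' "$1"
}
```

For example, appending `2>&1 | tee te_build.log` to the install command (the file name is an assumption) and then running `first_error te_build.log` prints the earliest matching line, which is usually a better starting point than the final RuntimeError.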

@ZihaoZheng98
Author

Thanks, I will give it a try. @wplf @timmoon10

@klhhhhh

klhhhhh commented Nov 4, 2024

I agree!!! 😮‍💨

@saurabh-kataria

saurabh-kataria commented Nov 5, 2024

The following worked for me after trying several combinations:

# Keep temporary and build files under /tmp.
export TMPDIR=/tmp && export CMAKE_TEMP_DIR=/tmp && export BUILD_DIR=/tmp/build
mkdir -p "$TMPDIR" "$CMAKE_TEMP_DIR" "$BUILD_DIR"
# MAX_JOBS=1 limits the build to one parallel job; --no-deps skips installing dependencies.
TMP_DIR=$TMPDIR MAX_JOBS=1 CUDA_HOME=$CUDA_HOME CUDNN_PATH=$CUDNN_PATH CC=$CC CXX=$CXX pip -v install --cache-dir /tmp/pip-cache --no-deps git+https://github.com/NVIDIA/TransformerEngine.git@stable

where your environment variables are set beforehand to something like the following:

export CC=$GCCPATH/bin/gcc
export CXX=$GCCPATH/bin/g++
export CUDA_HOME=/home/$USER/cuda-12.3
export CUDNN_PATH=/home/$USER/cudnn-9.2.0/usr
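
Since the install command depends on all four variables pointing at real locations, it may help to check them before starting a long build. The following is a minimal sketch; the function name `check_te_env` is hypothetical and not part of Transformer Engine. It only tests that `CC` and `CXX` are executable files and that `CUDA_HOME` and `CUDNN_PATH` are directories, and it does not verify versions or contents.

```shell
# Hypothetical pre-flight check: report build-related variables that are
# unset or point at something that does not exist. Returns non-zero on any problem.
check_te_env() {
  rc=0
  # Compilers must be executable files.
  for var in CC CXX; do
    eval "val=\${$var:-}"
    if [ -z "$val" ] || [ ! -x "$val" ]; then
      echo "missing or not executable: $var=$val"
      rc=1
    fi
  done
  # Toolkit locations must be existing directories.
  for var in CUDA_HOME CUDNN_PATH; do
    eval "val=\${$var:-}"
    if [ -z "$val" ] || [ ! -d "$val" ]; then
      echo "missing or not a directory: $var=$val"
      rc=1
    fi
  done
  return $rc
}
```

Running `check_te_env && echo ok` after the exports prints `ok` only when every variable passes; otherwise each problem is listed on its own line.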


5 participants