Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pynvml version issue #2524

Open
4 tasks
apbose opened this issue Dec 2, 2024 · 1 comment
Open
4 tasks

pynvml version issue #2524

apbose opened this issue Dec 2, 2024 · 1 comment
Labels
bug Something isn't working triaged Issue has been triaged by maintainers

Comments

@apbose
Copy link

apbose commented Dec 2, 2024

System Info

Tensorrt-llm v0.14.0

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Looks like the pynvml version needs to be fixed in the latest release.
pip install tensorrt-llm leads to pynvml 12.0.0 but import tensorrt_llm leads to

  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/__init__.py", line 35, in <module>
    import tensorrt_llm
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/__init__.py", line 35, in <module>
    import tensorrt_llm.runtime as runtime
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/runtime/__init__.py", line 22, in <module>
    import tensorrt_llm.runtime as runtime
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/runtime/__init__.py", line 22, in <module>
    from .model_runner import ModelRunner
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/runtime/model_runner.py", line 26, in <module>
    from .model_runner import ModelRunner
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/runtime/model_runner.py", line 26, in <module>
    from .. import profiler
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/profiler.py", line 121, in <module>
    from .. import profiler
  File "/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/tensorrt_llm/profiler.py", line 121, in <module>
    if pynvml.__version__ < '11.5.0' or driver_version < '526':
    if pynvml.__version__ < '11.5.0' or driver_version < '526':
AttributeError: module 'pynvml' has no attribute '__version__'
AttributeError: module 'pynvml' has no attribute '__version__'

Downgrading pynvml to 11.5.0 leads the error to go away.

Expected behavior

import tensorrt-llm should work without error

actual behavior

It is leading to the error above.

additional notes

Maybe there should be additional handling of the pynvml versions in the code

@apbose apbose added the bug Something isn't working label Dec 2, 2024
@nv-guomingz
Copy link
Collaborator

Hi @apbose we've rootcaused this issue which raised by pynvml updating on Dec.2.
The coming 0.15 release will fix it.
If you wanna to fix it by yourself, just modify this line https://github.com/NVIDIA/TensorRT-LLM/blob/main/requirements.txt#L14 to pynvml~=11.5.0

@nv-guomingz nv-guomingz added the triaged Issue has been triaged by maintainers label Dec 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working triaged Issue has been triaged by maintainers
Projects
None yet
Development

No branches or pull requests

2 participants