[feature request] Can we add H200 in infer_cluster_key() method? #2552

Open
dongluw opened this issue Dec 9, 2024 · 2 comments
Labels
triaged Issue has been triaged by maintainers

Comments

dongluw commented Dec 9, 2024

https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/auto_parallel/cluster_info.py

The code here handles a list of devices that doesn't include H200. Can we add H200 (and potentially GB200) to that list as well?
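
To illustrate the request, here is a minimal sketch of a device-name lookup with H200 and GB200 included. It is not the code in cluster_info.py: the function name `infer_cluster_key_sketch`, the `KNOWN_PRODUCTS` list, and the returned keys are all hypothetical, and the device name is assumed to be a plain string such as the one a GPU runtime reports.

```python
import re
from typing import Optional

# Hypothetical product list for illustration only; the real list lives in
# tensorrt_llm/auto_parallel/cluster_info.py and may be structured differently.
KNOWN_PRODUCTS = ["A100", "H100", "H200", "GB200"]


def infer_cluster_key_sketch(device_name: str) -> Optional[str]:
    """Return the first known product that appears as a whole token in device_name."""
    for product in KNOWN_PRODUCTS:
        # Lookarounds keep "H200" from matching inside a longer token such as "H2000".
        pattern = rf"(?<![A-Za-z0-9]){re.escape(product)}(?![A-Za-z0-9])"
        if re.search(pattern, device_name):
            return product
    return None  # Unknown device: the caller has to decide how to handle it.


# Example device-name strings (assumed, not taken from the issue).
print(infer_cluster_key_sketch("NVIDIA H200"))
print(infer_cluster_key_sketch("NVIDIA GeForce RTX 4090"))
```

With a list-driven lookup like this, supporting a new device would mostly come down to adding one entry, plus whatever per-device data the real module keeps for each key.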

renjie0 commented Dec 11, 2024

What is infer_cluster_key used for?

nv-guomingz self-assigned this Dec 18, 2024
nv-guomingz added the triaged (Issue has been triaged by maintainers) label Dec 18, 2024
@nv-guomingz
Collaborator

Hi @dongluw, thanks for the suggestion. We'll add H200 to the infer_cluster_key() method, and this update will be available in the next release.
