
Prune torch cuda arch list to match upstream #306

Draft · wants to merge 1 commit into base: main
Conversation

@isuruf (Member) commented Dec 26, 2024:

Checklist

  • Used a personal fork of the feedstock to propose changes
  • Bumped the build number (if the version is unchanged)
  • Reset the build number to 0 (if the version changed)
  • Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
  • Ensured the license file is being packaged.

@conda-forge-admin (Contributor) commented:

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found it was in an excellent condition.

I do have some suggestions for making it better though...

For recipe/meta.yaml:

  • ℹ️ The recipe is not parsable by parser conda-souschef (grayskull). This parser is not currently used by conda-forge, but may be in the future. We are collecting information to see which recipes are compatible with grayskull.
  • ℹ️ The recipe is not parsable by parser conda-recipe-manager. The recipe can only be automatically migrated to the new v1 format if it is parseable by conda-recipe-manager.

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12499338117. Examine the logs at this URL for more detail.

@hmaarrfk (Contributor) commented:

Removing 8.9 seems like a mistake to me; that architecture targets the RTX 4090 and the RTX 6000 Ada.

@isuruf (Member, Author) commented Dec 26, 2024:

Is there a difference between the code generated for 8.6 and the code generated for 8.9? Removing 8.9 here does not mean that those targets are dropped, just that 8.9 compute capability devices will use code compiled for 8.6.
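For readers unfamiliar with why dropping sm_89 doesn't drop 4090 support: cubins (SASS) are binary-compatible within a major compute capability, and PTX can be JIT-compiled for newer devices. The sketch below is a simplified, hypothetical illustration of that selection rule, not the actual CUDA driver logic; the function name and arch lists are invented for the example.

```python
# Simplified sketch of how code is picked from a CUDA fat binary.
# Rule illustrated: a device prefers the newest embedded cubin with the
# SAME major compute capability and minor <= its own; failing that, it
# JIT-compiles the newest embedded PTX whose arch is <= the device's.
# This is an illustration only, not the driver's implementation.

def pick_code(device_cc, sass_archs, ptx_archs):
    """All arguments use (major, minor) tuples, e.g. (8, 9) for sm_89."""
    # Cubins are binary-compatible within a major version, upward in minor.
    usable_sass = [a for a in sass_archs
                   if a[0] == device_cc[0] and a <= device_cc]
    if usable_sass:
        return ("sass", max(usable_sass))
    # Otherwise fall back to JIT-compiling the newest usable PTX.
    usable_ptx = [a for a in ptx_archs if a <= device_cc]
    if usable_ptx:
        return ("ptx", max(usable_ptx))
    return None  # no compatible code embedded

# An 8.9 device (e.g. RTX 4090) with a build that embeds sm_80/sm_86/sm_90
# cubins but no sm_89 still runs the sm_86 cubin rather than failing:
print(pick_code((8, 9), sass_archs=[(8, 0), (8, 6), (9, 0)],
                ptx_archs=[(9, 0)]))
# → ('sass', (8, 6))
```

This is exactly the "8.9 devices will use code compiled for 8.6" behavior described above; whether that binary-compatible path leaves performance on the table is the open question in this thread.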

@hmaarrfk (Contributor) commented:

> Is there a difference between the code generated for 8.6 vs code generated for 8.9? Removing 8.9 here does not mean that those targets are dropped. Just that 8.9 compute capability devices will use code compiled for 8.6.

Honestly, I'm not sure, and it is difficult to tell without further inspection. How can one tell in practice?

@isuruf (Member, Author) commented Dec 26, 2024:

You have to run benchmarks to see whether it makes a difference. Since this is what the upstream wheels do, and most people use those wheels, I think we would have seen reports upstream if there were a difference.

@hmaarrfk (Contributor) commented:

> I think we would have seen reports upstream if there was a difference.

I think very few people have the skills to compile PyTorch from source.

I can run a benchmark in a week if you can prepare one for me. I have a 4090.
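A comparison like the one offered here needs little more than a timing harness run against both builds (one with sm_89, one without). The sketch below is a hypothetical starting point, not a prepared benchmark: the harness itself is plain Python, and the torch workload in the comment is an assumption about what one might time on a 4090.

```python
# Minimal timing harness for comparing the same workload across two
# PyTorch builds. The harness is plain Python; the GPU workload you pass
# in is up to you (see the usage sketch below).
import time
import statistics

def benchmark(fn, warmup=3, iters=20):
    """Return the median wall-clock time in seconds of fn() over iters runs."""
    for _ in range(warmup):   # warm up caches / lazy initialization first
        fn()
    times = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        times.append(time.perf_counter() - start)
    return statistics.median(times)

# Hypothetical usage on a GPU box (assumes torch with CUDA; not run here).
# torch.cuda.synchronize() is needed so the timer sees the kernel finish:
#   import torch
#   a = torch.randn(4096, 4096, device="cuda")
#   t = benchmark(lambda: (torch.mm(a, a), torch.cuda.synchronize()))
#   print(f"median matmul time: {t:.4f}s")
```

Running the same script under the pruned and unpruned builds and comparing medians would be one concrete way to answer the 8.6-vs-8.9 question on real hardware.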

@hmaarrfk (Contributor) commented:

I think there are a few differences between the capabilities of 8.6 and 8.9, so we should keep both:
https://docs.nvidia.com/cuda/cuda-c-programming-guide/#arithmetic-instructions

They seem hard to trigger, though, and designing benchmarks for them would take us a lot of time (I think).

@hmaarrfk (Contributor) commented:

It also seems that many lower-end 40-series cards would benefit, not just “those with the highest budgets”.

@hmaarrfk hmaarrfk marked this pull request as draft December 27, 2024 13:37
@rgommers left a comment:

Benchmarks are always great for informing decisions on what to build, so 👍🏼 from me for waiting on those. In principle this change seems correct, though; in the absence of evidence that making different choices provides significant value, it seems like the upstream TORCH_CUDA_ARCH_LIST should always be matched.

4 participants