[Feature]: Request for Ascend NPU support #6368
Comments
That's great to hear @xuedinge233! Having hardware instances available for CI would be great for maintaining support |
Yeah, we're working on it, and we will upload the code as soon as there is progress in development. |
Any update about this? |
We are making steady progress and expect to deliver the first version of the code in a couple of weeks. |
Is it planned to support NPU+CANN environments where the CPU is the aarch64 architecture? |
Looking forward to NPU support on vLLM! |
Yes, NPU+MindIE+CANN is planned, and both aarch64 and x86_64 will be supported. |
Excuse me, is there any progress? |
I mean NPU+vLLM+CANN on an aarch64 CPU architecture. The NPU device would be an Atlas 300I Duo or similar. |
@xuedinge233 |
The latest progress updates can be found in #8054. |
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you! |
🚀 The feature, motivation and pitch
Background
Currently, the project supports various hardware accelerators such as GPUs, but there is no support for NPUs. Adding NPU support could significantly benefit users who have access to these devices, enabling faster and more efficient computation.
Reference Materials
Ascend is a full-stack AI computing infrastructure for industry applications and services based on Huawei Ascend processors and software. For more information about Ascend, see Ascend Community.
CANN (Compute Architecture for Neural Networks), developed by Huawei, is a heterogeneous computing architecture for AI.
PyTorch has officially announced support for Ascend NPU through the PrivateUse1 dispatch key. Please see the PrivateUse1 tutorial here.
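To make the PrivateUse1 point a bit more concrete, here is a minimal sketch of how a project might decide whether to try the NPU path at all. It assumes the Ascend adapter is installed as the `torch_npu` package and that importing it registers the `"npu"` device type. The helper names `module_installed` and `choose_device_type` are hypothetical, made up for illustration, and are not part of vLLM or PyTorch.

```python
import importlib.util
from typing import Callable, Optional


def module_installed(name: str) -> bool:
    """Return True if `name` can be located, without importing it."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec can raise for broken or half-initialised modules.
        return False


def choose_device_type(
    installed: Callable[[str], bool] = module_installed,
) -> Optional[str]:
    """Hypothetical helper: pick the device-type string to try first.

    Returns None when PyTorch itself is missing.
    """
    if not installed("torch"):
        return None
    # Assumption: the Ascend adapter ships as `torch_npu` and exposes
    # the "npu" device type through PrivateUse1 once it is imported.
    if installed("torch_npu"):
        return "npu"
    return "cpu"


if __name__ == "__main__":
    print(choose_device_type())
```

This only checks that the packages are installed, not that a device is actually present. A real integration would still need a runtime availability check after importing the adapter.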
Specific Request
We would like to request the addition of NPU support to the project. To help achieve this goal, we will contribute code and provide feedback. This request may require additional resources and effort on your part, but we hope to get your help if possible.
Alternatives
No response
Additional context
No response