[Feature]: Request for Ascend NPU support #6368

xuedinge233 · 2024-07-12T08:51:07Z

🚀 The feature, motivation and pitch

Background

Currently, the project supports various hardware accelerators such as GPUs, but there is no support for NPUs. Adding NPU support could significationly benefit users who have access to these devices, enabling faster and more efficient computations.

Reference Materials

Ascend is a full-stack AI computing infrastructure for industry applications and services based on Huawei Ascend processors and software. For more information about Ascend, see Ascend Community.

CANN (Compute Architecture of Neural Networks), developped by Huawei, is a heterogeneous computing architecture for AI.

Pytorch has officially announced support for Ascend NPU (through key PrivateUse1), please see the PrivateUse1 tutorial here.

Specific Request

we would like to request the addition of support for NPUs within the project. In order to achieve this goal, we will contributing code and providing feedback. This request may additional resources and effort on your part, but we hope to get your help if possible.

Alternatives

No response

Additional context

No response

mgoin · 2024-07-12T20:05:36Z

Duplicate of #1606 and #6066

xuedinge233 · 2024-07-15T02:38:37Z

Duplicate of #1606 and #6066

Thinks for your reply, we are ready to develop Ascend ADAPTS for the vllm-project, after completion, we will provide you with a public Ascend prototype

mgoin · 2024-07-15T15:36:38Z

That's great to hear @xuedinge233! Having hardware instances available for CI would be great for maintaining support

xuedinge233 · 2024-07-16T01:51:23Z

Yeah, we‘re working on it, and we will upload it as soon as there is progress in the development

BrightXiaoHan · 2024-07-31T06:08:18Z

Yeah, we‘re working on it, and we will upload it as soon as there is progress in the development

Any update about this?

xuedinge233 · 2024-08-01T03:32:31Z

Yeah, we‘re working on it, and we will upload it as soon as there is progress in the development

Any update about this?

We are making steady progress and expect to deliver the first version of the code in a couple of weeks.

seoibiubiu · 2024-08-13T02:08:56Z

Is it planned to support NPU+CANN environments where the CPU is the aarch64 architecture?

dogeeelin · 2024-08-13T03:48:28Z

Looking forward to NPU support on vLLM！

MengqingCao · 2024-08-19T11:42:43Z

Is it planned to support NPU+CANN environments where the CPU is the aarch64 architecture?

Yes, NPU+MindIE+CANN is in plan, and both aarch64 and x86_64 will be supported.

ccly1996 · 2024-08-26T15:20:37Z

Yeah, we‘re working on it, and we will upload it as soon as there is progress in the development

Any update about this?

We are making steady progress and expect to deliver the first version of the code in a couple of weeks.

Excuse me, is there any progress?

seoibiubiu · 2024-08-30T03:34:28Z

Is it planned to support NPU+CANN environments where the CPU is the aarch64 architecture?

Yes, NPU+MindIE+CANN is in plan, and both aarch64 and x86_64 will be supported.

I mean NPU+VLLM+CANN, and based on aarch64 CPU architecture. NPU device is Atlas 300I Duo or others.

shilei4260 · 2024-08-30T07:04:29Z

@xuedinge233
How long will it take to release the vllm version of npu

xuedinge233 · 2024-08-31T07:08:47Z

@xuedinge233 How long will it take to release the vllm version of npu

The latest updates on the progress can be found here #8054

github-actions · 2024-12-01T02:14:54Z

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

xuedinge233 added the feature request label Jul 12, 2024

jeejeelee mentioned this issue Jul 24, 2024

[Feature]: vllm support for Ascend NPU #6728

Open

wangshuai09 mentioned this issue Aug 20, 2024

[RFC]: Add Ascend NPU as a new backend #7692

Open

Yikun mentioned this issue Sep 28, 2024

[Roadmap] vLLM Roadmap Q3 2024 #5805

Closed

46 tasks

github-actions bot added the stale label Dec 1, 2024

xuedinge233 closed this as completed Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Request for Ascend NPU support #6368

[Feature]: Request for Ascend NPU support #6368

xuedinge233 commented Jul 12, 2024

mgoin commented Jul 12, 2024

xuedinge233 commented Jul 15, 2024

mgoin commented Jul 15, 2024

xuedinge233 commented Jul 16, 2024

BrightXiaoHan commented Jul 31, 2024

xuedinge233 commented Aug 1, 2024 •

edited

Loading

seoibiubiu commented Aug 13, 2024

dogeeelin commented Aug 13, 2024

MengqingCao commented Aug 19, 2024

ccly1996 commented Aug 26, 2024 •

edited

Loading

seoibiubiu commented Aug 30, 2024

shilei4260 commented Aug 30, 2024

xuedinge233 commented Aug 31, 2024

github-actions bot commented Dec 1, 2024

[Feature]: Request for Ascend NPU support #6368

[Feature]: Request for Ascend NPU support #6368

Comments

xuedinge233 commented Jul 12, 2024

🚀 The feature, motivation and pitch

Background

Reference Materials

Specific Request

Alternatives

Additional context

mgoin commented Jul 12, 2024

xuedinge233 commented Jul 15, 2024

mgoin commented Jul 15, 2024

xuedinge233 commented Jul 16, 2024

BrightXiaoHan commented Jul 31, 2024

xuedinge233 commented Aug 1, 2024 • edited Loading

seoibiubiu commented Aug 13, 2024

dogeeelin commented Aug 13, 2024

MengqingCao commented Aug 19, 2024

ccly1996 commented Aug 26, 2024 • edited Loading

seoibiubiu commented Aug 30, 2024

shilei4260 commented Aug 30, 2024

xuedinge233 commented Aug 31, 2024

github-actions bot commented Dec 1, 2024

xuedinge233 commented Aug 1, 2024 •

edited

Loading

ccly1996 commented Aug 26, 2024 •

edited

Loading