[Neuron][Kernel] NKI-based flash-attention kernel with paged KV cache #1560

Triggered via pull request December 18, 2024 03:47
@liangfu edited #11277
Status Success
Total duration 13s
Artifacts

cleanup_pr_body.yml

on: pull_request_target
update-description
5s

Annotations

1 warning
update-description
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636