Replies: 1 comment 3 replies
-
It's probably the result of a |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I'm writing a new backend for NPUs and I encountered this VIEW operation when trying to run LLaMA and Qwen2. I assume this is a part of KV-cache looking at the source tensor shape. However I don't understand what the target tensor. The stride doesn't make sense to me as it is not a permutation of the normal strides.
I expect the "normal" stride be
2, 256, 8192, 8192
Where did the 512 come from and where did the 8192 go?The following is the debug print I got when running Qwen2 1.5B.
Can someone help me understand where this view operation comes from and what is it supposed to do?
Beta Was this translation helpful? Give feedback.
All reactions