I'm working on an NPU backend, and the NPU supports 1D and 2D convolutions natively. AFAIK GGML implements convolution as a combination of im2col and matmul. I looked into the backend implementation and found `graph_plan_create`, but it is documented as not being used right now.
Is there a place where I can walk and rewrite the compute graph, so that I can transform im2col + matmul into the corresponding Conv1D and Conv2D operations?
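To make the question concrete, here is a rough sketch of the kind of rewrite pass I have in mind. Everything in it is a hypothetical stand-in: the `toy_*` types, op names, and functions are made up for illustration and are not ggml's actual API. It also assumes that the im2col node keeps the kernel in `src[0]` and the input in `src[1]`, and that reshape nodes may sit between the im2col and the matmul. Only the 2D case is shown.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration only, NOT ggml's real types. */
enum toy_op {
    TOY_OP_NONE,     /* leaf tensor (weights or input) */
    TOY_OP_IM2COL,
    TOY_OP_RESHAPE,
    TOY_OP_MUL_MAT,
    TOY_OP_CONV_2D   /* native convolution on the NPU */
};

#define TOY_MAX_SRC 2

struct toy_node {
    enum toy_op      op;
    struct toy_node *src[TOY_MAX_SRC];
};

/* Follow src[0] through reshape nodes, which only change the view of the data. */
static struct toy_node *toy_skip_reshape(struct toy_node *n) {
    while (n != NULL && n->op == TOY_OP_RESHAPE) {
        n = n->src[0];
    }
    return n;
}

/*
 * Walk the node list and turn every MUL_MAT that consumes an IM2COL
 * (possibly through reshapes) into a single CONV_2D node.
 * Assumption: the IM2COL node stores the kernel in src[0] and the input in src[1].
 * Returns the number of nodes that were rewritten.
 */
static int toy_fuse_conv(struct toy_node **nodes, int n_nodes) {
    assert(nodes != NULL);
    int fused = 0;
    for (int i = 0; i < n_nodes; i++) {
        struct toy_node *mm = nodes[i];
        if (mm->op != TOY_OP_MUL_MAT) {
            continue;
        }
        for (int s = 0; s < TOY_MAX_SRC; s++) {
            struct toy_node *col = toy_skip_reshape(mm->src[s]);
            if (col == NULL || col->op != TOY_OP_IM2COL) {
                continue;
            }
            /* The other matmul operand is the (possibly reshaped) kernel. */
            struct toy_node *kernel = toy_skip_reshape(mm->src[1 - s]);
            mm->op     = TOY_OP_CONV_2D;
            mm->src[0] = kernel;
            mm->src[1] = col->src[1]; /* original, un-unrolled input */
            fused++;
            break;
        }
    }
    return fused;
}
```

Note that this sketch leaves the now-unused im2col and reshape nodes in the list. A real pass would also have to skip or remove them, and carry over the stride, padding, and dilation parameters.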