Add support for a variety of (data tiled) convolution strategies #63

Merged
qedawkins merged 12 commits into shark_frozen from shark_staging on Jul 20, 2023

Conversation

qedawkins

No description provided.

Adds the ability to use the transform dialect strategy builders behind
`iree-spirv-enable-transform-dialect-jit`, mirroring the existing flags
for LLVMCPU/GPU.
DetachElementwiseFromNamedOps is used to replace pre-filled outputs with
a zero-fill + add for contraction ops (gemm, conv). This extends the
pattern to the convolution interface to allow non-named cases; renaming
the pass can happen as a follow-up if/when this is upstreamed.
This works towards pad-fused convolution strategies.
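
For reference, a minimal before/after sketch of what the detaching looks like on a convolution. The shapes are made up and a plain `linalg.generic` stands in for the detached elementwise add; this is an illustration of the rewrite, not IR taken from this PR:

```mlir
// Before: the convolution accumulates directly into the pre-filled bias tensor.
func.func @before(%input: tensor<1x18x18x16xf32>, %filter: tensor<3x3x16x32xf32>,
                  %bias: tensor<1x16x16x32xf32>) -> tensor<1x16x16x32xf32> {
  %0 = linalg.conv_2d_nhwc_hwcf
         {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
         ins(%input, %filter : tensor<1x18x18x16xf32>, tensor<3x3x16x32xf32>)
         outs(%bias : tensor<1x16x16x32xf32>) -> tensor<1x16x16x32xf32>
  return %0 : tensor<1x16x16x32xf32>
}

// After: the output is detached into a zero-fill, the convolution, and an
// elementwise add of the original bias.
#map = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2, d3)>
func.func @after(%input: tensor<1x18x18x16xf32>, %filter: tensor<3x3x16x32xf32>,
                 %bias: tensor<1x16x16x32xf32>) -> tensor<1x16x16x32xf32> {
  %zero = arith.constant 0.0 : f32
  %empty = tensor.empty() : tensor<1x16x16x32xf32>
  %fill = linalg.fill ins(%zero : f32)
            outs(%empty : tensor<1x16x16x32xf32>) -> tensor<1x16x16x32xf32>
  %conv = linalg.conv_2d_nhwc_hwcf
            {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
            ins(%input, %filter : tensor<1x18x18x16xf32>, tensor<3x3x16x32xf32>)
            outs(%fill : tensor<1x16x16x32xf32>) -> tensor<1x16x16x32xf32>
  %add = linalg.generic {
           indexing_maps = [#map, #map, #map],
           iterator_types = ["parallel", "parallel", "parallel", "parallel"]}
           ins(%conv, %bias : tensor<1x16x16x32xf32>, tensor<1x16x16x32xf32>)
           outs(%empty : tensor<1x16x16x32xf32>) {
  ^bb0(%a: f32, %b: f32, %out: f32):
    %sum = arith.addf %a, %b : f32
    linalg.yield %sum : f32
  } -> tensor<1x16x16x32xf32>
  return %add : tensor<1x16x16x32xf32>
}
```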
Removes the restriction to named ops only on the convolution matcher,
matching on the convolution interface instead.
Adds a builder for mapping data tiled convolutions to a direct
tensor core approach (mainly targeting WMMA for now). This generates
a loop over the input channels, promotion of the padded input tile to
shared memory, and then two more inner loops over the convolution
filter window.
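
As a rough illustration of that loop structure only, the skeleton below shows one accumulator-carried loop nest; all names, shapes, and tile sizes are assumptions, the extracted slice merely stands in for the tile the strategy promotes to shared memory, and the actual tensor core computation is elided to a comment:

```mlir
func.func @conv_loop_structure(%input: tensor<1x34x34x64xf16>,
                               %filter: tensor<3x3x64x64xf16>,
                               %init: tensor<1x32x32x64xf32>) -> tensor<1x32x32x64xf32> {
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index
  %c3 = arith.constant 3 : index
  %c16 = arith.constant 16 : index
  %c64 = arith.constant 64 : index
  // Outer loop over the input channels, stepped by the promoted tile size.
  %result = scf.for %ic = %c0 to %c64 step %c16
      iter_args(%acc = %init) -> (tensor<1x32x32x64xf32>) {
    // Stand-in for the padded input tile that gets promoted to shared memory.
    %tile = tensor.extract_slice %input[0, 0, 0, %ic] [1, 34, 34, 16] [1, 1, 1, 1]
        : tensor<1x34x34x64xf16> to tensor<1x34x34x16xf16>
    // Two inner loops over the convolution filter window.
    %acc_kh = scf.for %kh = %c0 to %c3 step %c1
        iter_args(%a0 = %acc) -> (tensor<1x32x32x64xf32>) {
      %acc_kw = scf.for %kw = %c0 to %c3 step %c1
          iter_args(%a1 = %a0) -> (tensor<1x32x32x64xf32>) {
        // The remaining computation is a small matmul over the promoted tile
        // that maps onto WMMA-sized tensor core operations (elided here).
        scf.yield %a1 : tensor<1x32x32x64xf32>
      }
      scf.yield %acc_kw : tensor<1x32x32x64xf32>
    }
    scf.yield %acc_kh : tensor<1x32x32x64xf32>
  }
  return %result : tensor<1x32x32x64xf32>
}
```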
Adds a direct SIMT (fma/dot4) conv approach without shared memory.
Allows matching non-named contraction ops, using the same
`MatmulOpCaptures` struct that exists for matmul and batch matmul
…r strategies
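
As an example of the kind of op this now covers, here is a non-named contraction written as a plain `linalg.generic` (shapes and element types are illustrative); with interface-based matching it can be captured the same way a named `linalg.matmul` would be:

```mlir
#map_lhs = affine_map<(d0, d1, d2) -> (d0, d2)>   // (M, N, K) -> (M, K)
#map_rhs = affine_map<(d0, d1, d2) -> (d2, d1)>   // (M, N, K) -> (K, N)
#map_out = affine_map<(d0, d1, d2) -> (d0, d1)>   // (M, N, K) -> (M, N)
func.func @generic_matmul(%lhs: tensor<128x64xf16>, %rhs: tensor<64x256xf16>,
                          %init: tensor<128x256xf32>) -> tensor<128x256xf32> {
  %0 = linalg.generic {
         indexing_maps = [#map_lhs, #map_rhs, #map_out],
         iterator_types = ["parallel", "parallel", "reduction"]}
         ins(%lhs, %rhs : tensor<128x64xf16>, tensor<64x256xf16>)
         outs(%init : tensor<128x256xf32>) {
  ^bb0(%a: f16, %b: f16, %acc: f32):
    // f16 multiply with f32 accumulation, as a non-named contraction body.
    %a_ext = arith.extf %a : f16 to f32
    %b_ext = arith.extf %b : f16 to f32
    %mul = arith.mulf %a_ext, %b_ext : f32
    %sum = arith.addf %acc, %mul : f32
    linalg.yield %sum : f32
  } -> tensor<128x256xf32>
  return %0 : tensor<128x256xf32>
}
```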

Maps data tiled matmuls to tensor core, assuming no distribution is
expected to happen over the inner tile.
Additionally improves the distribution of pad copies for the convolution
strategy by greedily distributing over the outermost dimensions of the copy.
Currently pad fusion only applies to named convolutions. This allows it
to apply based on the interface.
Sub-32-bit types are handled on the SPIR-V side by introducing
bitcasts to and from i32 and bubbling them toward the center of the kernel
in the hope that they cancel. This adds a pattern for a bitcast on the result
of an `scf.if`, which arises from the way padding is handled (a `transfer_read`
in the `then` branch, with the `else` branch yielding a splat constant).
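
A minimal sketch of the IR shape this pattern targets, with illustrative names, shapes, and element types; the pattern sinks the bitcast into both branches of the `scf.if` so it has a chance to cancel with neighboring bitcasts:

```mlir
// Before: padding handled with an scf.if that either reads the tile or yields
// a splat, with the sub-32-bit value bitcast to i32 afterwards.
func.func @before(%cond: i1, %src: tensor<128x128xf16>,
                  %i: index, %j: index) -> vector<2xi32> {
  %pad = arith.constant 0.0 : f16
  %if = scf.if %cond -> (vector<4xf16>) {
    %read = vector.transfer_read %src[%i, %j], %pad {in_bounds = [true]}
        : tensor<128x128xf16>, vector<4xf16>
    scf.yield %read : vector<4xf16>
  } else {
    %splat = arith.constant dense<0.0> : vector<4xf16>
    scf.yield %splat : vector<4xf16>
  }
  %cast = vector.bitcast %if : vector<4xf16> to vector<2xi32>
  return %cast : vector<2xi32>
}

// After: the bitcast is pushed into both branches of the scf.if.
func.func @after(%cond: i1, %src: tensor<128x128xf16>,
                 %i: index, %j: index) -> vector<2xi32> {
  %pad = arith.constant 0.0 : f16
  %if = scf.if %cond -> (vector<2xi32>) {
    %read = vector.transfer_read %src[%i, %j], %pad {in_bounds = [true]}
        : tensor<128x128xf16>, vector<4xf16>
    %cast = vector.bitcast %read : vector<4xf16> to vector<2xi32>
    scf.yield %cast : vector<2xi32>
  } else {
    %splat = arith.constant dense<0> : vector<2xi32>
    scf.yield %splat : vector<2xi32>
  }
  return %if : vector<2xi32>
}
```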
@qedawkins merged commit f8976fd into shark_frozen on Jul 20, 2023
8 of 11 checks passed
@qedawkins deleted the shark_staging branch on July 20, 2023 16:51
powderluv pushed a commit that referenced this pull request Sep 25, 2023
I had built these out while root causing a regression, then cleaned them
up for mainlining. They are controlled by compile-time variables for the
moment; we can do something smarter later.