Support for sample-parallelism (MMV) #789
Draft
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Overview
Introduces support for additional parallelism along the spatial feature map dimension, i.e., processing multiple 'pixels' or 'samples' simultaneously. For some layers (e.g., MVAU), this functionality already existed in the HLS back-end and is now integrated on the compiler side. For others (e.g., RTL SWG), back-end functionality is extended.
Currently, this parallelism is controlled by the
M
attribute and requires existing folding factors (SIMD
,PE
,parallel_window
) to be maxed out first.Components
M
controls the number of parallel input samples and output windows, i.e.,mmv_in=M*1
andmmv_out=M*k_h*k_w
.out_dim_w divisible by M
,out_dim_w / M > 2
(1D conv is normalized to H=1).MultiChanData
internally, so that external interface is still ahls::stream<ap_uint<>>
.MultiChanData
internally, so that external interface is still ahls::stream<ap_uint<>>
.Tests
Integration into existing unit tests:
test_fpgadataflow_slidingwindow_rtl
New:
test_convert_to_hls_conv_mmv
(tests a graph of multiple layers including padding and pooling)