v17
- Analyse, Recalculate: Fix bug that broke 16 bit processing (patches by feisty2).
- Analyse, Recalculate: Support block sizes of 64x32, 64x64, 128x64, and 128x128.
- Analyse, Recalculate: Make dct=1..4 a bit faster on x86.
- FlowFPS, FlowInter: Add AVX2 code.
- Analyse, Recalculate: Fix SATD functions used when dct=5..10 and the input is 16 bit.
- Analyse, Recalculate: Allow dct=5..10 with blocks larger than 16x16.
The Windows DLLs now contain FFTW 3.3.5, which includes new AVX2 code.