Restructure criterion benchmarks into groups #8

smu160 · 2024-07-17T22:13:51Z

This is just a draft PR, but the gist is that we can restructure the benchmarks so that criterion automatically produces the charts for us (please see the attached screenshot). To see them, run cargo bench and then open target/criterion/all-equal-u8/report/index.html in your browser.

Let me know your thoughts. Thank you!!

smu160 · 2024-07-17T22:16:10Z

Another thing to note is that I have the lengths increasing by powers of ten: 1, 10, 100, and so on. That way, you reduce the number of benches you actually run, but you still see how performance changes with respect to the CPU cache. We want to see how well it does when the data fits in the L1, L2, and L3 caches, and beyond.

LaihoE · 2024-07-18T13:13:42Z

Looks good! Maybe we could hit the powers of two? 32..64..128? And what do you think about using throughput as the y-axis for the plot?
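For reference, throughput here means bytes processed per unit of time. Criterion has its own throughput setting on benchmark groups; the sketch below does not use it and only shows the arithmetic with the standard library. The name `throughput_gib_per_s` is a placeholder for illustration, not something from this repository.

```rust
use std::time::Duration;

/// Converts one measurement (bytes processed, elapsed time) into GiB/s.
/// Hypothetical helper, not part of the crate or of criterion.
fn throughput_gib_per_s(bytes: u64, elapsed: Duration) -> f64 {
    // 1 GiB = 2^30 bytes.
    const GIB: f64 = (1u64 << 30) as f64;
    (bytes as f64 / GIB) / elapsed.as_secs_f64()
}
```

Plotting this value against input length, instead of raw time, keeps the curve roughly flat while the data fits in one cache level and makes drops at cache boundaries easier to spot.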

smu160 · 2024-07-19T03:11:04Z

@LaihoE Hi,

I just finished restructuring the rest of the benchmarks. That took longer than I expected! I should have looked into using macros or something, but I figure this is okay for now.

At this point I think a few things need to be reviewed/scrutinized:

Throughput for y-axis: I'm not sure if criterion allows us to use those figures in the automated plots. The PlotConfiguration seems limited to changing the scale of the axes. We may have to resort to external plotting (e.g., matplotlib).

Input lengths: For testing purposes, each benchmark group iterates over the input lengths with the equivalent of for (int len = 1; len < (1 << 11); len *= 10). Perhaps we should create a const array of the input lengths of interest instead? Powers of 10, powers of 2, primes, etc.?
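The loop quoted above is written C-style. A minimal Rust sketch of the same sequence follows, assuming the lengths are `usize`; the function name `input_lengths` is hypothetical and the actual benchmark code may be structured differently.

```rust
/// Lengths produced by the C-style loop quoted above: start at 1,
/// multiply by 10 each step, stop before reaching 1 << 11 (2048).
/// Hypothetical helper for illustration only.
fn input_lengths() -> Vec<usize> {
    const LIMIT: usize = 1 << 11;
    std::iter::successors(Some(1usize), |&len| len.checked_mul(10))
        .take_while(|&len| len < LIMIT)
        .collect()
}
```

With this bound the sequence is 1, 10, 100, 1000, so only four lengths are exercised per group.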

Excited to hear your thoughts! Thank you!

LaihoE · 2024-07-19T11:25:37Z

@smu160 Looks great, thanks for the big effort!

As for the plots, I found them to be not very flexible or aesthetically pleasing, which is why I went with Python originally. To be honest, I don't know what to do here.

As for testing lengths: might as well test them all? Also, (1 << 11) is not very large; I think we could go much bigger. My CPU, for example, has the following cache sizes:

64 KB L1 cache = 64 000 bytes
512 KB L2 cache = 512 000 bytes
64 MB L3 cache = 64 000 000 bytes.
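To make the point about going bigger concrete, here is a rough sketch that generates power-of-two lengths until the data no longer fits in the largest cache listed above, and labels which level a given footprint would fit in. The cache sizes are the ones quoted in this comment; the function names are hypothetical, and the classification ignores everything else that competes for cache space.

```rust
/// Cache sizes quoted in the comment above, in bytes (L1, L2, L3).
const CACHE_BYTES: [usize; 3] = [64_000, 512_000, 64_000_000];

/// Powers of two from 1 up to and including the first length whose
/// footprint (len * elem_size) exceeds `max_bytes`.
fn pow2_lengths_past(max_bytes: usize, elem_size: usize) -> Vec<usize> {
    let mut lengths = Vec::new();
    let mut len = 1usize;
    loop {
        lengths.push(len);
        if len * elem_size > max_bytes {
            break;
        }
        len *= 2;
    }
    lengths
}

/// Name of the smallest listed cache level that can hold `bytes`,
/// or "RAM" if none can. A simplification: real capacity is shared.
fn smallest_fitting_level(bytes: usize) -> &'static str {
    const NAMES: [&str; 3] = ["L1", "L2", "L3"];
    for (i, &cap) in CACHE_BYTES.iter().enumerate() {
        if bytes <= cap {
            return NAMES[i];
        }
    }
    "RAM"
}
```

For u8 inputs, `pow2_lengths_past(CACHE_BYTES[2], 1)` ends at 1 << 26, the first power of two larger than the quoted L3 size, compared with the current upper bound of 1 << 11.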

smu160 · 2024-07-19T16:22:18Z

@LaihoE I think we can have both. The benchmarks are structured a bit better now, for several reasons. One is that you can now easily filter for the specific benchmarks you want. For example:

cargo bench -- all-equal-u8/SIMD

will run only the all_equal SIMD benchmarks.

The aesthetics of criterion's plots aren't ideal, so we can use a Python script to parse the JSON output and plot it with matplotlib. The CSV output seems to be deprecated by criterion.

With respect to slice sizes, I can just hardcode a few lengths that include powers of two, non-powers of two, primes, etc.
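A hardcoded list of that kind could look like the sketch below. The specific values and the names `INPUT_LENGTHS` and `is_prime` are illustrative assumptions, not the lengths that ended up in the benchmarks.

```rust
/// Illustrative input lengths mixing powers of two, primes, and
/// lengths that are neither. Values are assumptions for this sketch.
const INPUT_LENGTHS: [usize; 11] = [
    1, 7, 16, 31, 100, 127, 1000, 1024, 8191, 65_536, 1_000_000,
];

/// Trial-division primality check, used only to sanity-check the list.
fn is_prime(n: usize) -> bool {
    if n < 2 {
        return false;
    }
    let mut d = 2;
    while d * d <= n {
        if n % d == 0 {
            return false;
        }
        d += 1;
    }
    true
}
```

Lengths that are not a multiple of the SIMD width (the primes in particular) exercise the remainder handling, while the powers of two exercise the fully vectorized path.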

Restructure criterion benchmarks into groups

33ee72f

smu160 marked this pull request as draft July 17, 2024 22:18

smu160 added 10 commits July 18, 2024 16:48

Restructure contains benchmarks

7b6303d

Update eq benchmark using critertion structure

d3c5411

Restructure filter benchmark

56d1501

Restructure benchmarks for find

0acaf79

Restructure max and is_sorted benchmarks

b867296

Remove outer loop in max benchmark

49a278f

Restructure min benchmark

e5798b0

Restructure position benchmark

37c18ea

Fix position and min benchmarks

4a93dd3

Remove outer for loop for position benchmark

6d317d6

smu160 marked this pull request as ready for review July 19, 2024 02:29

Fix all_equal benchmark name

79152eb

smu160 mentioned this pull request Jul 19, 2024

Significantly improve performance by using chunks_exact(SIMD_LEN) and remainder() #10

Open

LaihoE merged commit ca798b2 into LaihoE:master Jul 21, 2024
1 check passed
