MLP benchmarks #152

Merged (18 commits into slyalin:mlir, Jul 30, 2024)

Conversation

@adam-smnk (Collaborator) commented Jul 25, 2024

Usage: ./tools/mlir_bench/mlp_bench.sh

TODO:

  • test on cluster
  • add support for matmul without transpose (see the sketch after this list)
  • investigate broadcast error when type is not f32
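
For context, here is a minimal sketch, not taken from this PR and with all names hypothetical, of the kind of MLP workload such a benchmark would exercise and of why the "matmul without transpose" item matters: torch.nn.Linear stores its weight as (out_features, in_features) and computes x @ weight.T, so the lowered graph contains a matmul with a transposed operand, whereas an explicit matmul against a weight already laid out as (in_features, out_features) does not.

# Hypothetical sketch, not part of this PR: a small MLP and an equivalent
# layer written with a plain (non-transposed) matmul.
import torch

hidden = 256

# nn.Linear computes x @ weight.T with weight shaped (out_features, in_features),
# so its lowering carries a transposed matmul operand.
mlp = torch.nn.Sequential(
    torch.nn.Linear(hidden, hidden),
    torch.nn.ReLU(),
    torch.nn.Linear(hidden, hidden),
)

# The same layer with the weight stored as (in_features, out_features)
# needs no transpose in the matmul.
class PlainMatmulLayer(torch.nn.Module):
    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.randn(in_features, out_features))

    def forward(self, x):
        return torch.relu(x @ self.weight)

x = torch.randn(1, hidden)
print(mlp(x).shape, PlainMatmulLayer(hidden, hidden)(x).shape)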

@adam-smnk (Collaborator, Author):

Works fine on the cluster; I was able to gather initial numbers for f32.
There's some lowering issue with the other data types; I'll look into it next.

@slyalin (Owner) commented Jul 29, 2024:

investigate broadcast error when type is not f32

@adam-smnk, have you unlocked non-f32 types in the MLIR conversion?

I noticed accuracy issues in the PyTorch layer tests when running the min_max tests. They reproduce only when OV_MLIR is enabled.

@adam-smnk (Collaborator, Author):

have you unlocked non-f32 types in the MLIR conversion?

I think it was primarily a mistake in my testing setup. Otherwise, I just need to relax the matchers to accept any type.

# Build a (shape, type) pair for every model input.
inputs = [(ov.PartialShape(shapes), ov_type) for shapes in input_shapes]
# Convert the torch module to an OpenVINO model and serialize it to IR.
ov_model = ov.convert_model(torch_seq, input=inputs)
ov.save_model(ov_model, f"{file_name}")
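
The names input_shapes, ov_type, torch_seq, and file_name come from code not shown in this excerpt. A self-contained, hypothetical completion (the exact values below are assumptions, not what the benchmark uses) could look like the following; switching ov_type away from f32 is where the broadcast/lowering issues discussed above were observed:

import torch
import openvino as ov

# Hypothetical stand-ins for the values defined elsewhere in the script.
torch_seq = torch.nn.Sequential(torch.nn.Linear(256, 256), torch.nn.ReLU())
input_shapes = [[1, 256]]
ov_type = ov.Type.f32   # e.g. ov.Type.bf16 to exercise the non-f32 path
file_name = "mlp_f32.xml"

# Same three lines as the excerpt above, now runnable on their own.
inputs = [(ov.PartialShape(shapes), ov_type) for shapes in input_shapes]
ov_model = ov.convert_model(torch_seq, input=inputs)
ov.save_model(ov_model, file_name)
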
@slyalin (Owner) commented:

Just FYI: ov.save_model will partially compress weights from f32 to f16. "Partially" because the decision is made for each constant individually, based on the range of values in that constant. In the IR this is represented as two operations: Constant(f16) -> Convert(f32). This is the default behavior of ov.save_model for all models, to save disk space, and it shouldn't affect final inference because this pair of operations is constant-folded during model compilation.
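
If uncompressed f32 weights are ever wanted in the saved IR, for example to rule the compression out while debugging accuracy, ov.save_model exposes a compress_to_fp16 flag (defaulting to True in recent OpenVINO releases). A minimal sketch with a throwaway model, exact signatures may vary by OpenVINO version:

import torch
import openvino as ov

# Convert a trivial module and save it without the default f32 -> f16
# weight compression, so constants stay as plain f32 in the IR.
model = ov.convert_model(torch.nn.Linear(8, 8),
                         input=[(ov.PartialShape([1, 8]), ov.Type.f32)])
ov.save_model(model, "linear_uncompressed.xml", compress_to_fp16=False)
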

@slyalin (Owner) commented:

It doesn't affect inference precision; it is just a form of weight compression.

@adam-smnk merged commit 705477e into slyalin:mlir on Jul 30, 2024
13 of 29 checks passed