Printing of `Apron` values very slow #1513

michael-schwarz · 2024-06-15T19:23:16Z

With verbose mode, we observe terrible slowdowns that seem to be due to very slow pretty-printing.

~~Currently, they are not considered which is bad for long-running benchmarks where privPrecCompare may take a long time or even fail to terminate.~~

michael-schwarz · 2024-06-26T14:41:54Z

Turns out almost all the time is spent constructing and printing pretty_diffs:

      (if D.leq v1 v2 then nil else dprintf "diff: %a\n" D.pretty_diff (v1, v2))
      ++
      (if D.leq v2 v1 then nil else dprintf "reverse diff: %a\n" D.pretty_diff (v2, v1))

This takes several orders of magnitude more time than just doing the comparisons.

michael-schwarz · 2024-06-26T14:53:43Z

Somehow, though, the culprit does not seem to be the actual implementation of pretty_diff inside apronDomain.ml: that can be commented out to return Pretty.nil and the slowdown remains.

michael-schwarz · 2024-06-26T14:59:45Z

The culprit seems to in fact be D.pretty () calls...

michael-schwarz · 2024-06-26T15:04:46Z

Which in turn come from Apron, so there must be something super slow happening in Apron.

michael-schwarz · 2024-06-26T15:30:25Z

The flamegraph, on the other hand, hints at the majority of the time (90%) being spent in camlGoblintCil__Pretty__breakString_148, with almost all of that time being spent in camlStdlib__Bytes__sub_341.

Perf data in Firefox format: https://gigamove.rwth-aachen.de/en/download/039038f988c7dfbc813e63447a35a0b1 (accessible until 07.10).

This does not seem to be a blocker for my thesis, as I will generate missing reports manually in non-verbose mode, but we should look at it.

sim642 · 2024-06-26T17:43:04Z

Apparently the Pretty.text primitive directly uses that but does something involved under the hood to break the text up further (based on \n). It's just that the massive strings of Apron constraints take so long to split.

I suspect the Apron printer is using Format with some small output width and causing line breaks where we don't want them anyway. I think I've noticed this in tracing output before as well, but never looked into it.

michael-schwarz · 2024-06-26T18:04:40Z

Maybe something can be done here by using String.split_on_char?

See e.g. the performance numbers reported here: https://gist.github.com/mooreryan/220b47feea6b253630dab09c4b6ed18c

michael-schwarz · 2024-06-26T18:09:45Z

https://github.com/ocaml/ocaml/blob/107e8d3851f840e00c9c94118d70b74c06995d56/stdlib/string.ml#L225

This seems to avoid all of the intermediate allocation.

michael-schwarz · 2024-06-26T19:04:01Z

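    (* Split on newlines up front and join the pieces with explicit line breaks,
       so that Pretty.text only ever sees short, newline-free strings. *)
    let custom_text (s:string) =
      let lines = String.split_on_char '\n' s in
      let rec doit = function
        | [] -> nil
        | [x1] -> text x1
        | x1::xs -> (text x1) ++ line ++ doit xs
      in
      doit lines

    let pretty () (x:t) = custom_text (show x)

instead of

    let pretty () (x:t) = text (show x)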

This already goes from me running out of patience and killing it (when it was not even 1/4 done) after

real    6m20,054s
user    6m18,380s
sys     0m1,252s

to

real    2m25,576s
user    2m13,037s
sys     0m2,895s

That solution obviously is not tail-recursive, but even if we allow some overhead for that it still seems to be the clearly superior approach.
My guess would be that the performance of this changed when OCaml switched to immutable strings by default with 4.06 (?).

michael-schwarz · 2024-06-26T19:20:54Z

This is actually almost a case of tail_mod_cons, but that's not clear to the compiler because Cons is hidden behind (++).
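A rough sketch of how a tail-recursive variant could look, folding over the lines instead of recursing. This has not been tried against the real Pretty module: the doc type and its constructors below are a hypothetical stand-in so the snippet is self-contained, and render exists only to check the result. Whether Pretty copes equally well with the left-nested concatenation this produces is an assumption.

```ocaml
(* Hypothetical stand-in for Pretty's document type, for illustration only. *)
type doc = Nil | Text of string | Line | Concat of doc * doc

let nil = Nil
let text s = Text s
let line = Line
let ( ++ ) a b = Concat (a, b)

(* Tail-recursive: List.fold_left carries the document built so far as the
   accumulator, so the stack does not grow with the number of lines. *)
let custom_text (s : string) : doc =
  match String.split_on_char '\n' s with
  | [] -> nil (* unreachable: split_on_char always returns at least one element *)
  | first :: rest ->
    List.fold_left (fun acc l -> acc ++ line ++ text l) (text first) rest

(* Naive renderer for the stand-in type, used only to inspect the output. *)
let rec render = function
  | Nil -> ""
  | Text s -> s
  | Line -> "\n"
  | Concat (a, b) -> render a ^ render b
```

With the stand-in, render (custom_text s) gives back s unchanged, so the only thing that changes is how the string is cut into pieces.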

michael-schwarz · 2024-06-26T19:32:09Z

I let the original version run to the end:

real    20m10,677s
user    19m51,909s
sys     0m4,897s

So the difference is almost an order of magnitude.

michael-schwarz · 2024-07-09T08:21:29Z

After making the analysis more precise with a patch for #1535, we once again have a case where the precision comparison takes >33h, while the analysis takes only 15min for the most involved setting.

michael-schwarz · 2024-07-09T08:21:53Z

Also, it allocates over 50GB of RAM.

michael-schwarz · 2024-07-09T13:16:18Z

OK, this was still pretty-printing somehow behaving irrationally; if that is disabled, it finishes in a matter of minutes rather than 35 hours.

michael-schwarz added the benchmarking label Jun 15, 2024

michael-schwarz changed the title ~~Make privPrecCompare respect timeouts~~ Printing of Apron values very slow Jun 26, 2024

michael-schwarz added the performance Analysis time, memory usage label Jun 26, 2024

This was referenced Jun 27, 2024

Pretty.text very slow and uses a lot of memory goblint/cil#169

Open

Tracking Benchmark Changes for Thesis #1417

Draft

sim642 mentioned this issue Jun 27, 2024

Refactor Apron Printables #1527

Merged
