GH-123516: Improve JIT memory consumption by invalidating cold executors #124443

savannahostrowski · 2024-09-24T16:40:16Z

This PR succeeds #123402 and reworks the approach to use the eval breaker for the invalidation call instead of executor creation or gc (thanks @markshannon!). While experimenting, I tried three thresholds: 10k, 100k, and 1 million runs. The benchmarks for 100k and 1 million were the most promising. Here are some relevant stats for quick reference:

100k
- -2.4% memory
- Roughly the same performance-wise

1 million

After chatting with @brandtbucher, I've opted to open this PR with the 100k threshold. One thing to note is that we are potentially a little too liberal in invalidating executors with this threshold, but with the lack of movement in performance and a more substantial decrease in memory usage, it seemed justified. We can continue to iterate here and consider making this tunable in the future.
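To make the idea concrete, here is a minimal, self-contained sketch of threshold-driven invalidation. It is a toy model and not the code in this PR: the `Executor` and `Interp` structs, the per-executor `warm` flag, the countdown field, and every function name are hypothetical stand-ins, and `sweep_pending` only imitates an eval breaker bit. The assumption is that each executor run marks the executor warm and decrements a countdown; when the countdown reaches zero a sweep is requested, and the sweep drops executors that were not run since the previous sweep.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration; not CPython's real structs. */
typedef struct Executor {
    struct Executor *next;
    bool warm;   /* set whenever the executor runs */
    bool valid;  /* cleared when the executor is invalidated */
} Executor;

typedef struct {
    Executor *head;        /* linked list of live executors */
    int runs_until_sweep;  /* counts down from SWEEP_THRESHOLD */
    bool sweep_pending;    /* imitates an eval breaker bit */
} Interp;

enum { SWEEP_THRESHOLD = 100000 };  /* the 100k threshold discussed above */

static void interp_init(Interp *in) {
    in->head = NULL;
    in->runs_until_sweep = SWEEP_THRESHOLD;
    in->sweep_pending = false;
}

/* Link a new executor at the head of the list; it starts out cold. */
static void executor_add(Interp *in, Executor *ex) {
    ex->warm = false;
    ex->valid = true;
    ex->next = in->head;
    in->head = ex;
}

/* Record one run: mark the executor warm and request a sweep once the
 * countdown reaches zero. */
static void executor_run(Interp *in, Executor *ex) {
    assert(ex->valid);
    ex->warm = true;
    if (in->runs_until_sweep > 0 && --in->runs_until_sweep == 0) {
        in->sweep_pending = true;
    }
}

/* If a sweep was requested, clear the request, reset the countdown,
 * unlink every cold executor, and reset the warm flag on the survivors.
 * Returns the number of executors invalidated. */
static int handle_pending(Interp *in) {
    if (!in->sweep_pending) {
        return 0;
    }
    in->sweep_pending = false;
    in->runs_until_sweep = SWEEP_THRESHOLD;
    int dropped = 0;
    Executor **link = &in->head;
    while (*link != NULL) {
        Executor *ex = *link;
        if (!ex->warm) {
            ex->valid = false;
            *link = ex->next;   /* unlink the cold executor */
            ex->next = NULL;
            dropped++;
        }
        else {
            ex->warm = false;   /* must run again to survive the next sweep */
            link = &ex->next;
        }
    }
    return dropped;
}
```

In this model a lower threshold sweeps more often, so an executor has fewer runs in which to prove it is still warm. That is the sense in which a threshold can be "too liberal" about invalidating.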

Issue: Improve JIT memory consumption by invalidating cold executors #123516

brandtbucher

This is a nice change, thanks!

Include/internal/pycore_interp.h

Include/internal/pycore_optimizer.h

Python/ceval_gil.c

brandtbucher · 2024-09-24T20:54:44Z

Python/ceval_gil.c

@@ -1289,6 +1289,11 @@ _Py_HandlePending(PyThreadState *tstate)
        _Py_RunGC(tstate);
    }

+    if((breaker & _PY_EVAL_JIT_INVALIDATE_COLD_BIT) != 0) {
+        _Py_unset_eval_breaker_bit(tstate, _PY_EVAL_JIT_INVALIDATE_COLD_BIT);
+        _Py_Executors_InvalidateCold(tstate->interp);


Just a thought I had while reading through... I don't think any of the stuff that manipulates the linked list of executors is thread-safe currently. Probably not a problem for this PR, but in the future we'll probably want to go through and add _PyEval_StopTheWorld and _PyEval_StartTheWorld calls in optimizer.c.
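For readers unfamiliar with the pattern in the hunk above, the following is a rough sketch of a "pending work" bit that one piece of code sets and a handler later checks and clears. It uses plain C11 atomics and is not CPython's implementation: `ThreadFlags`, `request_work`, `take_work`, and the bit values are all hypothetical. It also differs from the hunk, which tests a previously loaded `breaker` value and then unsets the bit; here the test and the clear are folded into one atomic operation. It does not address the linked-list thread-safety concern raised in the comment.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical bit values for illustration; CPython defines its own. */
#define PENDING_GC_BIT         (1u << 0)
#define PENDING_INVALIDATE_BIT (1u << 1)

/* Hypothetical per-thread flag word. */
typedef struct {
    atomic_uint breaker;
} ThreadFlags;

/* Request deferred work by setting a bit. Safe to call from any thread. */
static void request_work(ThreadFlags *t, unsigned bit) {
    atomic_fetch_or(&t->breaker, bit);
}

/* Atomically clear a bit and report whether it had been set, so each
 * request is acted on by exactly one caller. */
static bool take_work(ThreadFlags *t, unsigned bit) {
    unsigned old = atomic_fetch_and(&t->breaker, ~bit);
    return (old & bit) != 0;
}
```

A handler would call `take_work(&flags, PENDING_INVALIDATE_BIT)` and run the invalidation pass only when it returns true. Other bits in the word are left untouched.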

Python/optimizer.c

Python/bytecodes.c

Python/optimizer.c

savannahostrowski and others added 30 commits August 27, 2024 18:55

resolve conflict

0eac77b

tests pass except ssl

d576296

remove file

68e95d6

this is broken

c903af4

gc approach

5ca6b7f

rebase

beb4f65

Update has_run to run_count

427dbf5

update initialized run_count and move invalidate old

92d5590

set threshold to 1`

0cdf638

move incrementing run count into a new op

58e7447

add invalidation threshold in gc of 10

6c047e4

move back to incremenet

2645023

remove print

7c7ae98

move invalidation to executor creation

6d6d306

change threshold

4d086fe

new line

d08e45a

update constant

6315877

📜🤖 Added by blurb_it.

622c266

Merge branch 'main' into jit-mem-invalidate-10

e5117b2

resolve conflict

7c6704c

tests pass except ssl

1d72fdd

remove file

1778185

this is broken

2506821

gc approach

fca6dec

rebase

29436fd

Update has_run to run_count

b969b11

update initialized run_count and move invalidate old

a669e0f

set threshold to 1`

deb73ec

move incrementing run count into a new op

9fa55e8

add invalidation threshold in gc of 10

e4a461a

savannahostrowski added 4 commits September 19, 2024 16:12

Update to 1m

f3c01a1

update cases

563a4d7

add py_set_eval_breaker_bit to nonescaping'

062c54f

create 100k branch

17ece50

savannahostrowski requested a review from brandtbucher September 24, 2024 16:40

savannahostrowski requested review from markshannon and ericsnowcurrently as code owners September 24, 2024 16:40

bedevere-app bot added the awaiting review label Sep 24, 2024

bedevere-app bot mentioned this pull request Sep 24, 2024

Improve JIT memory consumption by invalidating cold executors #123516

Closed

savannahostrowski mentioned this pull request Sep 24, 2024

GH-123516: Improve JIT memory consumption by invalidating cold executors #123402

Closed

savannahostrowski added 2 commits September 24, 2024 09:42

Merge branch 'main' into jit-inv-mem-100k

beea8c6

Merge branch 'main' into jit-inv-mem-100k

eb48f82

brandtbucher reviewed Sep 24, 2024

Address comments from Brandt

34363f2

brandtbucher approved these changes Sep 25, 2024

bedevere-app bot added awaiting merge and removed awaiting review labels Sep 25, 2024

savannahostrowski mentioned this pull request Sep 25, 2024

Compiling tiny traces wastes lots of memory #116017

Open

Merge branch 'main' into jit-inv-mem-100k

77e81d4

picnixz reviewed Sep 25, 2024

Python/optimizer.c (outdated, resolved)

savannahostrowski added 6 commits September 25, 2024 08:54

Dedent goto error

09e3300

Merge branch 'main' into jit-inv-mem-100k

8ff071f

Address comments

f238189

Merge branch 'main' into jit-inv-mem-100k

ec99d5a

Add -mno-outline-atomics flag

9526d84

Comment and fix init

18febc7

brandtbucher enabled auto-merge (squash) September 27, 2024 00:06

brandtbucher merged commit 65f1237 into python:main Sep 27, 2024
65 checks passed

bedevere-app bot removed the awaiting merge label Sep 27, 2024

savannahostrowski deleted the jit-inv-mem-100k branch September 27, 2024 16:51
