Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[IMP] snippets: move all work from parent to mp workers #137

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

cawo-odoo
Copy link
Contributor

@cawo-odoo cawo-odoo commented Sep 13, 2024

In convert_html_columns(), we select 100MiB worth of DB tuples and pass them to a ProcessPoolExecutor together with a converter callable. So far, the converter returns all tuples, changed or unchanged together with the information if it has changed something. All this is returned through IPC to the parent process. In the parent process, the caller only acts on the changed tuples, though, the rest is ignored. In any scenario I've seen, only a small proportion of the input tuples is actually changed, meaning that a large proportion is returned through IPC unnecessarily.

What makes it worse is that processing of the converted results in the parent process is often slower than the conversion, leading to two effects:

  1. The results of all workers sit in the parent process's memory, possibly leading to MemoryError (upg-2021031)
  2. The parallel processing is being serialized on the feedback, defeating a large part of the intended performance gains

To improve this, this commit

  • moves all work into the workers, meaning not just the conversion filter, but also the DB query as well as the DB updates.
    • by doing so reduces the amount of data passed by IPC to just the query texts
    • by doing so distributes the data held in memory to all worker processes
  • reduces the chunk size by one order of magnitude, which means
    • a lot less memory used at a time
    • a lot better distribution of "to-be-changed" rows when these rows are clustered in the table

All in all, in my test case, this

  • reduces maximum process size in memory to 300MiB for all processes compared to formerly >2GiB (and MemoryError) in the parent process
  • reduces runtime from 17 minutes to less than 2 minutes

@cawo-odoo cawo-odoo requested review from a team and UemuS September 13, 2024 14:12
@robodoo
Copy link
Contributor

robodoo commented Sep 13, 2024

Pull request status dashboard

@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch 4 times, most recently from 3cb6112 to 2c980fa Compare September 16, 2024 11:52
@cawo-odoo
Copy link
Contributor Author

ok, I stopped the mattness. If anyone wants to review ... :-)

@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from 2c980fa to bd55b91 Compare September 16, 2024 15:31
@cawo-odoo cawo-odoo changed the title [IMP] snippets: do not return unneeded data from mp worker [IMP] snippets: move all work from parent to mp workers Sep 23, 2024
@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from bd55b91 to 3a69c90 Compare September 23, 2024 07:28
Copy link
Contributor

@aj-fuentes aj-fuentes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This loos mostly good.

src/util/snippets.py Outdated Show resolved Hide resolved
src/base/tests/test_util.py Outdated Show resolved Hide resolved
@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from 3a69c90 to 9ba22c9 Compare September 23, 2024 09:00
@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from 9ba22c9 to a697852 Compare September 23, 2024 09:04
Copy link
Contributor

@aj-fuentes aj-fuentes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some suggestion about the row processing method. Untested!

Regarding the test I'm OK with it, but you could put a comment to somehow clarify the extra steps in the test.

src/util/snippets.py Outdated Show resolved Hide resolved
src/util/snippets.py Outdated Show resolved Hide resolved
src/util/snippets.py Outdated Show resolved Hide resolved
src/util/snippets.py Outdated Show resolved Hide resolved
src/util/snippets.py Outdated Show resolved Hide resolved
@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from b6888a6 to 117114b Compare September 23, 2024 11:31
@cawo-odoo
Copy link
Contributor Author

all comments resolved

src/util/snippets.py Outdated Show resolved Hide resolved
Copy link
Contributor

@aj-fuentes aj-fuentes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cc: @KangOl

@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from 117114b to b8eebed Compare September 23, 2024 11:59
@cawo-odoo
Copy link
Contributor Author

𓃠 ?

@cawo-odoo
Copy link
Contributor Author

🐈 ?

In `convert_html_columns()`, we select 100MiB worth of DB tuples and pass them
to a ProcessPoolExecutor together with a converter callable. So far, the
converter returns all tuples, changed or unchanged together with the
information if it has changed something. All this is returned through IPC to
the parent process. In the parent process, the caller only acts on the changed
tuples, though, the rest is ignored. In any scenario I've seen, only a small
proportion of the input tuples is actually changed, meaning that a large
proportion is returned through IPC unnecessarily.

What makes it worse is that processing of the converted results in the parent
process is often slower than the conversion, leading to two effects:
1) The results of all workers sit in the parent process's memory, possibly
   leading to MemoryError (upg-2021031)
2) The parallel processing is being serialized on the feedback, defeating a
   large part of the intended performance gains

To improve this, this commit
- moves all work into the workers, meaning not just the conversion filter, but
  also the DB query as well as the DB updates.
  - by doing so reduces the amount of data passed by IPC to just the query
    texts
  - by doing so distributes the data held in memory to all worker processes
- reduces the chunk size by one order of magnitude, which means
  - a lot less memory used at a time
  - a lot better distribution of "to-be-changed" rows when these rows are
    clustered in the table

All in all, in my test case, this
- reduces maximum process size in memory to 300MiB for all processes compared
  to formerly >2GiB (and MemoryError) in the parent process
- reduces runtime from 17 minutes to less than 2 minutes
@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from b8eebed to 0ee95a6 Compare November 14, 2024 14:35
@cawo-odoo cawo-odoo force-pushed the master-imp_snippets_memerror_performance-cawo branch from 0ee95a6 to 3a916ac Compare November 14, 2024 14:39
@cawo-odoo
Copy link
Contributor Author

@nseinlet concerning our earlier coffee-machine meeting: wdyt about the fixup just pushed?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants