HPCC-32791 Partition the index LRU cache to reduce contention #19200

ghalliday · 2024-10-16T10:45:15Z

Type of change:

This change is a bug fix (non-breaking change which fixes an issue).
This change is a new feature (non-breaking change which adds functionality).
This change improves the code (refactor or other change that does not change the functionality)
This change fixes warnings (the fix does not alter the functionality or the generated code)
This change is a breaking change (fix or feature that will cause existing behavior to change).
This change alters the query API (existing queries will have to be recompiled)

Checklist:

Smoketest:

Send notifications about my Pull Request position in Smoketest queue.
Test my draft Pull Request.

Testing:

ghalliday · 2024-10-16T10:49:05Z

Pushed for initial review. It needs more cleanup - particularly adding some functions into CNodeMRUCache to remove the iterators from the CNodeCache functions. I will test fully later, and paste some concrete numbers.
Initial numbers from a previous iteration of this branch showed 5-10% improvement for old index format and <5% for new index format. (The latter is possibly because my test is no longer saturating the workers.)
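
For readers unfamiliar with the approach named in the title, here is a minimal sketch of a partitioned LRU cache. It is an illustration under stated assumptions, not the code in this PR: the class names `LruPartition` and `PartitionedLruCache`, the `int` payload, and the hash-modulo partition choice are all hypothetical. The point is that each partition has its own lock and its own LRU list, so threads that hit different partitions do not contend.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <list>
#include <memory>
#include <mutex>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical illustration only: none of these names come from jhtree.cpp.
// One LRU list and one mutex per partition.
class LruPartition
{
public:
    explicit LruPartition(size_t _capacity) : capacity(_capacity) { assert(capacity > 0); }

    // Returns true and fills 'value' if the key is cached in this partition.
    bool lookup(uint64_t key, int & value)
    {
        std::lock_guard<std::mutex> guard(lock);
        auto match = index.find(key);
        if (match == index.end())
            return false;
        // Move the entry to the front: it is now the most recently used.
        entries.splice(entries.begin(), entries, match->second);
        value = match->second->second;
        return true;
    }

    void insert(uint64_t key, int value)
    {
        std::lock_guard<std::mutex> guard(lock);
        auto match = index.find(key);
        if (match != index.end())
        {
            match->second->second = value;
            entries.splice(entries.begin(), entries, match->second);
            return;
        }
        if (entries.size() >= capacity)
        {
            // Evict the least recently used entry from this partition only.
            index.erase(entries.back().first);
            entries.pop_back();
        }
        entries.emplace_front(key, value);
        index[key] = entries.begin();
    }

private:
    typedef std::pair<uint64_t, int> Entry;
    size_t capacity;
    std::mutex lock;
    std::list<Entry> entries;   // front = most recently used
    std::unordered_map<uint64_t, std::list<Entry>::iterator> index;
};

class PartitionedLruCache
{
public:
    PartitionedLruCache(size_t numPartitions, size_t capacityPerPartition)
    {
        assert(numPartitions > 0);
        for (size_t i = 0; i < numPartitions; i++)
            partitions.push_back(std::unique_ptr<LruPartition>(new LruPartition(capacityPerPartition)));
    }

    bool lookup(uint64_t key, int & value) { return select(key).lookup(key, value); }
    void insert(uint64_t key, int value) { select(key).insert(key, value); }

private:
    // Assumption: partition chosen by hashing the key; a real cache may choose differently.
    LruPartition & select(uint64_t key)
    {
        return *partitions[std::hash<uint64_t>()(key) % partitions.size()];
    }

    std::vector<std::unique_ptr<LruPartition>> partitions;
};
```

One trade-off of this design is that eviction becomes approximate: the least recently used entry within a partition is dropped, which is not necessarily the least recently used entry overall.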

github-actions · 2024-10-16T10:54:26Z

Jira Issue: https://hpccsystems.atlassian.net//browse/HPCC-32791

Jirabot Action Result:
Workflow Transition To: Merge Pending
Updated PR

Signed-off-by: Gavin Halliday <[email protected]>

ghalliday · 2024-10-17T09:33:12Z

I have rebased and rerun tests after sorting out the confusing inconsistent timings (HPCC-32814).
The following timings are min, median for 5 runs:
default, before: 5.688, 5.720
default, after: 5.345, 5.362
inplace, before: 3.415, 3.434
inplace, after: 2.854, 2.921
An improvement of ~5% for default compression and 15% for new inplace compression.
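
As a worked check of the figures above, the relative improvement can be computed as (before - after) / before. The helper name `percentImprovement` is hypothetical and only here to show the arithmetic.

```cpp
#include <cassert>

// Relative reduction in elapsed time, as a percentage of the "before" timing.
// Hypothetical helper for illustration; not part of the PR.
double percentImprovement(double before, double after)
{
    assert(before > 0.0);
    return (before - after) / before * 100.0;
}
```

Applied to the timings quoted above, this gives roughly 6.0% (min) and 6.3% (median) for default compression, and roughly 16.4% (min) and 14.9% (median) for inplace compression, close to the rounded figures in the comment.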

Signed-off-by: Gavin Halliday <[email protected]>

mckellyln · 2024-10-17T13:47:20Z

system/jhtree/jhtree.cpp

-        {
-            if (ctx) ctx->noteStatistic(addStatId[cacheType], 1);
+            if (unlikely(alreadyExists))
+                ctx->noteStatistic(hitStatId[cacheType], 1);


Are these backwards?

ghalliday (Oct 22, 2024): unlikely() is correct, because the code only reaches this point if there was a match in the cache but the node associated with that entry has not been loaded yet.
I will add a comment to clarify why.
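
The reasoning in this reply can be sketched roughly as below. This is one reading of the explanation, not the code in jhtree.cpp: `SketchEntry`, `SketchStats`, `getNode` and the `SKETCH_UNLIKELY` macro are hypothetical names, locking is omitted, and the macro assumes a GCC/Clang-style `__builtin_expect` (the project's own `unlikely()` may be defined differently). The fast path handles the usual hit; the slow path is reached either for a new entry or, rarely, for an existing entry whose node is not loaded yet.

```cpp
#include <cassert>

// Assumption: GCC/Clang branch hint; falls back to a plain expression elsewhere.
#if defined(__GNUC__) || defined(__clang__)
  #define SKETCH_UNLIKELY(x) __builtin_expect(!!(x), 0)
#else
  #define SKETCH_UNLIKELY(x) (x)
#endif

// Hypothetical stand-ins for the cache entry and the statistics context.
struct SketchStats
{
    unsigned hits = 0;
    unsigned adds = 0;
};

struct SketchEntry
{
    bool loaded = false;
    int payload = 0;
};

// 'alreadyExists' says whether the key matched an entry in the cache.
int getNode(SketchEntry & entry, bool alreadyExists, SketchStats & stats, int (*loadNode)())
{
    // Fast path: the usual cache hit, entry present and node already loaded.
    if (alreadyExists && entry.loaded)
    {
        stats.hits++;
        return entry.payload;
    }

    // Slow path: normally reached because the entry was just added.
    // Reaching it with alreadyExists set means the entry matched but its node
    // was not loaded yet, which is expected to be rare, hence the hint.
    if (SKETCH_UNLIKELY(alreadyExists))
        stats.hits++;
    else
        stats.adds++;

    entry.payload = loadNode();
    entry.loaded = true;
    return entry.payload;
}
```

The hint only affects how the compiler lays out the branch; the statistics are the same with or without it.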

mckellyln

I think this is a great addition.
Just one question: is alreadyExists more typically a likely or an unlikely expectation?

Signed-off-by: Gavin Halliday <[email protected]>

mckellyln

Approved.

ghalliday requested review from richardkchapman and mckellyln October 16, 2024 10:45

HPCC-32791 Partition the index LRU cache to reduce contention

74e14f8

Signed-off-by: Gavin Halliday <[email protected]>

ghalliday force-pushed the issue32791 branch from ae41144 to 74e14f8 Compare October 17, 2024 09:26

ghalliday added 3 commits October 17, 2024 11:11

cleanup

9a4801d

Signed-off-by: Gavin Halliday <[email protected]>

Othe code cleanup - could push to a different PR

8807765

Signed-off-by: Gavin Halliday <[email protected]>

Metrics not initialised correctly

ab5cfc6

Signed-off-by: Gavin Halliday <[email protected]>

mckellyln reviewed Oct 17, 2024

mckellyln approved these changes Oct 18, 2024

Add clarifying comment

ba9a0c8

Signed-off-by: Gavin Halliday <[email protected]>

mckellyln approved these changes Oct 31, 2024

Review thread comments: mckellyln (Oct 17, 2024), ghalliday (Oct 22, 2024), mckellyln (Oct 31, 2024)
