
[sombrero] Handle systems where number of cores doesn't divide lattice volume #250

Merged: 3 commits merged into main on Dec 14, 2023

Conversation

@giordano (Member) commented Dec 4, 2023

Should fix #237 (comment). I followed the suggestion in #237 (comment). This probably isn't the most efficient algorithm out there, but it takes only a few microseconds on my laptop, so hopefully it won't be a major performance bottleneck:

```
In [7]: %%timeit
   ...: max_num_tasks(112, LATTICE_VOLUME)
   ...:
3.75 µs ± 24.6 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)
```

CC: @mirenradia.
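The helper itself isn't shown in the excerpts below, so here is a minimal sketch of what `max_num_tasks` might look like, assuming it returns the largest task count not exceeding the core limit that evenly divides the lattice volume (the actual implementation in the PR may differ):

```python
# Hypothetical sketch of max_num_tasks: find the largest n <= limit
# such that n evenly divides volume, by scanning downwards from limit.
def max_num_tasks(limit, volume):
    for n in range(limit, 0, -1):
        if volume % n == 0:
            return n
    return 1

LATTICE_VOLUME = 32 * 24 * 24 * 24  # 442368 = 2^14 * 3^3

# 112 cores don't divide the volume (112 has a factor of 7),
# so we fall back to the nearest smaller count that does.
print(max_num_tasks(112, LATTICE_VOLUME))  # -> 108
```

A linear scan is O(limit) in the worst case, which matches the "microseconds on a laptop" timing quoted above for realistic core counts.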

```diff
@@ -122,4 +137,7 @@ def set_up_from_parameters(self):

     @run_after('setup')
     def setup_num_tasks(self):
-        self.num_tasks = self.current_partition.processor.num_cores * 64
+        self.num_tasks = max_num_tasks(
+            self.current_partition.processor.num_cores * 64,
```
giordano (Member, Author):

I'm not sure whether 64 should multiply self.current_partition.processor.num_cores or the result of max_num_tasks. In the latter case we'd greatly oversubscribe the node, right? But that was already the case before, so perhaps that was the intent?

mirenradia (Contributor):

> In the latter case we'd greatly oversubscribe the node, right? But that was already the case before, so probably that was it?

I think this test was intended to run on 64 nodes so it shouldn't be oversubscribed. Maybe we should use the num_tasks_per_node option instead and calculate the max_num_tasks according to LATTICE_VOLUME / 64 to ensure even load balancing and that we actually use 64 nodes?

giordano (Member, Author):

Agreed, setting num_tasks_per_node is cleaner in this case, I did that, thanks!
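For illustration, the num_tasks_per_node approach agreed on above could look roughly like this (a sketch with assumed names and an illustrative core count; the max_num_tasks sketch is repeated here so the snippet is self-contained):

```python
def max_num_tasks(limit, volume):
    # largest task count <= limit that evenly divides volume
    for n in range(limit, 0, -1):
        if volume % n == 0:
            return n
    return 1

LATTICE_VOLUME = 32 * 24 * 24 * 24
num_nodes = 64
num_cores = 112  # illustrative per-node core count

# Balance each node's share of the lattice across its cores, so the job
# uses exactly num_nodes nodes without oversubscribing any of them.
num_tasks_per_node = max_num_tasks(num_cores, LATTICE_VOLUME // num_nodes)
num_tasks = num_tasks_per_node * num_nodes
print(num_tasks_per_node, num_tasks)  # -> 108 6912
```

Because each node gets the same task count, the load is balanced across all 64 nodes, which is the point of the suggestion above.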


```python
from benchmarks.apps.sombrero import case_filter
from benchmarks.modules.reframe_extras import scaling_config
from benchmarks.modules.utils import SpackTest

# Fixed lattice volume in ITT benchmarks
LATTICE_VOLUME = 32 * 24 * 24 * 24
```
mirenradia (Contributor):

Looking at the Sombrero README, I think this is only the lattice volume when -s small is passed to the executable. For the 64-node test, -s medium is currently passed, so the lattice volume is $48^3 \cdot 64$. It's possible these might be changed as well in order to resolve #246.

Having said that, I think this will probably still work fine, since the prime factor decompositions of both numbers are of the form $2^p 3^q$ and both are much bigger than the number of cores on even the largest nodes.
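The factorization claim above is easy to check numerically; a quick sketch (volumes taken from the comment, helper name is illustrative):

```python
def strip_2s_and_3s(n):
    # Divide out all factors of 2 and 3; the result is 1
    # exactly when n is of the form 2^p * 3^q.
    for f in (2, 3):
        while n % f == 0:
            n //= f
    return n

small = 32 * 24 ** 3   # -s small lattice volume: 2^14 * 3^3
medium = 48 ** 3 * 64  # -s medium lattice volume: 2^18 * 3^3

print(strip_2s_and_3s(small), strip_2s_and_3s(medium))  # -> 1 1
```

Since both volumes have only 2s and 3s in their factorization, any power-of-two core count (and many others) divides them cleanly.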

giordano (Member, Author):

Thanks, I defined LATTICE_VOLUME_SMALL and LATTICE_VOLUME_MEDIUM to distinguish the two cases.

@mirenradia (Contributor):

> Probably this isn't the most efficient algorithm out there, but this takes a bunch of microseconds on my laptop, so hopefully it won't be a major performance bottleneck.

I agree that this is extremely unlikely to be a problem so I have no issues with it.

@mirenradia (Contributor) left a review:

I'm happy with these changes other than my one small comment.

I have tested that the ITT-sn one works on the CSD3 Icelakes, but unfortunately I can't test the ITT-64n one as it exceeds the max job size (I did verify it works with 64 replaced by 48).

Comment on lines 142 to 144:

```python
    LATTICE_VOLUME_MEDIUM // 64,
)
self.num_tasks = self.num_tasks_per_node * 64
```
mirenradia (Contributor):

One very tiny comment: could you replace the two uses of the magic number 64 with a variable, e.g. num_nodes?

giordano (Member, Author):

Done! Can you please check it works for you as expected?

@mirenradia (Contributor) commented Dec 14, 2023

LGTM (but I can't approve it until you re-request a review).

I have tested this works on the CSD3 Icelakes as expected (by passing -S num_nodes=48 for the ITT-64n test).

@giordano (Member, Author):

Thanks!

@giordano giordano merged commit 364fdb8 into main Dec 14, 2023
4 checks passed
@giordano giordano deleted the mg/sombrero-ntasks branch December 14, 2023 14:26

Successfully merging this pull request may close these issues.

Multiple issues with Sombrero benchmark