Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix Cholesky priorities #999

Merged
merged 1 commit into from
Oct 2, 2023
Merged

Fix Cholesky priorities #999

merged 1 commit into from
Oct 2, 2023

Conversation

rasolca
Copy link
Collaborator

@rasolca rasolca commented Sep 29, 2023

@rasolca
Copy link
Collaborator Author

rasolca commented Sep 29, 2023

cscs-ci run

@rasolca
Copy link
Collaborator Author

rasolca commented Sep 29, 2023

4 nodes on eiger

master:

[0]
[0] 1.07523s 2662.96GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[1]
[1] 1.02061s 2805.5GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[2]
[2] 1.05352s 2717.85GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[3]
[3] 1.01913s 2809.56GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[4]
[4] 1.05299s 2719.22GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC

[0]
[0] 3.7225s 6153.53GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[1]
[1] 3.97347s 5764.86GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[2]
[2] 3.82834s 5983.4GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[3]
[3] 3.83611s 5971.28GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[4]
[4] 3.81851s 5998.8GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC

new:

[0]
[0] 0.962872s 2973.72GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[1]
[1] 0.984092s 2909.6GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[2]
[2] 0.98801s 2898.06GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[3]
[3] 0.969163s 2954.42GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC
[4]
[4] 0.968429s 2956.66GFlop/s dL (20480, 20480) (512, 512) (8, 4) 16 MC

[0]
[0] 3.71782s 6161.27GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[1]
[1] 3.67812s 6227.76GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[2]
[2] 3.6927s 6203.18GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[3]
[3] 3.76707s 6080.73GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC
[4]
[4] 3.66226s 6254.75GFlop/s dL (40960, 40960) (512, 512) (8, 4) 16 MC

@rasolca rasolca merged commit fbab08c into master Oct 2, 2023
3 checks passed
@rasolca rasolca deleted the rasolca/cholesky branch October 2, 2023 12:05
github-actions bot pushed a commit that referenced this pull request Oct 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

2 participants