Multishift refinement w/ QUDA #1

mathiaswagner · 2015-07-09T17:12:08Z

QUDA internally refines the multishift inversions using the single CG.
The MILC code then calls the single CG (either CPU or GPU) again.
This second call generates a lot of overhead and essentially always performs zero iterations, so it just wastes time.
There is an option, NO_REFINE, which skips the refinement step if the Naik epsilon of the higher shifts is identical to that of the zeroth shift.
It would be beneficial to turn off any refinement call from the MILC code when QUDA is used, i.e. make NO_REFINE the default option.
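
To make the proposed behavior concrete, here is a minimal sketch of the decision as I read it. It is not the actual MILC code: the function name `skip_refinement`, its arguments, and the reading that all higher shifts must share the zeroth shift's Naik epsilon are assumptions for illustration only.

```c
/* Hypothetical helper (not a MILC or QUDA function): decide whether the
 * host-side refinement after a multishift solve can be skipped.
 *
 * no_refine : nonzero if the NO_REFINE option is enabled
 * naik_eps  : Naik epsilon for each shift, naik_eps[0] is the zeroth shift
 * nshift    : number of shifts
 *
 * Returns 1 if refinement can be skipped, 0 otherwise. */
int skip_refinement(int no_refine, const double *naik_eps, int nshift)
{
    int i;

    if (!no_refine || nshift <= 0)
        return 0;

    /* Exact comparison is intended here: the values are assumed to be
     * copies of the same input parameter, not results of arithmetic. */
    for (i = 1; i < nshift; i++) {
        if (naik_eps[i] != naik_eps[0])
            return 0;
    }
    return 1;
}
```

With NO_REFINE as the default, a caller would only fall back to the extra single CG call when this check returns 0, for example when one of the higher shifts carries a different Naik epsilon.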

Any objections, @detar, @stevengottlieb ?

In a short test on a 32^4 lattice, this reduced the runtime of the RHMC (single precision) by a factor of 2. Admittedly, I basically just adapted one of the existing test cases, so the iteration count is low and the overhead is more pronounced.

detar added a commit that referenced this issue Oct 19, 2015

Merge pull request #1 from lattice/feature/no_cpu_refine

32698a7

QUDA interfacing improvements to mixed-precision multi-shift solver

weinbe2 pushed a commit that referenced this issue Sep 4, 2024

Merge pull request #1 from ylin910095/production/gA.QudaGSmear

a64b652

Production/g a.quda g smear
