Simple implementation of Stochastic Reconfiguration #5017

camelto2 · 2024-05-31T01:32:58Z

Please review the developer documentation
on the wiki of this project that contains help and requirements.

Proposed changes

Wanted to put this up to get some eyes on it.

This implements a very simple version of stochastic reconfiguration (SR). The benefit is that we avoid the <Psi_i | H | Psi_j> matrix elements, which get costly for a large number of parameters. SR solves the linear equation S * dp = -tau * h for the parameter update dp, where tau is a small projection time, h is the vector <Psi_i | H | Psi_0>, and S is the overlap matrix <Psi_i | Psi_j>. The simple approach would be to build S, invert it, and solve for dp. A better method, employed here, is to solve the linear equation with a conjugate gradient (CG) method. This avoids explicitly building the entire S matrix; only the matrix-vector product S*z is built at each CG iteration. This is essentially the "accelerated" SR approach described in the text surrounding Eqn. 7 of https://doi.org/10.1103/PhysRevB.85.045103.
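To make the matrix-free idea concrete, here is a minimal sketch, not the code in this PR. It assumes S is estimated from per-sample log-derivatives as S_ij = <O_i O_j> - <O_i><O_j>, that h is already available as a vector, and that a diagonal `shift` is added for regularization. The names `applyOverlap`, `conjugateGradient`, `srUpdate`, and `derivs` are hypothetical and chosen only for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <functional>
#include <vector>

using Vec = std::vector<double>;

// Dot product of two equal-length vectors.
double dot(const Vec& a, const Vec& b)
{
  assert(a.size() == b.size());
  double s = 0.0;
  for (std::size_t i = 0; i < a.size(); ++i)
    s += a[i] * b[i];
  return s;
}

// Matrix-free product y = (S + shift*I) z with S_ij = <O_i O_j> - <O_i><O_j>.
// derivs[k][i] is the (assumed) log-derivative O_i at sample k; S is never stored.
Vec applyOverlap(const std::vector<Vec>& derivs, const Vec& z, double shift)
{
  const std::size_t M = derivs.size();
  const std::size_t N = z.size();
  Vec mean(N, 0.0), y(N, 0.0);
  for (const Vec& o : derivs)
    for (std::size_t i = 0; i < N; ++i)
      mean[i] += o[i] / M;
  for (const Vec& o : derivs)
  {
    const double oz = dot(o, z) / M; // (O(k) . z) / M for this sample
    for (std::size_t i = 0; i < N; ++i)
      y[i] += o[i] * oz;
  }
  const double mz = dot(mean, z);
  for (std::size_t i = 0; i < N; ++i)
    y[i] += shift * z[i] - mean[i] * mz;
  return y;
}

// Textbook conjugate gradient for A x = b, with A symmetric positive definite
// and available only through the function z -> A z.
Vec conjugateGradient(const std::function<Vec(const Vec&)>& applyA, const Vec& b,
                      double tol = 1e-10, int maxIter = 1000)
{
  Vec x(b.size(), 0.0);
  Vec r = b; // residual b - A x for the initial guess x = 0
  Vec p = r;
  double rr = dot(r, r);
  const double stop = tol * tol * dot(b, b); // relative residual criterion
  for (int it = 0; it < maxIter && rr > stop; ++it)
  {
    const Vec Ap = applyA(p);
    const double alpha = rr / dot(p, Ap);
    for (std::size_t i = 0; i < x.size(); ++i)
    {
      x[i] += alpha * p[i];
      r[i] -= alpha * Ap[i];
    }
    const double rrNew = dot(r, r);
    const double beta = rrNew / rr;
    for (std::size_t i = 0; i < p.size(); ++i)
      p[i] = r[i] + beta * p[i];
    rr = rrNew;
  }
  return x;
}

// Solve (S + shift*I) dp = -tau * h for the parameter update dp.
Vec srUpdate(const std::vector<Vec>& derivs, const Vec& h, double tau, double shift)
{
  Vec rhs(h.size());
  for (std::size_t i = 0; i < h.size(); ++i)
    rhs[i] = -tau * h[i];
  return conjugateGradient([&](const Vec& z) { return applyOverlap(derivs, z, shift); }, rhs);
}
```

Each CG iteration costs one pass over the samples, O(M*N), instead of the O(M*N^2) work and O(N^2) storage needed to build S explicitly, which is why this form matters at large parameter counts.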

Preliminary tests using this have allowed us to scale up to 100,000+ variational parameters, which was simply not possible with the current implementation of one_shift_only, and it gives a huge speedup over one_shift for large parameter counts.

This is still a WIP because there are a number of things to improve. A few things off the top of my head:

- Need to add meaningful tests. This was the result of some exploratory coding to see if it could work. Now that our preliminary testing suggests the method works pretty well and is fast, I need to add proper tests.
- Would like to abstract away the CG solver.
- Probably need a better implementation of the CG solver. I literally implemented whatever was on Wikipedia, and I have some hard-coded CG convergence criteria which may not be sufficient.
- Right now, it either uses a fixed tau to get the parameter update or it can do a line search with correlated sampling. In some cases, the line search is far superior and converges significantly faster. However, if the correlated sampling weight gets small and stays small throughout the optimization, then the energy can blow up and tank the optimization. In those cases, you just need to set tau to something small, but this can result in many optimization iterations. Each iteration is a lot faster, but you may end up needing O(100) or more to get convergence. If tau is too large, the optimization can go haywire as well. I think an optimal approach would be to use the line search unless the weights get too small, and just take steps with small tau otherwise. Another option is some sort of adaptive method which scales the timestep automatically to help accelerate convergence.
- Probably many other things I'm not thinking of.
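The fallback idea in the line-search item (use the line search unless the weights get too small) could look something like the sketch below. This is not implemented in the PR. It assumes "weights get too small" can be measured by the effective-sample fraction of the correlated-sampling weights, which is only one possible criterion. The names `effectiveSampleFraction`, `chooseTau`, and the `minFraction` threshold are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Effective-sample fraction of the correlated-sampling weights:
// (sum w)^2 / (M * sum w^2). Equal weights give 1; one dominant weight gives ~1/M.
double effectiveSampleFraction(const std::vector<double>& w)
{
  assert(!w.empty());
  double sum = 0.0, sumSq = 0.0;
  for (double wi : w)
  {
    sum += wi;
    sumSq += wi * wi;
  }
  return sumSq > 0.0 ? (sum * sum) / (w.size() * sumSq) : 0.0;
}

// Hypothetical policy: accept the line-search step only while the reweighting
// is healthy; otherwise fall back to a fixed, small tau.
double chooseTau(const std::vector<double>& weights, double lineSearchTau,
                 double fallbackTau, double minFraction = 0.5)
{
  return effectiveSampleFraction(weights) >= minFraction ? lineSearchTau : fallbackTau;
}
```

The default threshold of 0.5 is arbitrary and would need tuning against real optimizations.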

I'm going to be on leave for a few weeks, but I wanted to get this up to get some comments/suggestions. This seems to be the approach other QMC codes are using for large parameter count optimizations, so it would be nice to have something like this in the code. But I think it needs a lot of improvements.

What type(s) of changes does this code introduce?

Delete the items that do not apply

New feature

Does this introduce a breaking change?

No

What systems has this change been tested on?

Checklist

Update the following with a yes where the items apply. If you're unsure about any of them, don't hesitate to ask. This is
simply a reminder of what we are going to look for before merging your code.

Yes. This PR is up to date with the current state of 'develop'
Yes. Code added or changed in the PR has been clang-formatted
No. This PR adds tests to cover any new code, or to catch a bug that is being fixed
No. Documentation has been added (if appropriate)

prckent · 2024-05-31T14:26:04Z

Test this please

prckent · 2024-08-21T19:36:11Z

Cody: Can we revisit the WIP label of this PR and get it merged? This addition is a great achievement -- several people have been able to successfully use it in real calculations. After fixing the conflict and making any other updates you wish, we can review and make a list of what needs addressing in future PRs. e.g. Some small documentation and a basic test to exercise the code paths.

camelto2 · 2024-08-23T17:59:44Z

> Cody: Can we revisit the WIP label of this PR and get it merged? This addition is a great achievement -- several people have been able to successfully use it in real calculations. After fixing the conflict and making any other updates you wish, we can review and make a list of what needs addressing in future PRs. e.g. Some small documentation and a basic test to exercise the code paths.

Glad to hear that people have tried it out successfully. I will fix the conflict. I might try to refactor part of it and make a smaller PR for the krylov solver which will be easier to write a test for. Then we can incorporate those changes into this optimizer and get it merged if that sounds good.

prckent · 2024-08-23T18:17:36Z

Since people are using this, best to get it in mainline. So I prefer: fix, merge, & then to iterate testing+refactoring+documenting.

prckent · 2024-08-23T20:35:28Z

Test this please

ye-luo · 2024-08-23T21:12:29Z

Since it is a specific flavor of SR, could you make the enum and option name more specific?

prckent · 2024-08-26T14:14:04Z

@camelto2 I think Ye has a point, but I'll leave it up to you whether to rename now or in a later PR. Please leave a comment either way. This capability is obviously evolving and, as you point out (thanks), the CG solver, for example, is worth both refining and abstracting.

ye-luo · 2024-09-03T16:08:03Z

Test this please

ye-luo · 2024-09-12T22:31:36Z

src/QMCDrivers/Optimizers/OptimizerTypes.h

+     {"OneShiftOnly", OptimizerType::ONESHIFTONLY}, {"adaptive", OptimizerType::ADAPTIVE},
+     {"descent", OptimizerType::DESCENT}, {"hybrid", OptimizerType::HYBRID},
+     {"gradient_test", OptimizerType::GRADIENT_TEST},
+     {"stochastic_reconfiguration", OptimizerType::STOCHASTIC_RECONFIGURATION_CG}};


@camelto2 can we also rename the input to "sr_cg"?

camelto2 added 30 commits April 29, 2024 10:53

poor mans SR

bd950fb

use line search already in code

41e61c8

temp reuse of fillOverlap

4fb618c

set up functions

e41f459

not quite working

c358dad

fixed ham grad

07fc96d

edits

2fa7184

its working

65228f6

its working!

e14e520

clean up fillOverlap

3f8f058

add new checkConfigurations, non optimial implementation

859c44b

make new checkConfigurationsSR without dH

7a506b8

allreduce specialization issue using vectors, fix by using matrix of (N,1) size

e2af9ad

Merge remote-tracking branch 'upstream/develop' into stochastic_reconfig

9baf7ed

krylov solver working

c22c621

add timers

c3a9904

change max CG iterations

af24de6

add option to use line search

e5bfca0

Merge remote-tracking branch 'origin/stochastic_reconfig_krylov' into stochastic_reconfig_krylov

4822cca

change krylov convergence

2d64f0e

change order

2ae00e5

add back shift and remove debug

541f792

avoid building full S matrix

be8b4d0

auto shift_s

f17e0a3

add actual input parameter for SR tau

3a5c750

switch to vector instead of Matrix(N, 1) types

5fbef96

add parameter rescaling

643ea3c

Merge remote-tracking branch 'upstream/develop' into stochastic_reconfig_fullkrylov

281d004

slightly improve correlated sampling line search

2f62909

adjust linesearch

86451ff

camelto2 added 4 commits May 16, 2024 17:54

change scaling

7b49701

fix printing

b51678c

Merge remote-tracking branch 'upstream/develop' into stochastic_reconfig_fullkrylov

4029483

add some comments

cbfb2af

prckent added this to the v4.0.0 Release milestone Aug 19, 2024

Merge remote-tracking branch 'upstream/develop' into stochastic_reconfig_fullkrylov

3f77c00

camelto2 changed the title ~~[WIP] Simple implementation of Stochastic Reconfiguration~~ Simple implementation of Stochastic Reconfiguration Aug 23, 2024

Merge remote-tracking branch 'upstream/develop' into stochastic_reconfig_fullkrylov

aa4789d

camelto2 added 2 commits September 3, 2024 09:07

update name of method

421ff7f

Merge remote-tracking branch 'upstream/develop' into stochastic_reconfig_fullkrylov

ecb57d8

ye-luo approved these changes Sep 3, 2024

ye-luo enabled auto-merge September 3, 2024 16:08

ye-luo merged commit a126083 into QMCPACK:develop Sep 3, 2024
37 of 39 checks passed

camelto2 mentioned this pull request Sep 4, 2024

Refactor Conjugate Gradient part of SR optimization #5156

Merged

ye-luo reviewed Sep 12, 2024

camelto2 deleted the stochastic_reconfig_fullkrylov branch September 16, 2024 19:51

camelto2 mentioned this pull request Sep 17, 2024

Change regularization in SR #5169

Merged
