-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Quasi-Newton update rules issues #382
Comments
I started looking a bit into these points. Ad 1) The function is used when updating the Hessian at Manopt.jl/src/solvers/quasi_Newton.jl Line 677 in 1856a26
or Manopt.jl/src/solvers/quasi_Newton.jl Line 738 in 1856a26
Ad 2) That might indeed be the case, I could only find the scale in the update of the full-matrix variants. Manopt.jl/src/solvers/quasi_Newton.jl Lines 514 to 516 in 1856a26
I am not sure where the scaling was “lost” in one of the reworks, but we can surely bring that back Ad 3) Sure, I am not 100% sure how to realise that, but when we have the iterate available doing a reset when that is zero sounds reasonable. When we wrote this code, reusing the state (and hence the updates) was not so much thought given to, for sure.
Manopt.jl/src/plans/quasi_newton_plan.jl Line 500 in 1856a26
and Manopt.jl/src/plans/quasi_newton_plan.jl Line 582 in 1856a26
which we basically have used before but then a bit more generic (not just a boolean)
and then
which might be a bit nicer to allow also for other ways to care for stability. |
I see, thanks.
I have it sketched already so I will only ask for a review when it's ready 🙂
Yes, unifying that is a good idea. |
If you see how to correctly add the scaling ( and unifying that to be a real value), feel free to add that as well. I am sure it was from one of the Wuang papers about QN. |
I think the last point mentioned here should be addressed now in #392, since I removed edit: or to be precise it is described at Lines 35 to 40 in 649bc7e
|
I also checked for the scaling, if you compare the part/scale between the backward and forward pass in the scaling factor should be added in this line Manopt.jl/src/plans/quasi_newton_plan.jl Line 596 in c8564b8
so we could/should write
I can add that to the PR we are doing as well, since we would change it to a real as well in that step, of course. |
Oh and the full one also has a bool we would change to a real and then scale basically at Manopt.jl/src/plans/quasi_newton_plan.jl Line 434 in c8564b8
as well. |
Nice, so that PR would completely resolve this 👍 |
Yes, I can do that tomorrow after my oral exams. |
* trigger all ambiguity errors. * Move dispatch on p into subfunctions, that existed before anyways. * rework the safeguards to dispatch internally. * remove deprecated definitions. * remove all deprecated parameters. * rename set_manopt_parameter! to set_parameter and get_manopt_parameter to get_parameter * unify stabilisation through projection keyword. * remove `update_stopping_criterion` to update values in the stopping criterion and submerge it in the set_parameter! scheme * unify state constructors of ALM and ARC * Adapt EPM. * Finish DoC and DCPPA * Unify how p and X are passed to states overall in all states. * Sketch a first idea of a factory. * Fix scaling parameter in quasi newton. This resolvs #382 * Start changing the docs snippets to using a documentation glossary * 📚Finish the glossary work * Unify signatures, fix a few tex typos and improve english on the factory info/note. --------- Co-authored-by: Hajg Jasa <[email protected]> Co-authored-by: Bagaev Dmitry <[email protected]> Co-authored-by: Mateusz Baran <[email protected]>
I'm currently working on some updates to quasi-Newton direction update rules. Here are a few things to check/consider:
QuasiNewtonCautiousDirectionUpdate
doesn't seem to useθ
function anywhere.QuasiNewtonLimitedMemoryDirectionUpdate
doesn't seem to usescale
.initialize_solver!
, and maybe also when the direction stops being a descent direction. That shouldn't happen for a well-designed set of options but it's a bit hard to tell which ones are safe from that issue.I'm also working on some variant of direction update for manifolds with corners inspired by L-BFGS-B. There are different approaches with elaborate line searches and update matrices but I'm aiming for something relatively simple and generic. I'm not aiming at competing with Fortran codes, my goal is making combined manifold-box constrained optimization work relatively well.
The text was updated successfully, but these errors were encountered: