Quasi-Newton update rules issues #382

mateuszbaran · 2024-04-22T17:58:47Z

I'm currently working on some updates to quasi-Newton direction update rules. Here are a few things to check/consider:

QuasiNewtonCautiousDirectionUpdate doesn't seem to use θ function anywhere.
QuasiNewtonLimitedMemoryDirectionUpdate doesn't seem to use scale.
Some direction updates need a reset mechanism called by initialize_solver!, and maybe also when the direction stops being a descent direction. That shouldn't happen for a well-designed set of options but it's a bit hard to tell which ones are safe from that issue.
Unify projections to a custom function instead of a boolean flag.

I'm also working on some variant of direction update for manifolds with corners inspired by L-BFGS-B. There are different approaches with elaborate line searches and update matrices but I'm aiming for something relatively simple and generic. I'm not aiming at competing with Fortran codes, my goal is making combined manifold-box constrained optimization work relatively well.

The text was updated successfully, but these errors were encountered:

kellertuer · 2024-04-24T08:05:05Z

I started looking a bit into these points.

Ad 1) The function is used when updating the Hessian at

Manopt.jl/src/solvers/quasi_Newton.jl

Line 677 in 1856a26

bound = d.θ(norm(M, p, X))

or

Manopt.jl/src/solvers/quasi_Newton.jl

Line 738 in 1856a26

bound = d.θ(norm(M, p_old, get_gradient(mp, p_old)))

Ad 2) That might indeed be the case, I could only find the scale in the update of the full-matrix variants.

Manopt.jl/src/solvers/quasi_Newton.jl

Lines 514 to 516 in 1856a26

    
           if iter == 1 && d.scale == true 
        
               d.matrix = skyk_c / inner(M, p, st.sk, st.sk) * d.matrix 
        
           end

I am not sure where the scaling was “lost” in one of the reworks, but we can surely bring that back

Ad 3) Sure, I am not 100% sure how to realise that, but when we have the iterate available doing a reset when that is zero sounds reasonable. When we wrote this code, reusing the state (and hence the updates) was not so much thought given to, for sure.

I also noticed that one of the recent updates that meant to introduce a stability idea did

Manopt.jl/src/plans/quasi_newton_plan.jl

Line 500 in 1856a26

project::Bool=true,

and

Manopt.jl/src/plans/quasi_newton_plan.jl

Line 582 in 1856a26

d.project && embed_project!(M, r, p, r)

which we basically have used before but then a bit more generic (not just a boolean)

Manopt.jl/src/solvers/truncated_conjugate_gradient_descent.jl

Line 18 in 1856a26

    
           * `project!`:            (`copyto!`) for numerical stability it is possible to project onto

and then

Manopt.jl/src/solvers/truncated_conjugate_gradient_descent.jl

Line 741 in 1856a26

tcgs.project!(M, tcgs.δ, p, tcgs.δ)

which might be a bit nicer to allow also for other ways to care for stability.
We should both unify this and probably also switch the second one to embed_project! (since that default is older than the embed/project discussion in Manifolds.jl).

mateuszbaran · 2024-04-24T21:40:44Z

Ad 1) The function is used when updating the Hessian at

I see, thanks.

Ad 3) Sure, I am not 100% sure how to realise that, but when we have the iterate available doing a reset when that is zero sounds reasonable. When we wrote this code, reusing the state (and hence the updates) was not so much thought given to, for sure.

I have it sketched already so I will only ask for a review when it's ready 🙂

We should both unify this and probably also switch the second one to embed_project! (since that default is older than the embed/project discussion in Manifolds.jl).

Yes, unifying that is a good idea.

kellertuer · 2024-04-25T05:05:00Z

If you see how to correctly add the scaling ( and unifying that to be a real value), feel free to add that as well. I am sure it was from one of the Wuang papers about QN.

kellertuer · 2024-08-15T10:11:26Z

I think the last point mentioned here should be addressed now in #392, since I removed stabilize and introduced project!.

edit: or to be precise it is described at

Manopt.jl/Changelog.md

Lines 35 to 40 in 649bc7e

    
           * changed the `stabilize::Bool=` keyword in `quasi_Newton` to the more flexible `project!=` 
        
             keyword, this is also more in line with the other solvers. Internally the same is done 
        
             within the `QuasiNewtonLimitedMemoryDirectionUpdate`. To adapt, 
        
             * the previous `stabilize=true` is now set with `(project!)=embed_project!` in general, 
        
               and if the manifold is represented by points in the embedding, like the sphere, `(project!)=project!` suffices 
        
             * the new default is `(project!)=copyto!`, so by default no projection/stabilization is performed.

kellertuer · 2024-08-15T15:37:49Z

I also checked for the scaling, if you compare the part/scale between the backward and forward pass in

https://github.com/NicolasBoumal/manopt/blob/3d742b5a4755bb1f256fba1d2dbe6de17b9e5c2f/manopt/solvers/bfgs/rlbfgs.m#L531

the scaling factor should be added in this line

Manopt.jl/src/plans/quasi_newton_plan.jl

Line 596 in c8564b8

r ./= d.ρ[last_safe_index] * norm(M, p, d.memory_y[last_safe_index])^2

so we could/should write

r  .*= d.scale/ (d.ρ[last_safe_index] * norm(M, p, d.memory_y[last_safe_index])^2)

I can add that to the PR we are doing as well, since we would change it to a real as well in that step, of course.
Ah and it is not 100% the same as in the Matlab version, since there they have a function that does the gradnorm idea as we do already as well. So our scale it an additional scaling factor one can specify.

kellertuer · 2024-08-15T15:43:18Z

Oh and the full one also has a bool we would change to a real and then scale basically at

Manopt.jl/src/plans/quasi_newton_plan.jl

Line 434 in c8564b8

copyto!(d.matrix, I)

as well.

mateuszbaran · 2024-08-15T15:44:55Z

Nice, so that PR would completely resolve this 👍

kellertuer · 2024-08-15T15:59:25Z

Yes, I can do that tomorrow after my oral exams.

* trigger all ambiguity errors. * Move dispatch on p into subfunctions, that existed before anyways. * rework the safeguards to dispatch internally. * remove deprecated definitions. * remove all deprecated parameters. * rename set_manopt_parameter! to set_parameter and get_manopt_parameter to get_parameter * unify stabilisation through projection keyword. * remove `update_stopping_criterion` to update values in the stopping criterion and submerge it in the set_parameter! scheme * unify state constructors of ALM and ARC * Adapt EPM. * Finish DoC and DCPPA * Unify how p and X are passed to states overall in all states. * Sketch a first idea of a factory. * Fix scaling parameter in quasi newton. This resolvs #382 * Start changing the docs snippets to using a documentation glossary * 📚Finish the glossary work * Unify signatures, fix a few tex typos and improve english on the factory info/note. --------- Co-authored-by: Hajg Jasa <[email protected]> Co-authored-by: Bagaev Dmitry <[email protected]> Co-authored-by: Mateusz Baran <[email protected]>

mateuszbaran added the enhancement label Apr 22, 2024

mateuszbaran mentioned this issue May 9, 2024

:reinitialize_direction_update for quasi-Newton #388

Merged

kellertuer added a commit that referenced this issue Aug 16, 2024

Fix scaling parameter in quasi newton. This resolvs #382

bb574c2

kellertuer mentioned this issue Aug 16, 2024

Rework high level interfaces #392

Merged

27 tasks

kellertuer linked a pull request Aug 16, 2024 that will close this issue

Rework high level interfaces #392

Merged

27 tasks

kellertuer closed this as completed in #392 Aug 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quasi-Newton update rules issues #382

Quasi-Newton update rules issues #382

mateuszbaran commented Apr 22, 2024 •

edited

Loading

kellertuer commented Apr 24, 2024

mateuszbaran commented Apr 24, 2024

kellertuer commented Apr 25, 2024

kellertuer commented Aug 15, 2024 •

edited

Loading

kellertuer commented Aug 15, 2024 •

edited

Loading

kellertuer commented Aug 15, 2024

mateuszbaran commented Aug 15, 2024

kellertuer commented Aug 15, 2024

Quasi-Newton update rules issues #382

Quasi-Newton update rules issues #382

Comments

mateuszbaran commented Apr 22, 2024 • edited Loading

kellertuer commented Apr 24, 2024

mateuszbaran commented Apr 24, 2024

kellertuer commented Apr 25, 2024

kellertuer commented Aug 15, 2024 • edited Loading

kellertuer commented Aug 15, 2024 • edited Loading

kellertuer commented Aug 15, 2024

mateuszbaran commented Aug 15, 2024

kellertuer commented Aug 15, 2024

mateuszbaran commented Apr 22, 2024 •

edited

Loading

kellertuer commented Aug 15, 2024 •

edited

Loading

kellertuer commented Aug 15, 2024 •

edited

Loading