DifferentiationInterface testing #2469

gdalle · 2024-07-15T09:59:41Z

Hi there!
I'm heading towards multi-argument and non-array support in DI, and I'd like to start testing Lux layers. For this I would need two things:

Suggestions for a test suite of layers so that we hit some corner cases
Your definition of what it means to be "the right gradient" (in other words, a recursive comparison function between a given gradient and the reference output).

Do you think you could help me out?

CarloLucibello · 2024-07-15T14:53:04Z

You can take a look at the tests we added for Enzyme

https://github.com/FluxML/Flux.jl/blob/master/test/ext_enzyme/enzyme.jl

e.g. begin with

using Flux

x = rand(Float32, 2, 1)
model = Chain(Dense(2=>3, relu), Dense(3=>2))
g = gradient(model -> sum(model(x)), model)[1]

We impose few limitations on gradients: they can be nested structs or named structs.
For instance, the ones returned by Enzyme and the ones returned by Zygote are compared by

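using Test
using Functors: fmap_with_path

# Compare two gradients leaf by leaf; leaves are matched by their key path.
function test_grad(g1, g2; broken=false)
    fmap_with_path(g1, g2) do kp, x, y
        :state ∈ kp && return # ignore RNN and LSTM state
        if x isa AbstractArray{<:Number}
            # @show kp
            @test x ≈ y rtol=1e-2 atol=1e-6 broken=broken
        end
        return x
    end
end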

where fmap_with_path is defined in Functors.jl. So what we need is a gradient for each numerical array leaf in the original object. These leaves should be reachable through the same "path", e.g. g.layers[1].weight.
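
The "same path" requirement can be illustrated without Flux or Functors. The sketch below is only a stand-in for the helper above, not the Flux test code: `compare_leaves` is a hypothetical function that walks two nested NamedTuples/Tuples in lockstep and records, for each numeric array leaf, its path and whether the two values agree under the same tolerances. It assumes both gradients have identical structure and contain no `nothing` entries or custom structs.

```julia
using LinearAlgebra  # array `isapprox` compares norms

# Hypothetical helper: collect (path => matches) for every numeric array leaf.
function compare_leaves(g1, g2; rtol=1e-2, atol=1e-6, path=(), out=Any[])
    if g1 isa AbstractArray{<:Number}
        push!(out, path => isapprox(g1, g2; rtol, atol))
    elseif g1 isa Union{NamedTuple,Tuple}
        for k in keys(g1)
            # Extend the path with the current field name or index.
            compare_leaves(g1[k], g2[k]; rtol, atol, path=(path..., k), out)
        end
    end
    return out
end

# Made-up gradients with the shape g.layers[1].weight / g.layers[1].bias.
g_ref  = (layers = ((weight = Float32[1 2; 3 4], bias = Float32[0, 1]),),)
g_test = (layers = ((weight = Float32[1 2; 3 4.001], bias = Float32[0, 1]),),)

results = compare_leaves(g_ref, g_test)
for (p, ok) in results
    println(p, " matches: ", ok)
end
```

Each recorded path plays the role of `kp` in `test_grad`, so a failing leaf can be reported by name rather than by position.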

gdalle · 2024-07-19T06:30:06Z

I'm having issues when comparing the true gradients with finite differences. Depending on the random seed I get unpredictable failures. Is that a problem in the Flux test suite as well @CarloLucibello? I didn't find a way to pass an rng to the network constructors, do I have to seed! the global rng?
For now I have increased atol and rtol but it's hard to know the right threshold.
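
One way to get a feel for a reasonable threshold is to measure the finite-difference error on a function whose gradient is known exactly. The following is a rough sketch under stated assumptions, not what DifferentiationInterface does internally: `fd_gradient` is a hypothetical central-difference helper, the step `cbrt(eps(T))` is a common rule of thumb, and the test function and input are made up.

```julia
using LinearAlgebra

# Hypothetical helper: central finite-difference gradient of scalar f at x.
function fd_gradient(f, x::AbstractVector{T}; h::T=cbrt(eps(T))) where {T<:AbstractFloat}
    g = similar(x)
    for i in eachindex(x)
        xp = copy(x); xp[i] += h   # step forward along coordinate i
        xm = copy(x); xm[i] -= h   # step backward along coordinate i
        g[i] = (f(xp) - f(xm)) / (2h)
    end
    return g
end

f(x) = sum(abs2, x)        # analytic gradient is 2x
x64 = [0.3, -1.2, 0.7]
x32 = Float32.(x64)

# Absolute error of the finite-difference gradient in each precision.
err64 = norm(fd_gradient(f, x64) - 2x64)
err32 = norm(fd_gradient(f, x32) - 2x32)
println("Float64 error: ", err64)
println("Float32 error: ", err32)
```

With `Float32` inputs, as in the Flux snippets above, the error is typically several orders of magnitude larger than in `Float64`, which is one plausible reason a loose `rtol` is needed.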

CarloLucibello · 2024-07-19T07:36:04Z

We do Random.seed!(0) in runtests.jl and we don't see test failures, but I would have expected the tests to be robust. Can you identify the frail ones? Maybe the ones with RNNs?
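
As a minimal sketch of what seeding buys, using only the `Random` standard library: reseeding the global RNG before each draw reproduces the same values, and an explicit RNG object does the same without touching global state. The array shapes here are arbitrary.

```julia
using Random

# Reseeding the global RNG gives identical draws.
Random.seed!(0)
a = rand(Float32, 2, 1)
Random.seed!(0)
b = rand(Float32, 2, 1)
println(a == b)

# An explicit RNG is reproducible too and leaves the global RNG untouched.
c = rand(MersenneTwister(0), Float32, 3, 2)
d = rand(MersenneTwister(0), Float32, 3, 2)
println(c == d)
```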

gdalle · 2024-07-19T07:45:07Z

I'll try! Which backends should I aim to test? Zygote, Enzyme and Tracker?

CarloLucibello · 2024-07-19T07:53:05Z

We don't support Tracker anymore. Primarily Zygote, and experimentally Enzyme.

gdalle · 2024-07-19T08:27:01Z

I added a random seed in gdalle/DifferentiationInterface.jl#371, and tests seem to pass for Zygote with the same tolerances as yours. I'll notify you if I see random failures further down the road.

Any idea why Enzyme fails on two scenarios only (see the PR for details)?

gdalle mentioned this issue Jul 15, 2024

Testing NNLib / Lux / Flux gdalle/DifferentiationInterface.jl#105

Open

avik-pal mentioned this issue Jul 16, 2024

DifferentiationInterface testing LuxDL/Lux.jl#769

Open

gdalle mentioned this issue Jul 16, 2024

First test scenarios for Flux gradients gdalle/DifferentiationInterface.jl#352

Merged

gdalle mentioned this issue Jul 19, 2024

Debug Flux tests gdalle/DifferentiationInterface.jl#371

Merged
