Better error messages for BatchNorm #240

Open · wants to merge 4 commits into master
Conversation

fKunstner (Collaborator)

Makes the BatchNorm error message more explicit to avoid confusion (#239) and adds an option to ignore the exception.

Summary of changes:

  • Make the BatchNorm error message more explicit
  • Tie whether BatchNorm raises an exception or a warning to fail_mode
  • Split error handling for missing extensions between first and second order
    The current error handling for missing modules mixed second-order extensions (which should fail if no extension is defined) and first-order extensions (which should fail if no extension is defined and the module has parameters). Moved the handling to the first-order and second-order base classes.
  • Change the default fail_mode to error and make fail_mode user-accessible
    First-order extensions did not expose fail_mode and defaulted to warn.
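The fail_mode dispatch described above could look roughly like this. This is a hypothetical, stdlib-only sketch: the names `handle_unsupported` and `NotSupportedError` and the message text are illustrative, not BackPACK's actual implementation.

```python
import warnings


class NotSupportedError(RuntimeError):
    """Raised when an extension meets a module it cannot handle (hypothetical name)."""


def handle_unsupported(module_name, fail_mode="ERROR"):
    """Raise or warn about an unsupported module, depending on fail_mode.

    fail_mode="ERROR" raises; fail_mode="WARNING" emits a warning and
    lets the computation continue at the user's own risk.
    """
    message = (
        f"Extension saw a {module_name} module, which is not supported. "
        "Results may not be what you expect."
    )
    if fail_mode == "ERROR":
        raise NotSupportedError(message)
    elif fail_mode == "WARNING":
        warnings.warn(message)
    else:
        raise ValueError(f"Unknown fail_mode: {fail_mode!r}")
```

With this shape, the default behavior is strict, and a user who knowingly wants to proceed past a BatchNorm layer opts in explicitly by passing the warning mode.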

@f-dangel (Owner)

Hey Fred, I skimmed through your changes:

  1. The failing test checks whether the result of batch_grad for a BN layer in train mode sums to grad. Working with batch_grad is 'okay' in this case because it's not interpreted as per-sample gradients. We could either revert the default first-order fail mode, or adapt the test to use BatchGrad(fail_mode="WARNING"). I would currently favor reverting the default (as this also does not trigger a version bump, and fixes 2.).
  2. The RTD example with the custom ResNet fails for similar reasons as in 1.
  3. Can you pip install --upgrade black && make black to update the formatting?

Happy to review or discuss!

@fKunstner (Collaborator, Author)

Thanks for the check!

I'd lean more towards crashing than warning, but to get to something we can 👍 on, how about starting from this setup:

  • Revert the default to fail_mode = "WARNING" for first-order extensions
  • Change the error message to "use at your own risk"
  • Add a notice that this is not supported and might throw an error in a future version?

The failing test checks if the result in batch_grad for a BN layer in train mode sums to grad.
Working with batch_grad is 'okay' in this case because it's not interpreted as per-sample gradients.

I don't follow the "batch_grad is okay". Do you mean in the context of the tests? If so I agree that BatchGrad should sum to Grad with or without batchnorm. But I don't think this should be the default behavior of the user-facing API. Someone calling batch_grad is expecting individual gradients and should get an error (maybe a strong warning works as well).
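The invariance the test relies on can be demonstrated without BackPACK: for a loss that sums per-sample terms, the individual gradients must add up to the full gradient, whether or not the per-sample quantities are meaningful on their own. A minimal plain-Python sketch with analytic gradients of a 1-d least-squares loss (function names are illustrative, not part of any library):

```python
def per_sample_grads(w, xs, ys):
    """Analytic per-sample gradients of L = 0.5 * sum_i (w*x_i - y_i)**2 w.r.t. w."""
    return [(w * x - y) * x for x, y in zip(xs, ys)]


def full_grad(w, xs, ys):
    """Gradient of the summed loss, computed in one pass."""
    return sum((w * x - y) * x for x, y in zip(xs, ys))


xs, ys, w = [1.0, 2.0, 3.0], [2.0, 3.0, 5.0], 0.5
batch_grad = per_sample_grads(w, xs, ys)

# The sum-to-grad property the test asserts:
assert abs(sum(batch_grad) - full_grad(w, xs, ys)) < 1e-12
```

The sum is correct by linearity of the gradient, which is why a test asserting only this property can pass even when the individual entries of batch_grad are not true per-sample gradients (as with BatchNorm, where samples interact through the batch statistics).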

Can you pip install --upgrade black && make black to update the formatting?

The files that black complains about are not part of this PR(?). I'll merge main into it again.
