
warnings when training with gradient_checkpointing=True #46

Open
alekseymalakhov11 opened this issue Oct 23, 2024 · 0 comments · May be fixed by #49

Comments

@alekseymalakhov11 (Collaborator):
/usr/local/lib/python3.10/site-packages/torch/autograd/graph.py:744: UserWarning: c10d::broadcast_: an autograd kernel was not registered to the Autograd key(s) but we are trying to backprop through it. This may lead to silently incorrect behavior. This behavior is deprecated and will be removed in a future version of PyTorch. If your operator is differentiable, please ensure you have registered an autograd kernel to the correct Autograd key (e.g. DispatchKey::Autograd, DispatchKey::CompositeImplicitAutograd). If your operator is not differentiable, or to squash this warning and use the previous behavior, please register torch::CppFunction::makeFallthrough() to DispatchKey::Autograd. (Triggered internally at ../torch/csrc/autograd/autograd_not_implemented_fallback.cpp:63.)
  return Variable._execution_engine.run_backward(  # Calls into the C++ engine to run the backward pass
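For context, here is a minimal sketch of one way this warning can surface. It is an illustrative reproduction under an assumption (that a c10d collective runs inside a checkpointed region), not this repository's actual training code; `forward_with_broadcast` and all other names below are hypothetical. Gradient checkpointing replays the forward pass during backward with grad mode enabled, so the collective dispatches `c10d::broadcast_`, which has no registered autograd kernel, and the fallback warning fires when autograd backprops through it.

```python
# Hypothetical minimal reproduction (assumption: the warning comes from a
# c10d collective executed inside a checkpointed region).
import os
import torch
import torch.distributed as dist
from torch.utils.checkpoint import checkpoint

def forward_with_broadcast(x, weight):
    h = x @ weight            # non-leaf tensor that requires grad
    dist.broadcast(h, src=0)  # dispatches the in-place op c10d::broadcast_,
                              # which has no autograd kernel registered
    return h.sum()

def main():
    # Single-process "distributed" setup, just to create a process group.
    os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
    os.environ.setdefault("MASTER_PORT", "29501")
    dist.init_process_group("gloo", rank=0, world_size=1)

    x = torch.randn(4, 4)
    weight = torch.randn(4, 4, requires_grad=True)

    # With reentrant checkpointing, the forward runs under no_grad and is
    # recomputed with grad enabled inside backward, which is consistent with
    # the stack trace above (the warning fires from run_backward).
    loss = checkpoint(forward_with_broadcast, x, weight, use_reentrant=True)
    loss.backward()  # UserWarning: c10d::broadcast_ ... is emitted here

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```

If that is indeed the path being hit here, a common way to silence the warning is to run the collective where autograd cannot see it, e.g. under `torch.no_grad()` or on a `.detach()`-ed copy, so no autograd node is attached to the broadcast.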

alekseymalakhov11 linked a pull request (#49) on Nov 26, 2024 that will close this issue.