test: fix OPT_STEP_ADAMW for test-backend-ops #974

JohannesGaessler · 2024-09-30T07:46:06Z

Fixup for #966 .

When I tested GGML_OP_OPT_STEP_ADAMW I had mistyped the filter for test-backend-ops so I didn't notice that the test is broken. The problem is that the gradients for tensors are no longer being allocated unless a backward graph is constructed. This can simply be fixed by explicitly creating a tensor the gradients. Also I'm changing the interface for ggml_opt_step_adamw to accept a gradient tensor since the long term goal for ggml_tensor.grad is to remove it.

test: fix OPT_STEP_ADAMW for test-backend-ops

41f35a5

JohannesGaessler mentioned this pull request Sep 30, 2024

ggml: fix gradient allocation logic #966

Merged

ggerganov approved these changes Sep 30, 2024

View reviewed changes

JohannesGaessler merged commit 4de6ee8 into ggerganov:master Sep 30, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: fix OPT_STEP_ADAMW for test-backend-ops #974

test: fix OPT_STEP_ADAMW for test-backend-ops #974

JohannesGaessler commented Sep 30, 2024

test: fix OPT_STEP_ADAMW for test-backend-ops #974

test: fix OPT_STEP_ADAMW for test-backend-ops #974

Conversation

JohannesGaessler commented Sep 30, 2024