Cupy backpropagation error #50

tangzhenjie · 2023-12-20T03:47:23Z

I encountered this problem when using cupy for backpropagation. I don’t know what’s going on? There is no problem in the forward direction, but in the reverse direction, there is a dimension mismatch.

IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)

code location:
mgbx = mgradbselect * xselect
gradA = torch.sum(mgbx, dim=1)

tvercaut · 2023-12-20T08:43:13Z

Can you provide a minimal reproducible example?

tangzhenjie · 2023-12-20T12:53:40Z

I wrote a sample code that will cause this problem.

import torch
import torch.optim as optim
import torchsparsegradutils.cupy.cupy_sparse_solve as cupy_solve


A = torch.randn(12, 12, requires_grad=True)
b = torch.randn(12, requires_grad=True)

target = torch.randn(12, 1, requires_grad=True)

learning_rate = 0.05
optimizer = optim.SGD([A, b], lr=learning_rate)

for i in range(100):

    x = torch.unsqueeze(cupy_solve.sparse_solve_c4t(A.to_sparse(), b), 1)

    loss = torch.mean(target - x * 10)

    optimizer.zero_grad()

    loss.backward()

    optimizer.step()

tvercaut · 2023-12-20T21:08:10Z

The issue is that we currently expect b to be a matrix in the backward pass. This is not cupy related.

You can workaround the issue by making b a 4x1 matrix (i.e. unsqueeze it). I'll open up a specific issue for handling vectors rhs in teh sparse solver routines.

tangzhenjie · 2023-12-21T02:06:28Z

Thank you very much for your prompt answer, but I changed the code to the one below and the same problem still occurs.

import torch
import torchsparsegradutils.cupy.cupy_sparse_solve as cupy_solve

A = torch.randn(12, 12, requires_grad=True).to_sparse()
b = torch.randn(12, 1, requires_grad=True)

x = cupy_solve.sparse_solve_c4t(A, b)
loss = x.sum()
loss.backward()

tvercaut · 2023-12-21T10:58:27Z

Indeed, there was another similar issue in the cupy solver. This should now be fixed albeit with missing unit tests:
https://colab.research.google.com/drive/1vZ23g_tamJPFGCoeV9RbKB5ObEMfkkv7?usp=sharing

Completion of unit tests will be tracked in #51

tvercaut closed this as completed Dec 20, 2023

tvercaut mentioned this issue Dec 20, 2023

Handle vector RHS in sparse solver routine #51

Open

tvercaut added a commit that referenced this issue Dec 21, 2023

Fix for #50 and #51 - Unit tests are pending

8081161

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cupy backpropagation error #50

Cupy backpropagation error #50

tangzhenjie commented Dec 20, 2023 •

edited

Loading

tvercaut commented Dec 20, 2023 •

edited

Loading

tangzhenjie commented Dec 20, 2023 •

edited by tvercaut

Loading

tvercaut commented Dec 20, 2023

tangzhenjie commented Dec 21, 2023 •

edited by tvercaut

Loading

tvercaut commented Dec 21, 2023

Cupy backpropagation error #50

Cupy backpropagation error #50

Comments

tangzhenjie commented Dec 20, 2023 • edited Loading

tvercaut commented Dec 20, 2023 • edited Loading

tangzhenjie commented Dec 20, 2023 • edited by tvercaut Loading

tvercaut commented Dec 20, 2023

tangzhenjie commented Dec 21, 2023 • edited by tvercaut Loading

tvercaut commented Dec 21, 2023

tangzhenjie commented Dec 20, 2023 •

edited

Loading

tvercaut commented Dec 20, 2023 •

edited

Loading

tangzhenjie commented Dec 20, 2023 •

edited by tvercaut

Loading

tangzhenjie commented Dec 21, 2023 •

edited by tvercaut

Loading