LigerFusedLinearCrossEntropyFunction does not support reduction=None #488

Xiang-cd · 2024-12-18T17:03:51Z

🐛 Describe the bug

loss = LigerFusedLinearCrossEntropyLoss(reduction='none')(model.lm_head.weight, flattened_hidden_states, flattened_target)

returns loss with shape []:
tensor(209594.4062, device='cuda:7',
grad_fn=)

reduction is actually perfromed

Reproduce

from liger_kernel.transformers import LigerCrossEntropyLoss, LigerFusedLinearCrossEntropyLoss
device3 = 'cuda'
weight = torch.randn((180000, 4096), device=device3, dtype=torch.float32)
fhidden_states = torch.randn((20, 4096), device=device3, dtype=torch.float32)
ftarget = torch.ones((20,), device=device3, dtype=torch.long)
loss = LigerFusedLinearCrossEntropyLoss(reduction='none')(weight, fhidden_states, ftarget)
print(loss)

Versions

Environment Report:

Operating System: Linux-5.4.0-135-generic-x86_64-with-glibc2.31
Python version: 3.10.14
Liger Kernel version: 0.5.2
PyTorch version: 2.5.1+cu124
CUDA version: 12.4
HIP(ROCm) version: Not available
Triton version: 3.1.0
Transformers version: 4.44.0
XPU version: XPU Not Available

Tcc0403 · 2024-12-21T01:15:24Z

Liger-Kernel/src/liger_kernel/ops/fused_linear_cross_entropy.py

Line 136 in 15a2f58

loss = torch.sum(loss_1d)

It should be easily fixed by removing torch.sum() if reduction is "none". Similar to what LigerCrossEntropy does.

Tcc0403 added the good first issue Good for newcomers label Dec 21, 2024

ryankert01 linked a pull request Dec 21, 2024 that will close this issue

Fix/liger fused linear cross entropy function does not support reduction=none #496

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LigerFusedLinearCrossEntropyFunction does not support reduction=None #488

LigerFusedLinearCrossEntropyFunction does not support reduction=None #488

Xiang-cd commented Dec 18, 2024

Tcc0403 commented Dec 21, 2024 •

edited

Loading

LigerFusedLinearCrossEntropyFunction does not support reduction=None #488

LigerFusedLinearCrossEntropyFunction does not support reduction=None #488

Comments

Xiang-cd commented Dec 18, 2024

🐛 Describe the bug

Reproduce

Versions

Environment Report:

Tcc0403 commented Dec 21, 2024 • edited Loading

Tcc0403 commented Dec 21, 2024 •

edited

Loading