You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug/ 问题描述 (Mandatory / 必填)
在使用XLMRobertaModel族模型bge-reranker-base出现输出全为nan,具体来说,bge-reranker-base在前向传播第12层过attention层的时候出现了一个-nan导致后续的值全部为-nan,同样使用XLMRobertaModel的embedding模型也同样有这个错误,Ascend设备上没有这个bug。
Hardware Environment(Ascend/GPU/CPU) / 硬件环境:
CPU
Software Environment / 软件环境 (Mandatory / 必填):
-- MindSpore version (e.g., 1.7.0.Bxxx) :2.4.0
-- Python version (e.g., Python 3.7.5) :3.10
-- OS platform and distribution (e.g., Linux Ubuntu 16.04):Windows
-- GCC/Compiler version (if compiled from source):
To Reproduce / 重现步骤 (Mandatory / 必填)
sentence中字符串的长度大于20就出现上述错误
Describe the bug/ 问题描述 (Mandatory / 必填)
在使用XLMRobertaModel族模型bge-reranker-base出现输出全为nan,具体来说,bge-reranker-base在前向传播第12层过attention层的时候出现了一个-nan导致后续的值全部为-nan,同样使用XLMRobertaModel的embedding模型也同样有这个错误,Ascend设备上没有这个bug。
Hardware Environment(
Ascend
/GPU
/CPU
) / 硬件环境:CPU
Software Environment / 软件环境 (Mandatory / 必填):
-- MindSpore version (e.g., 1.7.0.Bxxx) :2.4.0
-- Python version (e.g., Python 3.7.5) :3.10
-- OS platform and distribution (e.g., Linux Ubuntu 16.04):Windows
-- GCC/Compiler version (if compiled from source):
To Reproduce / 重现步骤 (Mandatory / 必填)
sentence中字符串的长度大于20就出现上述错误
Expected behavior / 预期结果 (Mandatory / 必填)
正确输出
Screenshots/ 日志 / 截图 (Mandatory / 必填)
If applicable, add screenshots to help explain your problem.
Additional context / 备注 (Optional / 选填)
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: