Suggestion: run SFT tests on deepseek-v2-coder-lite #342

Open
bao-xiaoyi opened this issue Sep 12, 2024 · 1 comment

Comments

@bao-xiaoyi

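After SFT training, the generated code often contains obvious syntax errors and erratic, glitchy output. The cause has not been identified yet.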

@jerryli1981
Collaborator

Hello, try adding --reset-attention-mask when fine-tuning.
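
For context, here is a rough sketch of what such a flag typically changes. It assumes the option behaves like the Megatron-LM argument of the same name, which resets the self-attention mask after each end-of-document (EOD) token so that samples packed into one sequence cannot attend to each other. The helper `build_packed_causal_mask`, the `EOD` value, and the token ids are hypothetical and only illustrate the idea; this is not the training framework's actual code.

```python
def build_packed_causal_mask(token_ids, eod_id, reset=True):
    """Return mask[i][j] == True when position i may attend to position j.

    Illustrative only: with reset=True, attention is causal AND restricted
    to the packed sample that position i belongs to.
    """
    # Assign a document index to every position. An EOD token is counted
    # as the last token of its own document (assumption for this sketch).
    doc_ids = []
    doc = 0
    for tok in token_ids:
        doc_ids.append(doc)
        if tok == eod_id:
            doc += 1

    n = len(token_ids)
    mask = []
    for i in range(n):
        row = []
        for j in range(n):
            causal = j <= i
            same_doc = doc_ids[i] == doc_ids[j]
            row.append(causal and (same_doc or not reset))
        mask.append(row)
    return mask


EOD = 0  # hypothetical end-of-document token id
packed = [5, 6, EOD, 7, 8]  # two samples packed into one sequence

mask_reset = build_packed_causal_mask(packed, EOD, reset=True)
mask_plain = build_packed_causal_mask(packed, EOD, reset=False)

for row in mask_reset:
    print("".join("x" if allowed else "." for allowed in row))
```

Without the reset, the first token of the second sample can attend to the whole first sample. If SFT data is packed, that kind of leakage across unrelated samples is one plausible source of degraded generations, though the thread does not confirm that it is the cause here.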
