Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

昇腾910B npu8卡训练显存不足
#5491 opened Sep 20, 2024 by LtroiNGU
1 task done
【help】训练中途突然显存暴涨导致OOM pending This problem is yet to be addressed
#5490 opened Sep 20, 2024 by RRRRRayyyyy
1 task done
Is LLAVA chat template correct? pending This problem is yet to be addressed
#5489 opened Sep 20, 2024 by mibejjh
1 task done
Running on machines with limited number of online programs pending This problem is yet to be addressed
#5488 opened Sep 19, 2024 by moshushi007ow
1 task done
启动 webui失败 pending This problem is yet to be addressed
#5485 opened Sep 19, 2024 by ClementeGao
1 task done
请问DPO训练的时候有什么注意事项吗?我训练出来效果很差。 pending This problem is yet to be addressed
#5484 opened Sep 19, 2024 by zlh-source
1 task done
sft do_predict, 生成的json 文件 的 label 都是空 pending This problem is yet to be addressed
#5465 opened Sep 18, 2024 by dayuyang1999
1 task done
qwen2_vl模型训练异常 pending This problem is yet to be addressed
#5462 opened Sep 18, 2024 by will-wiki
AttributeError: 'Qwen2Attention' object has no attribute 'max_position_embeddings' pending This problem is yet to be addressed
#5461 opened Sep 17, 2024 by chengchengpei
1 task done
Tips for implementing LlaMa-Factory for new Hardwares pending This problem is yet to be addressed
#5460 opened Sep 17, 2024 by EtashGuha
no such a file or directory of data pending This problem is yet to be addressed
#5457 opened Sep 17, 2024 by Esmail-ibraheem
1 task done
max pixels argument pending This problem is yet to be addressed
#5456 opened Sep 17, 2024 by sharonsalabiglossai
1 task done
多机多卡运行报错 pending This problem is yet to be addressed
#5450 opened Sep 16, 2024 by hecheng64
1 task done
对微调后的GLM-4-9B-Chat运行examples/train_lora/llama3_lora_predict.yaml出错 pending This problem is yet to be addressed
#5447 opened Sep 16, 2024 by Twilightsh
1 task done
Running tokenizer on dataset 速度逐渐变慢 pending This problem is yet to be addressed
#5443 opened Sep 15, 2024 by xuyue1112
1 task done
bitsandbytes qlora微调模型推理 pending This problem is yet to be addressed
#5442 opened Sep 15, 2024 by oulin1031esti
help on understanding the implementation of FSDP. pending This problem is yet to be addressed
#5441 opened Sep 15, 2024 by jq-wei
如何在 使用 openai 风格 部署时,使用 beam search pending This problem is yet to be addressed
#5440 opened Sep 15, 2024 by cat-knight
1 task done
微调后词表长度不一致怎么办 pending This problem is yet to be addressed
#5436 opened Sep 14, 2024 by topology1
1 task done
请问,llamafactory现在支持在昇腾910上进行模型评估嘛? npu This problem is related to NPU devices pending This problem is yet to be addressed
#5434 opened Sep 13, 2024 by yiyayieryo
1 task done
Latest LLaMA-Factory repo force to use Troch 2.4 hence is clashing with Unsloth/XFormers pending This problem is yet to be addressed
#5431 opened Sep 13, 2024 by thusinh1969
1 task done
ProTip! What’s not been updated in a month: updated:<2024-08-19.