Issues: hiyouga/LLaMA-Factory
Issues list
[help] GPU memory suddenly spikes mid-training, causing OOM
pending
This problem is yet to be addressed
#5490
opened Sep 20, 2024 by
RRRRRayyyyy
1 task done
Is LLAVA chat template correct?
pending
This problem is yet to be addressed
#5489
opened Sep 20, 2024 by
mibejjh
1 task done
Running on machines with limited number of online programs
pending
This problem is yet to be addressed
#5488
opened Sep 19, 2024 by
moshushi007ow
1 task done
Failed to launch the webui
pending
This problem is yet to be addressed
#5485
opened Sep 19, 2024 by
ClementeGao
1 task done
Is there anything to watch out for in DPO training? My trained model performs very poorly.
pending
This problem is yet to be addressed
#5484
opened Sep 19, 2024 by
zlh-source
1 task done
When template is set to empty during training, <|EOT|> is prepended to the label; earlier versions did not seem to do this
pending
This problem is yet to be addressed
#5474
opened Sep 18, 2024 by
haoranjun
Error during training when full-parameter fine-tuning only the visual.merger part of Qwen2-VL-7B-Instruct with all other model parameters frozen
pending
This problem is yet to be addressed
#5472
opened Sep 18, 2024 by
wjx-sudo
1 task done
sft do_predict: the labels in the generated JSON file are all empty
pending
This problem is yet to be addressed
#5465
opened Sep 18, 2024 by
dayuyang1999
1 task done
AttributeError: 'Qwen2Attention' object has no attribute 'max_position_embeddings'
pending
This problem is yet to be addressed
#5461
opened Sep 17, 2024 by
chengchengpei
1 task done
Tips for implementing LLaMA-Factory on new hardware
pending
This problem is yet to be addressed
#5460
opened Sep 17, 2024 by
EtashGuha
No such file or directory for data
pending
This problem is yet to be addressed
#5457
opened Sep 17, 2024 by
Esmail-ibraheem
1 task done
max pixels argument
pending
This problem is yet to be addressed
#5456
opened Sep 17, 2024 by
sharonsalabiglossai
1 task done
Error when running examples/train_lora/llama3_lora_predict.yaml on a fine-tuned GLM-4-9B-Chat
pending
This problem is yet to be addressed
#5447
opened Sep 16, 2024 by
Twilightsh
1 task done
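model.generate parameters set in the YAML have no effect: I set do_sample: false, but the profiler shows it is actually still true. This only happens during mid-training eval; the final eval at the end of training is normal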
pending
This problem is yet to be addressed
#5444
opened Sep 15, 2024 by
aliencaocao
1 task done
"Running tokenizer on dataset" gradually slows down
pending
This problem is yet to be addressed
#5443
opened Sep 15, 2024 by
xuyue1112
1 task done
Inference with a model fine-tuned using bitsandbytes QLoRA
pending
This problem is yet to be addressed
#5442
opened Sep 15, 2024 by
oulin1031esti
help on understanding the implementation of FSDP.
pending
This problem is yet to be addressed
#5441
opened Sep 15, 2024 by
jq-wei
How to use beam search with an OpenAI-style deployment
pending
This problem is yet to be addressed
#5440
opened Sep 15, 2024 by
cat-knight
1 task done
What to do when the vocabulary size is inconsistent after fine-tuning
pending
This problem is yet to be addressed
#5436
opened Sep 14, 2024 by
topology1
1 task done
Gemma 2 + unsloth + fa2 full SFT RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
pending
This problem is yet to be addressed
#5435
opened Sep 13, 2024 by
hengdos
1 task done
Does llamafactory currently support model evaluation on Ascend 910?
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#5434
opened Sep 13, 2024 by
yiyayieryo
1 task done
Latest LLaMA-Factory repo forces Torch 2.4 and hence clashes with Unsloth/XFormers
pending
This problem is yet to be addressed
#5431
opened Sep 13, 2024 by
thusinh1969
1 task done