Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open

Issues list

[rank0]: RuntimeError: tensor does not have a device pending This problem is yet to be addressed
#6454 opened Dec 26, 2024 by Juvenilecris
1 task done
Does the Huawei Ascend NPU support QLoRA training? npu This problem is related to NPU devices pending This problem is yet to be addressed
#6452 opened Dec 26, 2024 by sunxiaoyu12
1 task done
Single-machine multi-card fine-tuning: Signal 11 (SIGSEGV) pending This problem is yet to be addressed
#6450 opened Dec 26, 2024 by luhan1999
1 task done
Validation loss is far lower than training loss, only about 0.08 pending This problem is yet to be addressed
#6448 opened Dec 25, 2024 by yzd11
DeepSpeed support for YAML config files pending This problem is yet to be addressed
#6445 opened Dec 25, 2024 by randydl
Problem with LoRA fine-tuning of Mamba-Codestral-7B-v0.1 pending This problem is yet to be addressed
#6434 opened Dec 24, 2024 by tongzeliang
1 task done
Cambricon: can we support Cambricon? pending This problem is yet to be addressed
#6429 opened Dec 24, 2024 by y149604146
1 task done
Training on Ascend NPU 910B3 with the DeepSpeed engine. Q1: the NPU is not used; Q2: does NPU health status affect training? npu This problem is related to NPU devices pending This problem is yet to be addressed
#6428 opened Dec 24, 2024 by Lexlum
1 task done
Can the reward model be a user-defined function instead of a model? pending This problem is yet to be addressed
#6423 opened Dec 23, 2024 by cdhx
1 task done
Questions about PPO training pending This problem is yet to be addressed
#6419 opened Dec 22, 2024 by ccp123456789
Tokenizer does not derive the newer config pending This problem is yet to be addressed
#6415 opened Dec 21, 2024 by xiaosu-zhu
1 task done
Questions about resuming training from ckpt pending This problem is yet to be addressed
#6414 opened Dec 21, 2024 by Jiawei-Guo
1 task done
Why Speed per iteration slower when dataset is large pending This problem is yet to be addressed
#6410 opened Dec 20, 2024 by coding2debug
1 task done
sft have bug while lora run successfully pending This problem is yet to be addressed
#6405 opened Dec 20, 2024 by TimeFlysLeo
1 task done
How to reproduce the paper results? pending This problem is yet to be addressed
#6387 opened Dec 19, 2024 by StiphyJay
1 task done
Unexpected problems in LLaMA-Factory chat pending This problem is yet to be addressed
#6386 opened Dec 19, 2024 by 3237522375
1 task done
How do I plug my trained reward model into the PPO pipeline? pending This problem is yet to be addressed
#6385 opened Dec 19, 2024 by chcoo
1 task done
LLava Series (7B, 14B) freeze_vision_tower=false bug pending This problem is yet to be addressed
#6376 opened Dec 18, 2024 by xirui-li
1 task done
Multi-node training with ZeRO-3 is very slow pending This problem is yet to be addressed
#6372 opened Dec 18, 2024 by HelloWorld506
1 task done
Error when loading qwen2-vl-7b for chat in the web UI pending This problem is yet to be addressed
#6371 opened Dec 18, 2024 by laoqiongsuan
1 task done
Can you support fast resume with streaming option? pending This problem is yet to be addressed
#6352 opened Dec 16, 2024 by JonghwanMun
1 task done
Support phi-4 released by msft on 2024-12-16 pending This problem is yet to be addressed
#6346 opened Dec 16, 2024 by yx-lamini
1 task done