SWIFT 2.4 TO DO LIST #1617
Comments
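Hoping for Megatron support for 01.AI's Yi-1.5 series. Thanks! |
There is also a dataset loading problem in multi-node, multi-GPU training: network fluctuations on the NFS mount prevent the local cache from loading.
Hoping for a more elegant solution. |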
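It would also be good if each dataset could be given a tag on the command line, with the loss computed separately per tag (for example general-dataset loss, code-dataset loss, vertical-domain dataset loss), so that each one can be inspected in TensorBoard.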
|
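To make the per-tag loss request above concrete, here is a minimal sketch, not SWIFT's implementation. It assumes each sample in a batch already carries a dataset tag and an unreduced per-sample loss; the function name `per_tag_mean_loss` and the tag strings are hypothetical.

```python
from collections import defaultdict


def per_tag_mean_loss(losses, tags):
    """Average per-sample losses separately for each dataset tag.

    `losses` and `tags` are parallel sequences: losses[i] is the loss of
    sample i and tags[i] is the (hypothetical) dataset tag it came from.
    """
    if len(losses) != len(tags):
        raise ValueError("losses and tags must have the same length")
    totals = defaultdict(float)
    counts = defaultdict(int)
    for loss, tag in zip(losses, tags):
        totals[tag] += float(loss)
        counts[tag] += 1
    # One mean per tag; tags absent from the batch simply do not appear.
    return {tag: totals[tag] / counts[tag] for tag in totals}


if __name__ == "__main__":
    batch_losses = [1.0, 3.0, 2.0]
    batch_tags = ["general", "general", "code"]
    print(per_tag_mean_loss(batch_losses, batch_tags))
```

Each resulting value could then be written as its own scalar, for instance with `SummaryWriter.add_scalar(f"loss/{tag}", value, step)` from `torch.utils.tensorboard`, so that every tag gets a separate curve.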
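There is also the long-standing DDP+MP issue :) In addition, the log output says MP. Could this evolve into PP? With naive MP the bubble time is far too long. I have never managed to run it successfully on my side, though, so I am not sure whether this has already been optimized. |
The device_map is mainly used to save GPU memory. If you want PP, you can use DeepSpeed. If you want TP, you will probably have to wait for Megatron. |
Got it, thanks! |
Hoping for support for training RM (reward model) models. |
Resolved. |
Please support multi-image and video inference with vllm for qwenvl2 and internvl2. Thanks. |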
Please support PPO! Thanks |
Dataset
Megatron PreTrain
Fine-tuning
RLHF
Multi-modal
Inference&Deployment
WEB-UI