This repository has been archived by the owner on Oct 12, 2023. It is now read-only.

Why can't custom content be loaded, even when following the steps and loading the example dataset? #2

Closed
poppysix opened this issue Apr 14, 2023 · 9 comments
Labels
solved This problem has been already solved.

Comments

@poppysix

(screenshot attached)

@hiyouga
Owner

hiyouga commented Apr 14, 2023

I don't see any detailed error message. Did you fine-tune the model? Please post the fine-tuning log.

@poppysix
Author

There's no error; everything looks normal.
CUDA_VISIBLE_DEVICES=0 python finetune_chatglm.py \
    --do_train \
    --dataset example \
    --finetuning_type lora \
    --output_dir output \
    --per_device_train_batch_size 16 \
    --gradient_accumulation_steps 1 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --save_steps 1000 \
    --max_train_samples 3000 \
    --learning_rate 5e-5 \
    --num_train_epochs 1.0 \
    --fp16

@hiyouga
Owner

hiyouga commented Apr 14, 2023

That dataset is only an example. For real fine-tuning you need to prepare a larger dataset; this example dataset is far too small.

@poppysix
Author

> That dataset is only an example. For real fine-tuning you need to prepare a larger dataset; this example dataset is far too small.

I used my own dataset of 900 samples and still got no results, while the official tuning code does produce results.

@hiyouga
Owner

hiyouga commented Apr 14, 2023

You can try increasing LoRA's r value, or use the same P-Tuning method as the official repo with pre_seq_len=128. At the same time, increase learning_rate to 1e-3.

The default parameters are deliberately conservative, in order to keep the model from catastrophic forgetting and from overfitting to the new dataset.
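Putting both suggestions together, the command from earlier in this thread might be adjusted as follows. This is only a sketch: the `--lora_rank=16` and `--learning_rate 1e-3` values come from this thread's advice, and the remaining flags are copied unchanged from the original command.

```shell
CUDA_VISIBLE_DEVICES=0 python finetune_chatglm.py \
    --do_train \
    --dataset example \
    --finetuning_type lora \
    --lora_rank=16 \
    --output_dir output \
    --per_device_train_batch_size 16 \
    --gradient_accumulation_steps 1 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --save_steps 1000 \
    --max_train_samples 3000 \
    --learning_rate 1e-3 \
    --num_train_epochs 1.0 \
    --fp16
```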

@LainNetWork

> You can try increasing LoRA's r value, or use the same P-Tuning method as the official repo with pre_seq_len=128. At the same time, increase learning_rate to 1e-3.
>
> The default parameters are deliberately conservative, in order to keep the model from catastrophic forgetting and from overfitting to the new dataset.

Hi, a beginner question: how exactly do I increase LoRA's r value? Is there a parameter I need to add?

@hiyouga
Owner

hiyouga commented Apr 16, 2023

> Hi, a beginner question: how exactly do I increase LoRA's r value? Is there a parameter I need to add?

@LainNetWork Add the argument --lora_rank=16
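For intuition on what raising the rank buys you: LoRA learns a low-rank update B·A for each adapted weight matrix, so the adapter on one (d_out × d_in) matrix has r·(d_in + d_out) trainable parameters, and doubling r doubles the adapter's capacity. A back-of-the-envelope sketch (the 4096 dimension is a hypothetical layer size, not taken from ChatGLM):

```python
def lora_params(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters LoRA adds to one (d_out x d_in) weight:
    an (r x d_in) down-projection A plus a (d_out x r) up-projection B."""
    return r * d_in + d_out * r

# Hypothetical 4096x4096 projection layer:
print(lora_params(4096, 4096, 8))   # 65536
print(lora_params(4096, 4096, 16))  # 131072 -- twice the capacity
```

Either rank is still tiny next to the 16.7M parameters of the full matrix, which is why LoRA fine-tuning is cheap even after raising r.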

@hiyouga hiyouga added the pending This problem is yet to be addressed. label Apr 16, 2023
@LainNetWork

> Hi, a beginner question: how exactly do I increase LoRA's r value? Is there a parameter I need to add?
>
> @LainNetWork Add the argument --lora_rank=16

Got it, thanks for the answer!

@hiyouga
Owner

hiyouga commented Apr 21, 2023

Hi, we've written a tutorial on loading custom content. Please see: https://github.com/hiyouga/ChatGLM-Efficient-Tuning/blob/main/examples/alter_self_cognition.md

@hiyouga hiyouga added solved This problem has been already solved. and removed pending This problem is yet to be addressed. labels Apr 21, 2023
@hiyouga hiyouga closed this as completed Apr 22, 2023
hiyouga pushed a commit that referenced this issue Jul 19, 2023