You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
谢谢,尝试了成功了,不过我遇到了一个很有趣的问题:
我在输入例如:
1、Read this sentence aloud, this is input: Today is a sunny day.
2、ask this question, this is input: do you know who is jams harden?
时,运行速度非常快,大约在500ms左右,但是我在运行以下输入时,速度就很慢:
1、do you know who is jams harden?
耗时大概是2s,我不知道具体是什么原因,在vllm中也有类似问题,添加this is input: 之后,速度就会变快
Checklist
Describe the bug
案例代码都是使用stream_infer推理,但是单条样本推理我发现有decode的代码,输出都是id,请问单条样本的预测代码是什么呢
Reproduction
logits = self.generator.decode(input_ids)
Environment
Error traceback
No response
The text was updated successfully, but these errors were encountered: