Raise error for long prompt #273

LiuXiaoxuanPKU · 2023-06-27T04:47:19Z

This is a fix for #113. The program hangs when the input prompt is too long because the check will always succeed and the request will always in the waiting queue and will never be added to the running queue.
Add a check before the if statement, if the prompt length is too long, it will raise ValueError directly.

zhuohan123 · 2023-06-27T16:03:15Z

Hi Lily, please check all the possible situations here and make sure you can handle all of them. Thanks!

LiuXiaoxuanPKU · 2023-06-28T03:54:51Z

I move forward with the length check. The original code should already handle the len(prompt) + len(generated) > limit case, so I only deal with model limit and len(prompt) > limit. Let me know if there are any problems, thanks!

zhuohan123

Hey Lily, thanks for the work! Please check my comments.

vllm/core/scheduler.py

vllm/sequence.py

vllm/engine/arg_utils.py

vllm/config.py

zhuohan123

Thanks for the hard work! In general LGTM! Will merge it after the following small comments are fixed.

vllm/engine/llm_engine.py

vllm/config.py

…project#273) SUMMARY: * update benchmarking, testing, and accuracy jobs to run on label `aws-test-a10g-24G` or `aws-test-a10-96G` which is based on "vanilla deeplearning" AMI * update relevant GHA actions and workflows to not be dependent on `pyenv` virtualenv * update "model cache" to use local disk as opposed to "EFS" TEST PLAN: runs on remote push --------- Co-authored-by: andy-neuma <andy@neuralmagic.com> Co-authored-by: Domenic Barbuzzi <domenic@neuralmagic.com>

fixlen

d3acf1a

LiuXiaoxuanPKU requested a review from WoosukKwon June 27, 2023 04:47

zhuohan123 mentioned this pull request Jun 27, 2023

Prompt size limits? It keeps hanging with prompts longer than 120 tokens #276

Closed

move forward input check

0dfebd0

LinPoly mentioned this pull request Jun 28, 2023

Long context will cause the vLLM stop #286

Closed

WoosukKwon mentioned this pull request Jun 28, 2023

[PyPI] Bump the version up to v0.1.2 #300

Merged

LiuXiaoxuanPKU added 3 commits June 29, 2023 23:33

handle long prompt

a2da10e

clean create request output

4055a80

move request output

9957596

zhuohan123 requested changes Jun 30, 2023

View reviewed changes

zhuohan123 mentioned this pull request Jun 30, 2023

Fix #issues/320 #325

Closed

fix comments

923745d

zhuohan123 approved these changes Jul 1, 2023

View reviewed changes

vllm/engine/llm_engine.py Outdated Show resolved Hide resolved

vllm/config.py Outdated Show resolved Hide resolved

fix comments

bbdf8f2

zhuohan123 merged commit dafd924 into vllm-project:main Jul 1, 2023

michaelfeil pushed a commit to michaelfeil/vllm that referenced this pull request Jul 1, 2023

Raise error for long prompt (vllm-project#273)

183792a

LiuXiaoxuanPKU deleted the fix branch July 6, 2023 17:55

dexterju27 mentioned this pull request Aug 15, 2023

Server hanging after prompt exceeds limit with LLaMA 2 models #765

Closed

ann-lab52 mentioned this pull request Dec 28, 2023

API server abort all request for no reason #2297

Closed

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Raise error for long prompt (vllm-project#273)

bc320ac

sjchoi1 pushed a commit to casys-kaist-internal/vllm that referenced this pull request May 7, 2024

Raise error for long prompt (vllm-project#273)

905219e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Raise error for long prompt #273

Raise error for long prompt #273

LiuXiaoxuanPKU commented Jun 27, 2023

zhuohan123 commented Jun 27, 2023

LiuXiaoxuanPKU commented Jun 28, 2023

zhuohan123 left a comment

zhuohan123 left a comment

Raise error for long prompt #273

Raise error for long prompt #273

Conversation

LiuXiaoxuanPKU commented Jun 27, 2023

zhuohan123 commented Jun 27, 2023

LiuXiaoxuanPKU commented Jun 28, 2023

zhuohan123 left a comment

Choose a reason for hiding this comment

zhuohan123 left a comment

Choose a reason for hiding this comment