[Core] Factor out input preprocessing to a separate class #7329

DarkLight1337 · 2024-08-09T02:57:01Z

Following up on #7258, this PR implements the suggestion by @robertgshaw2-neuralmagic to create a new class for prompt parsing. This keeps the sync and async versions of the preprocessing code together so that they can be updated together easily, while also slimming down the existing engine classes to focus on request processing.
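
To illustrate the idea, here is a minimal sketch of a preprocessing class that holds the sync and async paths side by side, with the engine delegating to it. All names (`ToyTokenizer`, `ToyPreprocessor`, `ToyEngine`) are hypothetical stand-ins and are not the classes added by this PR.

```python
import asyncio
from typing import List


class ToyTokenizer:
    """Hypothetical stand-in for a real tokenizer; not part of vLLM."""

    def encode(self, text: str) -> List[int]:
        # Map each character to its code point, purely for illustration.
        return [ord(ch) for ch in text]

    async def encode_async(self, text: str) -> List[int]:
        await asyncio.sleep(0)  # yield control to the event loop once
        return self.encode(text)


class ToyPreprocessor:
    """Hypothetical class that keeps the sync and async paths together."""

    def __init__(self, tokenizer: ToyTokenizer) -> None:
        self.tokenizer = tokenizer

    def preprocess(self, prompt: str) -> List[int]:
        return self.tokenizer.encode(prompt.strip())

    async def preprocess_async(self, prompt: str) -> List[int]:
        """Async version of :meth:`preprocess`."""
        return await self.tokenizer.encode_async(prompt.strip())


class ToyEngine:
    """Hypothetical engine that delegates preprocessing instead of owning it."""

    def __init__(self) -> None:
        self.input_preprocessor = ToyPreprocessor(ToyTokenizer())

    def add_request(self, prompt: str) -> List[int]:
        return self.input_preprocessor.preprocess(prompt)
```

Because both methods live in one class, a change to the shared step (here, `prompt.strip()`) is visible in one place for both code paths.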

github-actions · 2024-08-09T02:57:13Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small, essential subset of CI tests to quickly catch errors. You can run other CI tests on top of the default ones by unblocking the steps in your fast-check build on the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

njhill

Thanks @DarkLight1337, just some minor comments.

We may need to re-think some of this soon since we're hoping to get rid of AsyncLLMEngine altogether. I'm thinking we'll hopefully no longer need both sync and async versions of these things.

njhill · 2024-09-12T16:56:24Z

vllm/inputs/preprocess.py

+        """Async version of :meth:`_extract_prompt_components`."""
+        parsed = parse_singleton_prompt(inputs)
+
+        if parsed["type"] == "str":


Could we assign prompt_type = parsed["type"] once and reuse it instead of repeating the lookup? (Same for parsed["content"].)

DarkLight1337 · 2024-09-12 (edited)

I tried this, but unfortunately it breaks the type checker's ability to narrow the type of parsed, so I'm keeping the code as is.
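
The narrowing issue can be sketched as follows. This is an illustration under assumptions, not the code in this PR: `ParsedStr`, `ParsedTokens`, `describe` and `describe_cached` are hypothetical names, and the described behaviour reflects how checkers such as mypy typically treat tagged unions of `TypedDict`s.

```python
from typing import List, Literal, TypedDict, Union


class ParsedStr(TypedDict):
    # Hypothetical variant tagged by a literal "type" key.
    type: Literal["str"]
    content: str


class ParsedTokens(TypedDict):
    # Hypothetical variant carrying token IDs.
    type: Literal["tokens"]
    content: List[int]


Parsed = Union[ParsedStr, ParsedTokens]


def describe(parsed: Parsed) -> str:
    # Comparing the key directly on `parsed` lets the checker narrow the
    # union, so `parsed["content"]` is seen as `str` in this branch.
    if parsed["type"] == "str":
        return parsed["content"].upper()
    # Here the checker treats `parsed` as ParsedTokens.
    return str(sum(parsed["content"]))


def describe_cached(parsed: Parsed) -> str:
    # Caching the lookups detaches them from `parsed`: `content` keeps the
    # type Union[str, List[int]] even after `prompt_type` is compared.
    prompt_type = parsed["type"]
    content = parsed["content"]
    if prompt_type == "str":
        # An explicit runtime check is needed to satisfy the checker.
        assert isinstance(content, str)
        return content.upper()
    assert isinstance(content, list)
    return str(sum(content))
```

Both functions behave the same at runtime; the difference is only in what the static checker can prove without the extra `isinstance` assertions.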

njhill · 2024-09-12T17:02:13Z

vllm/inputs/preprocess.py

+
+        parsed = parse_singleton_prompt(inputs)
+
+        if parsed["type"] == "str":


Could we assign prompt_type = parsed["type"] once and reuse it instead of repeating the lookup? (Same for parsed["content"].)

DarkLight1337 · 2024-09-12T17:15:58Z

Thanks for the review! I'll address the detailed comments shortly.

> We may need to re-think some of this soon since we're hoping to get rid of AsyncLLMEngine altogether. I'm thinking we'll hopefully no longer need both sync and async versions of these things.

Even without AsyncLLMEngine, the preprocessing logic fits better inside the vllm.inputs module, and this PR reduces the bloat inside LLMEngine itself.

DarkLight1337 · 2024-09-12T17:20:40Z

I have finished addressing your other comments.

Factor out input preprocessing to a separate class

295bb07

DarkLight1337 requested a review from robertgshaw2-neuralmagic August 9, 2024 02:57

DarkLight1337 added 9 commits August 9, 2024 03:00

format

4b00d69

Merge branch 'upstream' into input-preprocessor

8b0b526

Merge branch 'upstream' into input-preprocessor

0e0f308

Merge branch 'upstream' into input-preprocessor

09d0706

Remove unnecessary arg

ecb43ee

Merge branch 'upstream' into input-preprocessor

26d308e

Fix type error

e86901f

Fix imports

61ea59c

Fix type error

5a97c5b

DarkLight1337 force-pushed the input-preprocessor branch from caec7bf to 5a97c5b Compare August 28, 2024 08:40

DarkLight1337 added 4 commits August 28, 2024 15:23

Merge branch 'upstream' into input-preprocessor

1866ee4

Merge branch 'upstream' into input-preprocessor

f605c45

Merge branch 'upstream' into input-preprocessor

31e15c0

Merge branch 'upstream' into input-preprocessor

f5c0bbd

njhill self-requested a review September 12, 2024 15:32

njhill reviewed Sep 12, 2024

DarkLight1337 added 3 commits September 12, 2024 17:17

Merge branch 'upstream' into input-preprocessor

90b36f7

Remove redundant else

23d89ff

format

b10b4b0

Remove unused import

9e5f5f8

njhill approved these changes Sep 12, 2024

DarkLight1337 enabled auto-merge (squash) September 12, 2024 17:24

github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Sep 12, 2024

DarkLight1337 added 2 commits September 13, 2024 00:53

Merge branch 'upstream' into input-preprocessor

fbf878a

Update test

1b8f568

DarkLight1337 merged commit 5ec9c0f into vllm-project:main Sep 13, 2024
49 of 50 checks passed

DarkLight1337 deleted the input-preprocessor branch September 13, 2024 03:18

Jeffwan pushed a commit to aibrix/vllm that referenced this pull request Sep 19, 2024

[Core] Factor out input preprocessing to a separate class (vllm-proje…

b93e2ec

…ct#7329)
