[Model] Add PaliGemma #5189

ywang96 · 2024-06-02T00:50:50Z

Add support for PaliGemma - Google's Cutting-Edge Open Vision Language Model.

FIX #4833

ywang96 · 2024-06-02T00:55:32Z

Not ready for review yet but FYI @zhuohan123 @simon-mo since you asked me about this.

also cc @DarkLight1337 since I might need you to review this eventually

ywang96 · 2024-06-10T07:04:36Z

Seeing some correctness issue with the implementation of Gemma itself - going to mark this PR back to draft for now.

DarkLight1337 · 2024-06-10T07:43:18Z

tests/models/test_paligemma.py

+        input_id for input_id in input_ids
+        if input_id != image_token_id and input_id != tokenizer.bos_token_id
+    ]
+    if hf_input_ids[-1] == 108:


Maybe indicate what this 108 means.

DarkLight1337 · 2024-06-10T08:09:25Z

Design-wise I think the code is fine. However, there appear to be major discrepancies between the expected and actual output beyond what can be observed for the base Gemma model. For example, the caption es example is not working at all (the model returns English output instead of Spanish).

WoosukKwon · 2024-07-05T17:43:25Z

@ywang96 Did you have a chance to test the model again after #5913?

ywang96 · 2024-07-05T22:37:53Z

@ywang96 Did you have a chance to test the model again after #5913?

@WoosukKwon I haven't, but I also need to update this PR because of the refactoring we just finished to make sure it works.

ywang96 · 2024-07-06T03:36:54Z

This PR passed all tests locally and is ready for review. Please take a look @WoosukKwon!

DarkLight1337

Overall LGTM but I'll check the warnings outputted in the VLM tests to see how consistent it is with the HF model.

vllm/model_executor/models/paligemma.py

DarkLight1337 · 2024-07-06T05:27:35Z

Seems that the vLLM output string is not following HF format:

https://buildkite.com/vllm/ci-aws/builds/4150#0190861d-cafe-41db-81ba-cb4c0d8fc2a6

Can you fix that?

docs/source/models/supported_models.rst

ywang96 · 2024-07-06T05:32:20Z

Seems that the vLLM output string is not following HF format:

https://buildkite.com/vllm/ci-aws/builds/4150#0190861d-cafe-41db-81ba-cb4c0d8fc2a6

Can you fix that?

Ah it's the eos_token. It's also weird that some of these tests actually don't give an output at all - I guess I will need to take a deeper look at it and see what went wrong.

WoosukKwon

@ywang96 Thanks for the PR! The PR looks good overall, but I think we can reduce the redundancy by reusing the code for Llava.

docs/source/models/supported_models.rst

WoosukKwon · 2024-07-06T18:39:58Z

examples/paligemma_example.py

Do we need this example? I'm wondering because it seems pretty similar to the llava example.

The tests and examples were made for each model previously because the image-related engine args were different from each model. This is indeed no longer needed after the refactoring we did, and I will open another PR for consolidate them if you're okay with that.

Thanks for the explanation! Sounds good. Please open the PR!

WoosukKwon · 2024-07-06T18:41:59Z

tests/models/test_paligemma.py

I think this test is also a bit redundant with test_llava.py. Can we refactor test_llava.py to cover both models?

See my comment above.

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon

@ywang96 LGTM. Please feel free to open the PR for refactoring.

Also, please merge the PR after @DarkLight1337 also approves as he might be more knowledgeable about the models than me.

DarkLight1337 · 2024-07-07T01:25:26Z

Although the vLLM output for Spanish translation has some numerical difference and diverges from HF output after the first 2 tokens, I think the overall meaning is similar enough so it is fine. Let's merge this.

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

initial

e6352e5

ywang96 mentioned this pull request Jun 2, 2024

[New Model]: Google's Paligemma family of models #4833

Closed

remove lm head

7dfbe44

DarkLight1337 mentioned this pull request Jun 2, 2024

[RFC]: Multi-modality Support Refactoring #4194

Open

88 tasks

ywang96 added 5 commits June 7, 2024 15:13

Merge branch 'main' into paligemma

3fd77fe

Merge branch 'main' into paligemma

ccb0f25

update tests

9b5269d

fix test

af11afa

format

a465e85

ywang96 assigned DarkLight1337 Jun 9, 2024

ywang96 marked this pull request as ready for review June 9, 2024 03:10

ywang96 added 11 commits June 8, 2024 20:44

fix model loading

3e9a12b

fix input args

c734a17

fix model loading

2d7de4d

add embedding method to gemma

2f65bf7

fix linear output

04e4ace

update gemma forward

4a9551d

update

6fd10f1

fix test

d08db94

remove extra bos

e325630

format

cbb7c49

add gemma to model test

7ea7265

ywang96 marked this pull request as draft June 10, 2024 07:04

DarkLight1337 reviewed Jun 10, 2024

View reviewed changes

ywang96 added 2 commits June 10, 2024 19:44

try normal caption

9a8cd85

Merge branch 'main' into paligemma

9069831

DarkLight1337 mentioned this pull request Jun 13, 2024

[Model] Initialize Phi-3-vision support #4986

Merged

3 tasks

WoosukKwon and others added 4 commits June 28, 2024 06:28

Merge branch 'main' into woosuk-gemma1

524db49

Merge branch 'main' into paligemma

7e6f0fd

Merge remote-tracking branch 'upstream/woosuk-gemma1' into paligemma

50ae420

Merge branch 'main' into paligemma

e0828b0

ywang96 added 5 commits July 5, 2024 17:07

update paligemma

c4fa37f

update test

b09066e

update

bf4bb58

update

c1b9ebf

add model to doc

5c0d2ec

ywang96 marked this pull request as ready for review July 6, 2024 03:36

ywang96 assigned WoosukKwon Jul 6, 2024

DarkLight1337 reviewed Jul 6, 2024

View reviewed changes

vllm/model_executor/models/paligemma.py Outdated Show resolved Hide resolved

vllm/model_executor/models/paligemma.py Outdated Show resolved Hide resolved

address comments

0b76ac1

DarkLight1337 reviewed Jul 6, 2024

View reviewed changes

docs/source/models/supported_models.rst Outdated Show resolved Hide resolved

ywang96 added 3 commits July 5, 2024 22:44

fix eos

4823852

Merge branch 'main' into paligemma

02b7c21

move doc

1651b15

WoosukKwon reviewed Jul 6, 2024

View reviewed changes

Update docs/source/models/supported_models.rst

2f94007

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon approved these changes Jul 7, 2024

View reviewed changes

DarkLight1337 approved these changes Jul 7, 2024

View reviewed changes

DarkLight1337 merged commit 6206dcb into vllm-project:main Jul 7, 2024
70 checks passed

mawong-amd mentioned this pull request Jul 19, 2024

[Bugfix][CI/Build][Hardware][AMD] Fix AMD tests, add HF cache, update CK FA, add partially supported model notes #6543

Merged

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Model] Add PaliGemma (vllm-project#5189)

f6a1521

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Add PaliGemma #5189

[Model] Add PaliGemma #5189

ywang96 commented Jun 2, 2024 •

edited

Loading

ywang96 commented Jun 2, 2024

ywang96 commented Jun 10, 2024

DarkLight1337 Jun 10, 2024 •

edited

Loading

DarkLight1337 commented Jun 10, 2024

WoosukKwon commented Jul 5, 2024

ywang96 commented Jul 5, 2024

ywang96 commented Jul 6, 2024

DarkLight1337 left a comment

DarkLight1337 commented Jul 6, 2024

ywang96 commented Jul 6, 2024

WoosukKwon left a comment

WoosukKwon Jul 6, 2024

ywang96 Jul 6, 2024

WoosukKwon Jul 6, 2024

WoosukKwon Jul 6, 2024

ywang96 Jul 6, 2024

WoosukKwon left a comment

DarkLight1337 commented Jul 7, 2024

[Model] Add PaliGemma #5189

[Model] Add PaliGemma #5189

Conversation

ywang96 commented Jun 2, 2024 • edited Loading

ywang96 commented Jun 2, 2024

ywang96 commented Jun 10, 2024

DarkLight1337 Jun 10, 2024 • edited Loading

Choose a reason for hiding this comment

DarkLight1337 commented Jun 10, 2024

WoosukKwon commented Jul 5, 2024

ywang96 commented Jul 5, 2024

ywang96 commented Jul 6, 2024

DarkLight1337 left a comment

Choose a reason for hiding this comment

DarkLight1337 commented Jul 6, 2024

ywang96 commented Jul 6, 2024

WoosukKwon left a comment

Choose a reason for hiding this comment

WoosukKwon Jul 6, 2024

Choose a reason for hiding this comment

ywang96 Jul 6, 2024

Choose a reason for hiding this comment

WoosukKwon Jul 6, 2024

Choose a reason for hiding this comment

WoosukKwon Jul 6, 2024

Choose a reason for hiding this comment

ywang96 Jul 6, 2024

Choose a reason for hiding this comment

WoosukKwon left a comment

Choose a reason for hiding this comment

DarkLight1337 commented Jul 7, 2024

ywang96 commented Jun 2, 2024 •

edited

Loading

DarkLight1337 Jun 10, 2024 •

edited

Loading