support internlm-xcomposer2d5-7b #1932
Conversation
## LoRA Model

InternLM-XComposer-2.5 trained LoRA weights for webpage creation and article writing. Since the TurboMind backend doesn't support S-LoRA, only one LoRA model can be deployed at a time, and the LoRA weights need to be merged into the base model when deploying it. LMDeploy provides the corresponding conversion script, which is used as follows:
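The exact invocation of the conversion script isn't quoted in this thread, but conceptually the merge folds the task-specific LoRA update back into the base weights. A minimal sketch of that idea, assuming the standard LoRA parameterization W' = W + scaling * B @ A (the helper name below is hypothetical, not LMDeploy's API):

```python
import torch

def merge_lora_weight(base_weight: torch.Tensor,
                      lora_a: torch.Tensor,
                      lora_b: torch.Tensor,
                      scaling: float = 1.0) -> torch.Tensor:
    """Fold a LoRA update into the base weight: W' = W + scaling * B @ A.

    Shapes (illustrative):
      base_weight: (out_features, in_features)
      lora_a:      (rank, in_features)
      lora_b:      (out_features, rank)
    """
    return base_weight + scaling * lora_b @ lora_a
```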
Since we support plora, can we support lora directly without merging it?
For a specific task, there are actually two LoRA weights (PLoRA + LoRA), and we can only support one of them.
https://huggingface.co/internlm/internlm-xcomposer2d5-7b/blob/main/build_mlp.py#L227-L243
Currently, the HF model can't be used with the transformers API due to InternLM/InternLM-XComposer#354
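For context, PLoRA applies its low-rank update only to image-token positions, while the task LoRA applies to all tokens, which is why two sets of weights exist. A rough, illustrative sketch of the partial-LoRA idea (the class and argument names are assumptions, not the model's actual code at the link above):

```python
import torch
import torch.nn as nn

class PartialLoRALinear(nn.Module):
    """Illustrative sketch: a linear layer whose LoRA update is applied
    only to positions selected by a mask (e.g. image tokens)."""

    def __init__(self, in_features: int, out_features: int,
                 rank: int = 8, scaling: float = 1.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.lora_a = nn.Linear(in_features, rank, bias=False)
        self.lora_b = nn.Linear(rank, out_features, bias=False)
        self.scaling = scaling

    def forward(self, x: torch.Tensor, im_mask: torch.Tensor = None) -> torch.Tensor:
        out = self.base(x)
        if im_mask is not None and im_mask.any():
            # apply the low-rank update only to the masked (image) positions
            out[im_mask] = out[im_mask] + self.scaling * self.lora_b(self.lora_a(x[im_mask]))
        return out
```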
LGTM after some typos are fixed
LGTM
Motivation
support internlm-xcomposer2d5-7b
Related issues: #1920