Add Baichuan2 Support #247

AoyuQC · 2023-12-10T13:24:54Z

Add support for baichuan2, issue #50

casper-hansen · 2023-12-13T16:05:47Z

This looks great, thanks for the PR @AoyuQC. Could you run perplexity on FP16 vs INT4 quantized so we can see how much performance is degraded?

python examples/eval.py --model_path <your_model>

AoyuQC · 2023-12-17T06:56:49Z

Hi, @casper-hansen , I have run perplexity test on FP16 vs INT4. The FP16 version is 6.800 while the INT4 version is 6.938. Please check the following image for experiment results.

casper-hansen · 2023-12-21T12:51:58Z

Solid numbers @AoyuQC! And great work :)

I have a few questions:

You have created a BaichuanBlock. It looks a lot like the LlamaLikeBlock. Can we reuse the existing one? (we already reuse it for mistral, yi, qwen, etc.)
Is the modification to calib_data.py necessary to support the model? It seems like the new method makes some new assumptions about the format of custom datasets.

AoyuQC · 2023-12-22T02:15:06Z

Solid numbers @AoyuQC! And great work :)

I have a few questions:

1. You have created a `BaichuanBlock`. It looks a lot like the `LlamaLikeBlock`. Can we reuse the existing one? (we already reuse it for mistral, yi, qwen, etc.)

2. Is the modification to `calib_data.py` necessary to support the model? It seems like the new method makes some new assumptions about the format of custom datasets.

Hi @casper-hansen , I have reused LlamaLikeBlock for baichuan2, please have a look. The reason that I modified calib_data.py is that I want to support the case that people want to use tokenized input for calibration not just raw string input.

AoyuQC added 6 commits December 8, 2023 08:41

inital add for baichuan files

a560b9d

lower calib data for testing

6e0555a

Merge branch 'main' into baichuan2

13b6067

feat: finish fuzed function

d97c784

fix length bug, remove unused dep

1d352d6

return to original quantizer

b2031df

feat: improve calib data to support tokenized dataset

0dd6053

feat: reuse llamalikeblock for baichuan2

18b0ffe

casper-hansen added 3 commits December 23, 2023 13:23

Merge branch 'main' into pr/247

439acc4

Readd use_alibi to LlamaLikeBlock

93efe3e

Add Baichuan to auto-map

fe983dc

casper-hansen merged commit cef9f11 into casper-hansen:main Dec 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Baichuan2 Support #247

Add Baichuan2 Support #247

AoyuQC commented Dec 10, 2023

casper-hansen commented Dec 13, 2023

AoyuQC commented Dec 17, 2023 •

edited

Loading

casper-hansen commented Dec 21, 2023

AoyuQC commented Dec 22, 2023 •

edited

Loading

Add Baichuan2 Support #247

Add Baichuan2 Support #247

Conversation

AoyuQC commented Dec 10, 2023

casper-hansen commented Dec 13, 2023

AoyuQC commented Dec 17, 2023 • edited Loading

casper-hansen commented Dec 21, 2023

AoyuQC commented Dec 22, 2023 • edited Loading

AoyuQC commented Dec 17, 2023 •

edited

Loading

AoyuQC commented Dec 22, 2023 •

edited

Loading