Add comments on RoPE initialization #1176

WoosukKwon · 2023-09-25T23:18:02Z

Added more comments that can be useful for understanding the differences from HF.

casper-hansen · 2023-09-26T10:57:21Z

Does this explain the difference between HF and vLLM? I.e. if you enable the same CPU initialization with greedy sampling, we get same outputs?

WoosukKwon · 2023-09-26T17:05:55Z

@casper-hansen Not always. Because floating-point arithmetics is not associative, different kernel implementations might lead to different outputs. The difference is more significant when reduction operation is involved. Therefore, our custom CUDA kernels for attention and RMS normalization do not produce exactly the same outputs as the original HF implementation, and thus outputs of vLLM models can be different from the HF models. We've checked that the outputs usually match when using FP32 and greedy sampling, but there are some cases where the outputs do not match. However, please note that this does not hurt task accuracy, as vLLM's implementation is mathematically equivalent to HF's.

zhuohan123

LGTM! Thanks for the fix!

Add comments

06b9e95

WoosukKwon requested a review from zhuohan123 September 26, 2023 17:23

zhuohan123 approved these changes Sep 26, 2023

View reviewed changes

WoosukKwon merged commit 03ffd0a into main Sep 26, 2023
2 checks passed

WoosukKwon deleted the fix-rope branch September 26, 2023 17:48

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Add comments on RoPE initialization (vllm-project#1176)

582ccbe

sjchoi1 pushed a commit to casys-kaist-internal/vllm that referenced this pull request May 7, 2024

Add comments on RoPE initialization (vllm-project#1176)

2988c8f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add comments on RoPE initialization #1176

Add comments on RoPE initialization #1176

WoosukKwon commented Sep 25, 2023 •

edited

Loading

casper-hansen commented Sep 26, 2023 •

edited

Loading

WoosukKwon commented Sep 26, 2023

zhuohan123 left a comment

Add comments on RoPE initialization #1176

Add comments on RoPE initialization #1176

Conversation

WoosukKwon commented Sep 25, 2023 • edited Loading

casper-hansen commented Sep 26, 2023 • edited Loading

WoosukKwon commented Sep 26, 2023

zhuohan123 left a comment

Choose a reason for hiding this comment

WoosukKwon commented Sep 25, 2023 •

edited

Loading

casper-hansen commented Sep 26, 2023 •

edited

Loading