
Could I compute relative_position_bias in __init__() instead of forward()? #189

Open
Alobal opened this issue Apr 7, 2022 · 1 comment

Comments


Alobal commented Apr 7, 2022

Hi,

The relative_position_bias seems to be a constant lookup into self.relative_position_bias_table, so why does it have to be recomputed in forward() every time?

In other words, why can't the following code be computed once in WindowAttention.__init__()?

relative_position_bias = self.relative_position_bias_table[self.relative_position_index.view(-1)].view(
        self.window_size[0] * self.window_size[1], self.window_size[0] * self.window_size[1], -1)  # Wh*Ww,Wh*Ww,nH
relative_position_bias = relative_position_bias.permute(2, 0, 1).contiguous()  # nH, Wh*Ww, Wh*Ww
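
For reference, here is a minimal sketch (a hypothetical, simplified module; the index derivation mirrors the released Swin Transformer code) of what I imagine the precomputation could look like, with the one catch I can see noted in the comments:

import torch
import torch.nn as nn

class WindowAttentionSketch(nn.Module):
    """Hypothetical, simplified module: only the bias precomputation, no attention."""
    def __init__(self, window_size, num_heads):
        super().__init__()
        self.window_size = window_size  # (Wh, Ww)
        # Trainable table: one learnable bias vector (nH values) per relative offset.
        self.relative_position_bias_table = nn.Parameter(
            torch.zeros((2 * window_size[0] - 1) * (2 * window_size[1] - 1), num_heads))

        # The index tensor really is constant, so it belongs in __init__
        # (the original code also registers it as a buffer there).
        coords = torch.stack(torch.meshgrid(
            torch.arange(window_size[0]),
            torch.arange(window_size[1]), indexing="ij"))  # 2, Wh, Ww
        coords_flatten = torch.flatten(coords, 1)  # 2, Wh*Ww
        relative_coords = (coords_flatten[:, :, None]
                           - coords_flatten[:, None, :]).permute(1, 2, 0).contiguous()
        relative_coords[:, :, 0] += window_size[0] - 1  # shift to start from 0
        relative_coords[:, :, 1] += window_size[1] - 1
        relative_coords[:, :, 0] *= 2 * window_size[1] - 1
        self.register_buffer("relative_position_index",
                             relative_coords.sum(-1))  # Wh*Ww, Wh*Ww

        # Hypothetical precomputation of the bias itself. The one catch:
        # relative_position_bias_table is an nn.Parameter updated by the
        # optimizer, so a bias cached here reflects only the *initial* weights
        # and would go stale after every training step.
        with torch.no_grad():
            n = window_size[0] * window_size[1]
            bias = self.relative_position_bias_table[
                self.relative_position_index.view(-1)].view(n, n, -1)  # Wh*Ww, Wh*Ww, nH
            self.register_buffer("cached_relative_position_bias",
                                 bias.permute(2, 0, 1).contiguous())  # nH, Wh*Ww, Wh*Ww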

Thank you for taking the time to clear up my confusion~

@Beastmaster

I have the same question. relative_position_bias is selected by indices from a predefined tensor, and its shape won't change during training.
