Fix the '-0.0' issue of blendv_ps and blendv_pd. #7

lowintelligence · 2020-04-18T06:18:08Z

For '_mm256_blendv_ps' and '_mm256_blendv_pd', the selector is
controlled by the highest flag of each unit, while the mask is
presented as a float or double SIMD register. For the case of
'-0.0', which is intepreted as 0x8000..00 in register, digits
in the 'b' input should be selected.
Previous implementation used 'vcgeq_f32' and 'vcgeq_f64' to get
flags of selector. These intrinsics won't think '-0.0 < 0.0',
thus digits in 'a' should be selected and an incorrect result
would be returned.
This fix convert the comparing from float intrinsics to integer
ones and preserve the correction of '-0.0' case.

For '_mm256_blendv_ps' and '_mm256_blendv_pd', the selector is controlled by the highest flag of each unit, while the mask is presented as a float or double SIMD register. For the case of '-0.0', which is intepreted as 0x8000..00 in register, digits in the 'b' input should be selected. Previous implementation used 'vcgeq_f32' and 'vcgeq_f64' to get flags of selector. These intrinsics won't think '-0.0 < 0.0', thus digits in 'a' should be selected and an incorrect result would be returned. This fix convert the comparing from float intrinsics to integer ones and preserve the correction of '-0.0' case.

gd321

Nice!

gd321

Nice!

gd321

looks great

derekpush requested a review from gd321 April 21, 2020 08:36

gd321 reviewed Jul 23, 2020

View reviewed changes

gd321 approved these changes Jul 23, 2020

View reviewed changes

gd321 merged commit 9f651e7 into kunpengcompute:master Jul 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the '-0.0' issue of blendv_ps and blendv_pd. #7

Fix the '-0.0' issue of blendv_ps and blendv_pd. #7

lowintelligence commented Apr 18, 2020

gd321 left a comment

gd321 left a comment

gd321 left a comment

Fix the '-0.0' issue of blendv_ps and blendv_pd. #7

Fix the '-0.0' issue of blendv_ps and blendv_pd. #7

Conversation

lowintelligence commented Apr 18, 2020

gd321 left a comment

Choose a reason for hiding this comment

gd321 left a comment

Choose a reason for hiding this comment

gd321 left a comment

Choose a reason for hiding this comment