relaxed i8x16.swizzle #22

ngzhian · 2021-04-19T20:42:39Z

What are the instructions being proposed?

relaxed i8x16.swizzle

What are the semantics of these instructions?

relaxed i8x16.swizzle(a, s) selects lanes from a using indices in s, indices in the range [0,15] will select the i-th element of a, the result for any out of range indices is implementation-defined (i.e. if the index is [16-255].

How will these instructions be implemented? Give examples for at least
x86-64 and ARM64. Also provide reference implementation in terms of 128-bit
Wasm SIMD.

x86/64, pshufb, out of range indices will return different results:

if top bit of index is set, return 0
else select the i % 16-th element

ARM/ARM64, vtbl and tbl, out of range indices return 0.

RISC-V V vrgather.vv a, b, out of range return 0 (assuming VEW set to 8, LMUL set to 1, VLEN set to 128, so VLMAX = 16).

Simd128, i8x16.swizzle

How does behavior differ across processors? What new fingerprinting surfaces will be exposed?

Difference between x86/64 and ARM/ARM64

What use cases are there?

Swizzle is quite a common operation, e.g. used in multiple places in meshoptimizer.

The text was updated successfully, but these errors were encountered:

nemequ · 2021-04-19T22:14:43Z

On PPC, vpermr (the vec_perm intrinsic) could be used for this. It actually takes two input vectors (plus the index vector) and only the lower 5 bits are used for each index, but if you pass the same vector for both inputs effectively you get the i % 16 behavior.

On z/Arch there is vperm/vec_perm(), which works the same.

jlb6740 · 2021-04-27T06:47:38Z

This instruction is straightforward and was used as an example motivator for the relaxed-simd proposal itself. One question that comes to mind though is the mechanism for enabling? I think this has been discussed before but how would we be expected to enable specific instructions to be their relaxed version while others remain unrelaxed?

ngzhian · 2021-04-27T18:03:16Z

This instruction is straightforward and was used as an example motivator for the relaxed-simd proposal itself. One question that comes to mind though is the mechanism for enabling? I think this has been discussed before but how would we be expected to enable specific instructions to be their relaxed version while others remain unrelaxed?

We will not enable an existing instruction to be executed in a relaxed manner. The relaxed instruction will be a completely new instruction with different opcode.

jlb6740 · 2021-04-27T21:09:01Z

Yes, so then you could have a module that has both swizzle and relaxed swizzle instructions? What I am wondering then is if I am writing code in C that is auto-vectorized for example, is there expected to be a way to specify this to the compiler that's targeting Wasm?

ngzhian · 2021-04-27T21:30:04Z

Yes, so then you could have a module that has both swizzle and relaxed swizzle instructions?

Yup that is possible.

is there expected to be a way to specify this to the compiler that's targeting Wasm?

Not at the moment. Maybe we can introduce an Emscripten flag to do this, similar to the -msimd128 currently, that will emit relaxed i8x16.swizzle instead instead of i8x16.swizzle.

jlb6740 · 2021-04-27T21:44:09Z

Yes, a flag makes sense. In fact I imagine with the proper dependence analysis the compiler could figure out if it is safe to use the relaxed version of an instruction. In fact perhaps it should be criteria or go into the thinking/motivation of proposing a relaxed instruction .. that with a compiler flag giving permission and proper analysis a compiler could determine when it is safe to generate the relaxed version.

ngzhian · 2021-04-27T22:47:09Z

In fact I imagine with the proper dependence analysis the compiler could figure out if it is safe to use the relaxed version of an instruction.

Good idea, but likely not possible in the most general case. E.g. if the swizzle depends on a mutable global/imported value,

Maratyszcza · 2021-04-28T10:36:15Z

if I am writing code in C that is auto-vectorized for example

I don't expect that compiler would be able to generate either the normal i8x16.swizzle or a related one from auto-vectorized code.

#22

See WebAssembly/relaxed-simd#22 Differential Revision: https://phabricator.services.mozilla.com/D126704

ngzhian · 2021-11-01T18:58:21Z

Note: vtbl is not available on ARM v8-M MVE AFAICT.

ngzhian · 2021-11-01T19:47:05Z

RISC-V V has vrgather which returns 0 for out of bounds.

ngzhian · 2021-11-01T23:37:36Z

For Power, likely require vperm with shift left on the selection vector (vperm uses bits 3:7 of each byte of selection), then it will select modulo 16.

ngzhian added the instruction-proposal label Apr 19, 2021

penzn mentioned this issue May 17, 2021

Out-of-bounds behaviour WebAssembly/flexible-vectors#35

Closed

ngzhian added a commit that referenced this issue Jun 7, 2021

Add relaxed swizzle to overview

c885874

#22

ngzhian mentioned this issue Jun 7, 2021

Add relaxed swizzle to overview #24

Merged

ngzhian added a commit that referenced this issue Jun 10, 2021

Add relaxed swizzle to overview (#24)

e71547e

#22

moz-v2v-gh pushed a commit to mozilla/gecko-dev that referenced this issue Oct 1, 2021

Bug 1731855 - Prototype relaxed-SIMD i8x16.swizzle instruction. r=lth

786ed9d

See WebAssembly/relaxed-simd#22 Differential Revision: https://phabricator.services.mozilla.com/D126704

jamienicol pushed a commit to jamienicol/gecko that referenced this issue Oct 4, 2021

Bug 1731855 - Prototype relaxed-SIMD i8x16.swizzle instruction. r=lth

1f419d4

See WebAssembly/relaxed-simd#22 Differential Revision: https://phabricator.services.mozilla.com/D126704

ngzhian added the in-overview Instruction has been added to Overview.md label Feb 18, 2022

tomrittervg mentioned this issue Jun 16, 2022

WebAssembly Relaxed SIMD mozilla/standards-positions#651

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

relaxed i8x16.swizzle #22

relaxed i8x16.swizzle #22

ngzhian commented Apr 19, 2021 •

edited

Loading

nemequ commented Apr 19, 2021

jlb6740 commented Apr 27, 2021

ngzhian commented Apr 27, 2021

jlb6740 commented Apr 27, 2021

ngzhian commented Apr 27, 2021

jlb6740 commented Apr 27, 2021 •

edited

Loading

ngzhian commented Apr 27, 2021

Maratyszcza commented Apr 28, 2021

ngzhian commented Nov 1, 2021

ngzhian commented Nov 1, 2021

ngzhian commented Nov 1, 2021 •

edited

Loading

relaxed i8x16.swizzle #22

relaxed i8x16.swizzle #22

Comments

ngzhian commented Apr 19, 2021 • edited Loading

nemequ commented Apr 19, 2021

jlb6740 commented Apr 27, 2021

ngzhian commented Apr 27, 2021

jlb6740 commented Apr 27, 2021

ngzhian commented Apr 27, 2021

jlb6740 commented Apr 27, 2021 • edited Loading

ngzhian commented Apr 27, 2021

Maratyszcza commented Apr 28, 2021

ngzhian commented Nov 1, 2021

ngzhian commented Nov 1, 2021

ngzhian commented Nov 1, 2021 • edited Loading

ngzhian commented Apr 19, 2021 •

edited

Loading

jlb6740 commented Apr 27, 2021 •

edited

Loading

ngzhian commented Nov 1, 2021 •

edited

Loading