ARM64-SVE: Fix hwintrinsics flags #107791

a74nh · 2024-09-13T12:22:27Z

ARM64-SVE: Fix hwintrinsics flags

Spotted during implementation of #107459

SQADD and SQSUB each have two variants: predicated and unpredicated. We currently only support SVE1, so we must use the unpredicated version. Remove the RMW and low-mask flags, because they only apply to the predicated version.

ConvertMaskToVector does not have an explicit mask, but ConvertVectorToMask does. Fix the flags on both.

Add assert checks to IsLowMaskedOperation.
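The kind of invariant such an assert can enforce is sketched below in isolation. This is a minimal, self-contained illustration and not the JIT's actual code: only `HW_Flag_ExplicitMaskedOperation = 0x20000` comes from the diff in this PR. The other flag names and values, the helper `IsConsistentLowMask`, and the choice to pass raw flag bits instead of an intrinsic id are assumptions made for the sketch. The idea is that a low-mask flag is only meaningful when the intrinsic also declares some kind of mask.

```cpp
#include <cassert>
#include <cstdint>

// Only HW_Flag_ExplicitMaskedOperation = 0x20000 is taken from the diff.
// All other names and values are placeholders for this sketch.
enum HWIntrinsicFlag : uint32_t
{
    HW_Flag_NoFlag                          = 0,
    HW_Flag_ExplicitMaskedOperation         = 0x20000,  // from the diff
    HW_Flag_LowMaskedOperation              = 0x40000,  // placeholder value
    HW_Flag_EmbeddedMaskedOperation         = 0x80000,  // placeholder name and value
    HW_Flag_OptionalEmbeddedMaskedOperation = 0x100000, // placeholder name and value
};

// True when the flag set declares any kind of mask in arg1.
bool HasAnyMaskFlag(uint32_t flags)
{
    const uint32_t anyMask = HW_Flag_ExplicitMaskedOperation | HW_Flag_EmbeddedMaskedOperation |
                             HW_Flag_OptionalEmbeddedMaskedOperation;
    return (flags & anyMask) != 0;
}

// Hypothetical consistency rule: "low mask" implies "has a mask".
bool IsConsistentLowMask(uint32_t flags)
{
    const bool isLow = (flags & HW_Flag_LowMaskedOperation) != 0;
    return !isLow || HasAnyMaskFlag(flags);
}

// Query helper that asserts the rule before answering.
bool IsLowMaskedOperation(uint32_t flags)
{
    assert(IsConsistentLowMask(flags));
    return (flags & HW_Flag_LowMaskedOperation) != 0;
}
```

Under this rule, a flag set containing only the low-mask bit is rejected by `IsConsistentLowMask`, while low-mask combined with an explicit mask is accepted.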

a74nh · 2024-09-13T12:45:26Z

This is ready.

@dotnet/arm64-contrib @kunalspathak

a74nh · 2024-09-13T12:46:41Z

The difference between this PR and HEAD is that HEAD marks some registers as delay free when it does not need to.

a74nh · 2024-09-13T17:00:06Z

Confirmed stress tests of the hwintrinsics tests fully pass.

kunalspathak

Added a question about HW_Flag_ExplicitMaskedOperation.

kunalspathak · 2024-09-13T16:46:03Z

src/coreclr/jit/hwintrinsic.h

@@ -204,7 +204,8 @@ enum HWIntrinsicFlag : unsigned int
    // The intrinsic uses a mask in arg1 to select elements present in the result
    HW_Flag_ExplicitMaskedOperation = 0x20000,

-    // The intrinsic uses a mask in arg1 to select elements present in the result, and must use a low register.
+    // The intrinsic uses a mask in arg1 (either explicitly, embdedd or optionally embedded) to select elements present


nit: something to fix in follow-up PR

Suggested change

-    // The intrinsic uses a mask in arg1 (either explicitly, embdedd or optionally embedded) to select elements present
+    // The intrinsic uses a mask in arg1 (either explicitly, embedded or optionally embedded) to select elements present

kunalspathak · 2024-09-13T17:16:59Z

src/coreclr/jit/hwintrinsiclistarm64sve.h

-HARDWARE_INTRINSIC(Sve,           ConvertMaskToVector,                                              -1,      1,     {INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov},     HW_Category_Helper,                HW_Flag_Scalable|HW_Flag_ExplicitMaskedOperation)
-HARDWARE_INTRINSIC(Sve,           ConvertVectorToMask,                                              -1,      2,     {INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne},   HW_Category_Helper,                HW_Flag_Scalable|HW_Flag_ReturnsPerElementMask|HW_Flag_LowMaskedOperation)
+HARDWARE_INTRINSIC(Sve,           ConvertMaskToVector,                                              -1,      1,     {INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov,        INS_sve_mov},     HW_Category_Helper,                HW_Flag_Scalable)
+HARDWARE_INTRINSIC(Sve,           ConvertVectorToMask,                                              -1,      2,     {INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne,      INS_sve_cmpne},   HW_Category_Helper,                HW_Flag_Scalable|HW_Flag_ExplicitMaskedOperation|HW_Flag_ReturnsPerElementMask|HW_Flag_LowMaskedOperation)


Removing HW_Flag_ExplicitMaskedOperation from ConvertMaskToVector and putting it on ConvertVectorToMask makes sense, but I wonder why it has worked so far. Perhaps we do not care about this flag for these intrinsics and it can be eliminated?

Spoke offline. As expected, this flag was never used for these two intrinsics, but it is good to have for correctness.

We'll never use it because the converts will never be embedded.
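Because the two HARDWARE_INTRINSIC lines in the diff are long, the flag change is easy to miss. The sketch below restates just the flag sets from those lines and computes what was added and removed. It is an illustration only: `HW_Flag_ExplicitMaskedOperation = 0x20000` is from the diff, while the other numeric values, the `FlagChange` struct, and the `DiffFlags` and `CheckConvertFlagChanges` helpers are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Only HW_Flag_ExplicitMaskedOperation = 0x20000 is taken from the diff.
// The remaining values are placeholders for this sketch.
enum HWIntrinsicFlag : uint32_t
{
    HW_Flag_Scalable                = 0x1,     // placeholder value
    HW_Flag_ReturnsPerElementMask   = 0x2,     // placeholder value
    HW_Flag_LowMaskedOperation      = 0x4,     // placeholder value
    HW_Flag_ExplicitMaskedOperation = 0x20000, // from the diff
};

// Hypothetical helper type: which flag bits a change added and removed.
struct FlagChange
{
    uint32_t added;
    uint32_t removed;
};

constexpr FlagChange DiffFlags(uint32_t before, uint32_t after)
{
    return FlagChange{after & ~before, before & ~after};
}

// Flag sets as written on the "-" and "+" lines of the diff.
constexpr uint32_t kMaskToVectorBefore = HW_Flag_Scalable | HW_Flag_ExplicitMaskedOperation;
constexpr uint32_t kMaskToVectorAfter  = HW_Flag_Scalable;
constexpr uint32_t kVectorToMaskBefore =
    HW_Flag_Scalable | HW_Flag_ReturnsPerElementMask | HW_Flag_LowMaskedOperation;
constexpr uint32_t kVectorToMaskAfter = kVectorToMaskBefore | HW_Flag_ExplicitMaskedOperation;

// Checks that the only change is the explicit-mask flag moving between the two.
void CheckConvertFlagChanges()
{
    const FlagChange maskToVector = DiffFlags(kMaskToVectorBefore, kMaskToVectorAfter);
    assert(maskToVector.added == 0);
    assert(maskToVector.removed == HW_Flag_ExplicitMaskedOperation);

    const FlagChange vectorToMask = DiffFlags(kVectorToMaskBefore, kVectorToMaskAfter);
    assert(vectorToMask.added == HW_Flag_ExplicitMaskedOperation);
    assert(vectorToMask.removed == 0);
}
```

Calling `CheckConvertFlagChanges()` passes silently, which matches the review comment: the explicit-mask flag moves from ConvertMaskToVector to ConvertVectorToMask and nothing else changes on these two entries.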

kunalspathak · 2024-09-13T17:27:56Z

/backport to release/9.0

github-actions · 2024-09-13T17:28:09Z

Started backporting to release/9.0: https://github.com/dotnet/runtime/actions/runs/10853493972

ARM64-SVE: Fix hwintrinsics flags

d7ab686

dotnet-issue-labeler bot added the area-CodeGen-coreclr label (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI) Sep 13, 2024

dotnet-policy-service bot added the community-contribution label (Indicates that the PR has been added by a community member) Sep 13, 2024

a74nh added the arm-sve (Work related to arm64 SVE/SVE2 support) and community-contribution labels and removed the community-contribution label Sep 13, 2024

a74nh marked this pull request as ready for review September 13, 2024 12:42

a74nh mentioned this pull request Sep 13, 2024

ARM64-SVE: refactor lsra buildHWIntrinsic #107459

Open

kunalspathak approved these changes Sep 13, 2024

kunalspathak merged commit 924fc2a into dotnet:main Sep 13, 2024
111 checks passed

github-actions bot mentioned this pull request Sep 13, 2024

[release/9.0] ARM64-SVE: Fix hwintrinsics flags #107802

Merged

4 tasks

a74nh deleted the lsra_addsaturate_github branch September 16, 2024 08:31

jtschuster pushed a commit to jtschuster/runtime that referenced this pull request Sep 17, 2024

ARM64-SVE: Fix hwintrinsics flags (dotnet#107791)

7c070d3
