Do not use full memory barrier for osx/arm64 #71026

kunalspathak · 2022-06-20T22:51:20Z

macOS M1+ machines are on Arm v8.5, which supports the atomic instructions introduced in Arm v8.1 (LSE), so we don't have to emit a full memory barrier there.

References:

ghost · 2022-06-20T22:53:49Z

Tagging subscribers to this area: @dotnet/gc
See info in area-owners.md if you want to be subscribed.

Author:	kunalspathak
Assignees:	-
Labels:	`area-GC-coreclr`
Milestone:	-

jkotas · 2022-06-21T01:25:55Z

src/coreclr/pal/inc/pal.h

- __sync_synchronize();
-#endif
+ #else
+ // For OSX Arm64, the default Arm architecture is v8.1 which uses atomic instructions that don't need a full barrier.


Is the C/C++ compiler guaranteed to use the newer atomic instructions?

This PR should add an explicit flag for clang to target Arm v8.1, or e.g. `-mcpu=apple-m1`. Currently, it relies on my observation that Clang targets something newer than Arm v8.0 on M1 by default. If Apple decides to change that default internally, we might end up in a situation where these compiler intrinsics are lowered to v8.0 instructions, and without the memory barrier that means potential non-reproducible race conditions somewhere in the VM.
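To make the dependency on the compiler target visible, here is a minimal sketch that keys the barrier on the ACLE macro `__ARM_FEATURE_ATOMICS`, which Clang and GCC define when LSE is enabled for the target (for example through `-mcpu=apple-m1`). The names `InterlockedBarrierSketch` and `CompareExchangeSketch` are hypothetical and are not the PAL's actual helpers. The macro only says that the target permits LSE; it does not prove that every intrinsic is lowered to an LSE instruction.

```cpp
#include <cstdint>

// Hypothetical helper: emit a full barrier only when the build does not
// target LSE atomics. On non-arm64 hosts this sketch emits nothing.
inline void InterlockedBarrierSketch()
{
#if defined(__aarch64__) && !defined(__ARM_FEATURE_ATOMICS)
    // Target may fall back to the LL/SC sequence, so keep the full barrier.
    __sync_synchronize();
#endif
}

// Hypothetical compare-exchange wrapper in the style of an Interlocked API:
// returns the value that was stored at 'dest' before the operation.
inline int32_t CompareExchangeSketch(volatile int32_t* dest,
                                     int32_t exchange,
                                     int32_t comparand)
{
    int32_t previous = __sync_val_compare_and_swap(dest, comparand, exchange);
    InterlockedBarrierSketch();
    return previous;
}
```

With this shape, a change in the compiler's default target would bring the barrier back instead of silently dropping it.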

How does passing in `-mcpu=apple-m1` guarantee that the compiler will only ever use the new instructions?

LLVM maps apple-m1 to ARMV8_5A, as seen in https://github.com/llvm/llvm-project/blob/5ba0a9571b3ee3bc76f65e16549012a440d5a0fb/llvm/include/llvm/Support/AArch64TargetParser.def#L256-L257. However, I think the concern is valid, and the foolproof way to address it is to check explicitly, the way it is done for the Windows counterpart in #70921. I am working on a PR that will add a similar check for linux-arm64 (reason stated in #70921 (comment)), so it should take care of these things for osx as well.
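For illustration, an explicit check could look roughly like the sketch below. It is not the code from #70921 or from the follow-up PR; `CpuReportsLseAtomics` is a hypothetical name. It assumes the `hw.optional.armv8_1_atomics` sysctl on macOS and the `HWCAP_ATOMICS` bit from `getauxval(AT_HWCAP)` on Linux, and conservatively returns false everywhere else.

```cpp
#include <cstddef>

#if defined(__APPLE__)
#include <sys/types.h>
#include <sys/sysctl.h>
#elif defined(__linux__) && defined(__aarch64__)
#include <sys/auxv.h>
#include <asm/hwcap.h>
#endif

// Hypothetical helper: true when the running CPU reports Armv8.1 LSE atomics.
// Platforms not covered by this sketch report false.
inline bool CpuReportsLseAtomics()
{
#if defined(__APPLE__) && defined(__aarch64__)
    int value = 0;
    size_t size = sizeof(value);
    if (sysctlbyname("hw.optional.armv8_1_atomics", &value, &size, nullptr, 0) != 0)
    {
        return false; // query failed, assume no LSE
    }
    return value != 0;
#elif defined(__linux__) && defined(__aarch64__)
    return (getauxval(AT_HWCAP) & HWCAP_ATOMICS) != 0;
#else
    return false;
#endif
}
```

A check like this describes the hardware only. It would be used to select between an explicitly LSE code path and the legacy path with the barrier, not to infer what the compiler emitted for an intrinsic.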

I think inline asm solves all problems here (might be tricky with templates)

Alternatively we can write a small test that validates that the intrinsic is lowered into LSE 🤷

I do not think you can reliably test for this. For example, you may see the old instruction only when there is a certain addressing mode needed or only when the code is cold.

Does it use casal in debug? Could it switch to the old LL/SC helper because of register pressure, or if the old implementation is one day found to be faster (it could be)?

It feels like inline asm could have more reliable guarantees.
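As a rough sketch of the inline asm idea, assuming GCC/Clang extended asm syntax: on osx/arm64 the function below always issues `casal`, whatever the optimizer would have chosen for an intrinsic. The name `CompareExchangeCasalSketch` is hypothetical, and other targets fall back to the `__sync` builtin so the example still compiles there.

```cpp
#include <cstdint>

// Hypothetical helper: compare-exchange that is pinned to CASAL on osx/arm64.
// Returns the value that was stored at 'dest' before the operation.
inline int32_t CompareExchangeCasalSketch(volatile int32_t* dest,
                                          int32_t exchange,
                                          int32_t comparand)
{
#if defined(__aarch64__) && defined(__APPLE__)
    // CASAL compares *dest with 'previous'; on a match it stores 'exchange'.
    // Either way, 'previous' receives the old memory value.
    int32_t previous = comparand;
    __asm__ volatile(
        ".arch_extension lse\n\t"
        "casal %w[prev], %w[xchg], %[mem]"
        : [prev] "+r"(previous), [mem] "+Q"(*dest)
        : [xchg] "r"(exchange)
        : "memory");
    return previous;
#else
    // Fallback for targets this sketch does not cover.
    return __sync_val_compare_and_swap(dest, comparand, exchange);
#endif
}
```

A templated Interlocked API would need one such body per operand width (`casalb`, `casalh`, and the 64-bit form), which is likely where the "tricky with templates" remark comes in.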

kunalspathak · 2022-06-30T23:14:58Z

Replaced by #71512

Do not use full barrier for osx/arm64

c33e88d

dotnet-issue-labeler bot added the area-GC-coreclr label Jun 20, 2022

ghost assigned kunalspathak Jun 20, 2022

kunalspathak mentioned this pull request Jun 20, 2022

Windows/Arm64: Use 8.1 atomic instructions if they are available #70921

Merged

jkotas reviewed Jun 21, 2022

kunalspathak closed this Jun 30, 2022

kunalspathak mentioned this pull request Jul 5, 2022

Unix arm64 atomics #71512

Merged

ghost locked as resolved and limited conversation to collaborators Jul 31, 2022

Review thread comments, in order (all Jun 21, 2022): jkotas, EgorBo, jkotas, kunalspathak, EgorBo, EgorBo, jkotas, VSadov.