-
Notifications
You must be signed in to change notification settings - Fork 433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA atomic_fetch_sub for doubles is hitting CAS instead of intrinsic #1624
Labels
Bug
Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)
Milestone
Comments
crtrott
added
Bug
Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)
Blocks Promotion
Overview issue for release-blocking bugs
labels
May 16, 2018
woah |
This caused a huge performance regression in LAMMPS. |
crtrott
added
InDevelop
and removed
Blocks Promotion
Overview issue for release-blocking bugs
labels
May 16, 2018
I confirm that #1627 fixes the performance regression in LAMMPS. Thanks. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
This manifested in LAMMPS and ExaMiniMD where atomic_fetch_sub was exercised via the -= operator of an atomic view.
The text was updated successfully, but these errors were encountered: