forked from tensorflow/tensorflow
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Tensorflow CUDA helper: custom impl of atomicMax(uint64..).
The motivation is this particular overload is only available for compute capability >= 3.5. We'd like to make sure any callers (the Multinomial GPU kernel) have consistent semantics, so we replace the atomicExch() workaround with this custom impl, when needs be. Change: 123468056
- Loading branch information
1 parent
5185d44
commit ef7013c
Showing
2 changed files
with
15 additions
and
14 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters