Dear all,
thank you for these beautiful examples, really helpful!
My apologies if this is not the right place to ask; feel free to close this issue.
I am not opening this issue because I have a problem, but rather because I am trying to use openacc-interoperability with the `cuda_fp16` half-precision intrinsics.
I have looked at both `openacc_c_main` and `openacc_cuda_device` for inspiration.
My changes are here, for `openacc_cuda_device`: master...georgebisbas:wip_fp16
I am working on a V100 and I am using:
and
Code compiles:
but seems to be crashing when calling `foo`:
I have been able so far to compile and execute with ease mixed-precision code: https://github.com/NVIDIA-developer-blog/code-samples/tree/master/posts/mixed-precision
and bare `openacc` code, as well as the openacc+cuda examples of this repository (openacc-interoperability).
Any insight would be extremely helpful.
Regards,
--George