PAMID: MPI_Allreduce/MPI_Reduce coredump w/ DOUBLE_INT datatype
This commit includes multiple fixes:
- Fix MPI_IN_PLACE checking. cudaPointerGetAttributes returns true for
MPI_IN_PLACE, which causes issues. We now check for MPI_IN_PLACE
before passing the pointer to CUDA.
- Enable PAMID geometries (needed to reach the PAMID collectives) when
MP_CUDA_AWARE=yes. This allows CUDA buffers to be intercepted.
- Disable FCA when MP_CUDA_AWARE=yes, even if the user enables FCA.
- Copy the user recv buffer into the temp host recv buffer before the
collective starts, especially in MPI_IN_PLACE cases.
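
A minimal sketch of the MPI_IN_PLACE guard and the recv-buffer staging
described in the list, not the actual PAMID code. It is plain C with no MPI or
CUDA dependency: IN_PLACE_SENTINEL, stub_query_is_device, is_device_buffer
and stage_buffers are hypothetical names, the stub only imitates a pointer
query that wrongly answers "device" for the sentinel, and memcpy stands in
for whatever device-to-host copy the real code performs.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for MPI_IN_PLACE: a sentinel, not a real address. */
#define IN_PLACE_SENTINEL ((const void *)(intptr_t)-1)

/* Hypothetical stub for the CUDA pointer query. It imitates the problem in
 * the commit message by answering "device" for the sentinel. */
static bool stub_query_is_device(const void *ptr)
{
    return ptr == IN_PLACE_SENTINEL;
}

/* Guard: test for the sentinel first, so it never reaches the query. */
static bool is_device_buffer(const void *buf)
{
    if (buf == IN_PLACE_SENTINEL)
        return false;
    return stub_query_is_device(buf);
}

/* Copy the user's recv buffer into the temporary host buffer before the
 * collective runs, and return the pointer that holds the input data.
 * In the in-place case the input lives in the recv buffer, so the staged
 * copy is the input. */
static const void *stage_buffers(const void *sendbuf, const void *recvbuf,
                                 void *tmp_recv, size_t nbytes)
{
    assert(recvbuf != NULL && tmp_recv != NULL);
    memcpy(tmp_recv, recvbuf, nbytes);
    return sendbuf == IN_PLACE_SENTINEL ? (const void *)tmp_recv : sendbuf;
}
```

Checking the sentinel before the query keeps the guard independent of how
the query treats non-address values, and staging the recv buffer
unconditionally keeps in-place input from being lost.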
(ibm) D203255
Signed-off-by: Tsai-Yang (Alan) Jea <tjea@us.ibm.com>