- 13 Jan, 2015 2 commits
-
-
Wesley Bland authored
There was an accidental ADI breakage earlier when MPI level codes would query into the dev part of the MPID request object. This commit removes that breakage by adding a new macro into the mpiimpl.h file to portably check whether a request is anysource. For now, in pamid, this macro always evaluates to 0. This can easily be fixed by overwriting it in the pamid code, but since pamid doesn't support FT, it won't have any functional change either. Signed-off-by:
Huiwei Lu <huiweilu@mcs.anl.gov>
-
Wesley Bland authored
It was pointed out that by putting this in a macro and failing silently when unimplemented, this make things challenging for derivatives which will implement this function in the future. By moving this to an MPID level function, it becomes more obvious that the function should be implemented later. Signed-off-by:
Huiwei Lu <huiweilu@mcs.anl.gov>
-
- 08 Jan, 2015 1 commit
-
-
Su Huang authored
Signed-off-by:
Sameh Sharkawi <sssharka@us.ibm.com>
-
- 26 Nov, 2014 1 commit
-
-
Wesley Bland authored
Since pamid doesn't include any of the fault tolerance functions, it should never say that a message is pending failure. We also can't call abort in here since the function is usind at the MPI layer. Signed-off-by:
Paul Coffman <pkcoff@us.ibm.com>
-
- 06 Nov, 2014 1 commit
-
-
Wesley Bland authored
This function will check to see if a request can be completed if it involves MPI_ANY_SOURCE. An any source request cannot be completed if the communicator is not collectively active. Signed-off-by:
Huiwei Lu <huiweilu@mcs.anl.gov>
-
- 23 Sep, 2014 1 commit
-
-
Sameh Sharkawi authored
Added an empty implementation for MPID_Comm_get_all_failed_procs and MPID_Comm_revoke in pamid. Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com>
-
- 31 Jul, 2014 2 commits
-
-
Wesley Bland authored
Adds function implementing an agreement algorithm for the user. This function lets the user manually perform an agreement as well as detect unacknowledged failures. Signed-off-by:
Junchao Zhang <jczhang@mcs.anl.gov>
-
Wesley Bland authored
This commit adds the new functions MPI(X)_COMM_FAILURE_ACK and MPI(X)_COMM_FAILURE_GET_ACKED. These two functions together allow the user to get the group of failed processes. Most of the implementation for this is pushed into the MPID layer since some systems won't support this (PAMI). The existing function MPIDI_CH3U_Check_for_failed_procs has been modified to give back the group of acknowledged failed processes. There is an inefficiency here in that the list of failed processes is retrieved from PMI and parsed every time the user calls both failure_ack and get_acked, but this means we don't have to try to cache the list that comes back from PMI (which could potentially be expensive, but would have some cost even in the failure-free case). This commit adds a failed to the MPID_Comm structure. There is now a field called last_ack_rank. This is a single integer that stores the last acknowledged failure for this communicator which is used to determine when to stop parsing when getting back the list of acknowledged failed processes. Lastly, this commit includes a test to make sure that all of the above works (test/mpi/ft/failure_ack). This tests that a failure is appropriately included in the failed group and excluded if the failure was not previously acknowledged. Signed-off-by:
Junchao Zhang <jczhang@mcs.anl.gov>
-
- 30 Jul, 2014 1 commit
-
-
Wesley Bland authored
A few MPIX functions pertaining to fault tolerance were added to MPICH a while back. These functions are no longer applicable given the new ULFM implementation so they are being removed here. Signed-off-by:
Junchao Zhang <jczhang@mcs.anl.gov>
-
- 08 Jan, 2014 1 commit
-
-
added MPID_Win_allocate_shared and MPID_Win_shared_query support and fixes on passing incorrect rank to MPIDI_WinCtrlSend in MPIDI_WinPost_post and MPIDI_WinComplete_post minor fixes for 'win allocate shared' - compiles for bgq - UNRESOLVED: undefined shm key on bgq, or any other time the required environment variables are not set - UNRESOLVED: multiple calls to 'win allocate shared' will use the same shm key (ibm) F189033 Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com>
-
- 31 Dec, 2013 2 commits
-
-
(ibm) F189037 Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com>
-
- add check epoch - changes after intergrating Sameer's get_accumulate - update to remove dependency on mpid data structure (ibm) F189038 Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com> Minor modifications by Pavan to minimize the changes in mpir_request.c, and to reduce duplication. Collective requests and RMA requests are handled the same way. Signed-off-by:
Pavan Balaji <balaji@mcs.anl.gov>
-
- 18 Dec, 2013 1 commit
-
-
Sameer Kumar authored
Does not include the win request tls optimization Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com>
-
- 04 Dec, 2013 1 commit
-
-
Michael Blocksome authored
The following functions are added: - MPI_Win_allocate - MPI_Win_allocate_dynamic - MPI_Win_attach - MPI_Win_deatch - MPI_Win_lock_all - MPI_Win_unlock_all - MPI_Win_sync - MPI_Win_flush (ibm) F189033 Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com>
-
- 12 Apr, 2013 1 commit
-
-
Haizhu Liu authored
(ibm) F182392 (ibm) 084d46ff209380813e5a8822ae6b4c517fc1dc42 (ibm) 910067ee7018c6616568457a80644704271b3b82 Signed-off-by:
Michael Blocksome <blocksom@us.ibm.com>
-
- 18 Sep, 2012 1 commit
-
-
Pavan Balaji authored
-
- 27 Aug, 2012 1 commit
-
-
Pavan Balaji authored
No reviewer.
-