1. 13 Jan, 2015 2 commits
    • Wesley Bland's avatar
      Remove ADI breakage introduced earlier · 6f646ca0
      Wesley Bland authored
      
      
      There was an accidental ADI breakage earlier when MPI level codes would
      query into the dev part of the MPID request object. This commit removes
      that breakage by adding a new macro into the mpiimpl.h file to portably
      check whether a request is anysource. For now, in pamid, this macro
      always evaluates to 0. This can easily be fixed by overwriting it in the
      pamid code, but since pamid doesn't support FT, it won't have any
      functional change either.
      Signed-off-by: default avatarHuiwei Lu <huiweilu@mcs.anl.gov>
      6f646ca0
    • Wesley Bland's avatar
      Move MPID_Comm_AS_enabled to MPID layer function · d1ab9e68
      Wesley Bland authored
      
      
      It was pointed out that by putting this in a macro and failing silently
      when unimplemented, this make things challenging for derivatives which
      will implement this function in the future. By moving this to an MPID
      level function, it becomes more obvious that the function should be
      implemented later.
      Signed-off-by: default avatarHuiwei Lu <huiweilu@mcs.anl.gov>
      d1ab9e68
  2. 08 Jan, 2015 1 commit
  3. 26 Nov, 2014 1 commit
  4. 06 Nov, 2014 1 commit
  5. 23 Sep, 2014 1 commit
  6. 31 Jul, 2014 2 commits
    • Wesley Bland's avatar
      Add MPIX_Comm_agree · 1f0ee136
      Wesley Bland authored
      
      
      Adds function implementing an agreement algorithm for the user. This function
      lets the user manually perform an agreement as well as detect unacknowledged
      failures.
      Signed-off-by: default avatarJunchao Zhang <jczhang@mcs.anl.gov>
      1f0ee136
    • Wesley Bland's avatar
      Add MPIX_Comm_failure_ack/get_acked · 8652e0ad
      Wesley Bland authored
      
      
      This commit adds the new functions MPI(X)_COMM_FAILURE_ACK and
      MPI(X)_COMM_FAILURE_GET_ACKED. These two functions together allow the user to
      get the group of failed processes.
      
      Most of the implementation for this is pushed into the MPID layer since some
      systems won't support this (PAMI). The existing function
      MPIDI_CH3U_Check_for_failed_procs has been modified to give back the group of
      acknowledged failed processes. There is an inefficiency here in that the list
      of failed processes is retrieved from PMI and parsed every time the user calls
      both failure_ack and get_acked, but this means we don't have to try to cache
      the list that comes back from PMI (which could potentially be expensive, but
      would have some cost even in the failure-free case).
      
      This commit adds a failed to the MPID_Comm structure. There is now a field
      called last_ack_rank. This is a single integer that stores the last
      acknowledged failure for this communicator which is used to determine when to
      stop parsing when getting back the list of acknowledged failed processes.
      
      Lastly, this commit includes a test to make sure that all of the above works
      (test/mpi/ft/failure_ack). This tests that a failure is appropriately included
      in the failed group and excluded if the failure was not previously
      acknowledged.
      Signed-off-by: default avatarJunchao Zhang <jczhang@mcs.anl.gov>
      8652e0ad
  7. 30 Jul, 2014 1 commit
  8. 08 Jan, 2014 1 commit
    • Su Huang's avatar
      PAMID: MPID_Win_allocate_shared and MPID_Win_shared_query support · c6d910c7
      Su Huang authored and Pavan Balaji's avatar Pavan Balaji committed
      
      
      added MPID_Win_allocate_shared and MPID_Win_shared_query support and
      fixes on passing incorrect rank to MPIDI_WinCtrlSend in MPIDI_WinPost_post
      and MPIDI_WinComplete_post
      
      minor fixes for 'win allocate shared'
      
      - compiles for bgq
      - UNRESOLVED: undefined shm key on bgq, or any other time the required
        environment variables are not set
      - UNRESOLVED: multiple calls to 'win allocate shared' will use the same
        shm key
      
      (ibm) F189033
      Signed-off-by: default avatarMichael Blocksome <blocksom@us.ibm.com>
      c6d910c7
  9. 31 Dec, 2013 2 commits
  10. 18 Dec, 2013 1 commit
  11. 04 Dec, 2013 1 commit
  12. 12 Apr, 2013 1 commit
  13. 18 Sep, 2012 1 commit
  14. 27 Aug, 2012 1 commit