- 12 Nov, 2014 3 commits
-
-
Pavan Balaji authored
Signed-off-by:
Antonio Pena Monferrer <apenya@mcs.anl.gov>
-
Pavan Balaji authored
We now use a target structure for each target ID that we want to send data to. This allows us to separate out target-specific states and more cleanly manage operations to a single target. Signed-off-by:
Antonio Pena Monferrer <apenya@mcs.anl.gov>
-
Full redesign, mainly of the functions in ptl_nm.c and the communications involving the "control" portal. Still some problems with flow control. Signed-off-by:
Ken Raffenetti <raffenet@mcs.anl.gov>
-
- 06 Nov, 2014 1 commit
-
-
Kenneth Raffenetti authored
There is a complete overhaul of this file on the way, but in the meantime we raise these limits to prevent deadlock in MPI_Finalize with process count >= 12. No reviewer.
-
- 03 Nov, 2014 1 commit
-
-
Portals4 by itself does not provide any flow-control. This needs to be managed by an upper-layer, such as MPICH. Before this patch we were relying on a bunch of unexpected buffers that were posted to the portals library to manage unexpected messages. However, since portals asynchronously pulls out messages from the network, if the application is delayed, it might result in the unexpected buffers being filled out and the portal disabled. This would cause MPICH to abort. In this patch, we implement an initial version of flow-control that allows us to reenable the portal when it gets disabled. All this is done in the context of the "rportals" wrappers that are implemented in the rptl.* files. We create an extra control portal that is only used by rportals. When the primary data portal gets disabled, the target sends PAUSE messages to all other processes. Once each process confirms that it has no outstanding packets on the wire (i.e., all packets have either been ACKed or NACKed), it sends a PAUSE-ACK message. When the target receives PAUSE-ACK messages from all processes (thus confirming that the network traffic to itself has been quiesced), it reenables the portal and sends an UNPAUSE message to all processes. This patch still does not deal with origin-side resource exhaustion. This can happen, for example, if we run out of space on the event queue on the origin side. Signed-off-by:
Ken Raffenetti <raffenet@mcs.anl.gov>
-
- 20 Oct, 2014 1 commit
-
-
Pavan Balaji authored
We were not setting the function states correctly in a bunch of functions. Modifications by Wesley to split up big commit. Signed-off-by:
Wesley Bland <wbland@anl.gov> Signed-off-by:
Pavan Balaji <balaji@anl.gov>
-
- 23 Sep, 2014 1 commit
-
-
Xin Zhao authored
Pass tail pointer to single direction linked-list API so that we optimize CONCAT and APPEND operations from O(N) to O(1). Note that we do not need MPL_LL_APPEND_VS2008 anymore because we no longer need to work around temporary variable. Similiarly, we can support MPL_LL_CONCAT for VS2008 now. Signed-off-by:
Pavan Balaji <balaji@anl.gov>
-
- 26 Oct, 2012 2 commits
-
-
Darius Buntinas authored
-
Darius Buntinas authored
-