[Mpi3-ft] Ticket #324: Clarify MPI_ERRORS_ARE_FATAL scope of abort
wbland at mcs.anl.gov
Mon May 13 11:05:15 CDT 2013
After looking at this ticket some more, Aurelien and I were confused about what the forum at large objected to. Some of the objections Dave reported on the ticket appear to have come from a misunderstanding in the forum of what the ticket meant. The proposed plan at this point is to discuss the ticket during our plenary in San Jose and pin down the objections. We can then either bring a new version of the ticket, if changes are necessary, or start the process again with the current text if it is good.
On May 7, 2013, at 4:53 PM, Wesley Bland <wbland at mcs.anl.gov> wrote:
> author: jjhursey
> This ticket essentially links MPI_ERRORS_ARE_FATAL on a communicator to calling MPI_ABORT on the communicator, i.e. only the processes in that communicator are aborted, while other communicators could potentially remain functional.
> There was much discussion on the ticket about the scope of this change, and as a result the ticket has been stagnant for about a year. However, I don't think the changes here should be too controversial. According to the ticket, the main argument against it at the Japan meeting was that some types of functions have no request that can be used for error checking. When an error occurs in such a function, the entire application would be forced to fall back to MPI_ERRORS_ARE_FATAL despite having set another error handler, which makes FT difficult. Some alternate text was provided on the ticket.
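To make the scoping described above concrete, here is a minimal toy model in plain C. It does not use MPI and is not the ticket's text or any implementation's behavior. All names in it (toy_comm, toy_comm_create, toy_fatal_error, toy_comm_usable, MAX_RANKS) are hypothetical and exist only for illustration. The assumption is the reading given in the summary: a fatal error on a communicator aborts only the processes in that communicator, and communicators that share no process with it could remain usable.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_RANKS 64 /* hypothetical bound for this toy model */

/* A communicator modeled as a set of world ranks (not an MPI type). */
typedef struct {
    bool member[MAX_RANKS];
} toy_comm;

/* Build a toy communicator from a list of world ranks. */
toy_comm toy_comm_create(const int *ranks, size_t n)
{
    toy_comm c = {{false}};
    for (size_t i = 0; i < n; i++) {
        assert(ranks[i] >= 0 && ranks[i] < MAX_RANKS);
        c.member[ranks[i]] = true;
    }
    return c;
}

/* Model of the proposed scope: a fatal error on comm aborts only
 * the processes that are members of comm. */
void toy_fatal_error(const toy_comm *comm, bool alive[MAX_RANKS])
{
    for (int r = 0; r < MAX_RANKS; r++) {
        if (comm->member[r]) {
            alive[r] = false;
        }
    }
}

/* In this model a communicator stays usable only while every one
 * of its members is still alive. */
bool toy_comm_usable(const toy_comm *comm, const bool alive[MAX_RANKS])
{
    for (int r = 0; r < MAX_RANKS; r++) {
        if (comm->member[r] && !alive[r]) {
            return false;
        }
    }
    return true;
}
```

In this model, with ranks 0 through 3 alive, a fatal error on a communicator holding ranks 0 and 1 leaves a communicator holding ranks 2 and 3 usable, while a communicator spanning all four ranks is no longer usable.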