[Mpi3-ft] Process failure document

Richard Graham rlgraham at ornl.gov
Tue Nov 4 15:33:46 CST 2008


I have captured a lot of what we have discussed about process
fault-tolerance, and filled in more missing gaps to help move us a long a
bit faster in our discussions.  Please take a look at the document before
the call tomorrow.  I would like to pick up discussing what to do when
collective communications fail.  There are still details missing that need
to be added.  No API¹s at this stage, just the ³model².  I ran this past 3
different application groups today ­ this seems to be along the lines of
what they are looking for, and they had some very useful comments...

Rich
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20081104/12cab745/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: process_failure_tech.txt
Type: application/octet-stream
Size: 6474 bytes
Desc: not available
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20081104/12cab745/attachment.obj>


More information about the mpiwg-ft mailing list