[Mpi3-ft] MPI Fault Tolerance scenarios

Greg Bronevetsky bronevetsky1 at llnl.gov
Mon Mar 2 12:41:15 CST 2009


>What I'm proposing is:
>
>- Define a standard set of FT error handlers for a defined set of 
>policies (see, e.g., Bronis' proposal aired at the telecon: nothing, 
>local MPI calls, pt2pt w/o affected process, coll w/o affected 
>process, pt2pt w/ affected process, coll w/ affected process)
>- Provide a prototype implementation of those handlers in terms of 
>the proposed API
>- Allow implementors to provide error handlers that do not use this 
>API, if they find a better way of fixing particular FT issues up
>- Allow users to create their own error handlers using the proposed 
>API, to which end the API should be functional inside an error handler

Alexander, just to clarify, are you suggesting that we have two APIs, 
one low-level API and another API that consists of easy-to-use error handlers?


Greg Bronevetsky
Post-Doctoral Researcher
1028 Building 451
Lawrence Livermore National Lab
(925) 424-5756
bronevetsky1 at llnl.gov
http://greg.bronevetsky.com 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20090302/82e382a9/attachment.html>


More information about the mpiwg-ft mailing list