[Mpi3-ft] Con Call on 1/4/2009
Greg Bronevetsky
bronevetsky1 at llnl.gov
Tue Jan 20 18:54:40 CST 2009
Here's my quick writeup of the major problems we discussed with
writing modular apps on top of our proposed MPI fault tolerance spec,
along with an approach that makes it relatively easy to write
module-specific error recovery algorithms without worrying about
other modules. I've attached a PDF version as well as a txt version
that will be easier to edit.
Greg Bronevetsky
Post-Doctoral Researcher
1028 Building 451
Lawrence Livermore National Lab
(925) 424-5756
bronevetsky1 at llnl.gov
At 06:58 PM 1/13/2009, Richard Graham wrote:
>OK, we will resume the calls next week, 1/21/2009.
>
>Rich
>
>
>On 1/13/09 11:42 AM, "Greg Bronevetsky" <bronevetsky1 at llnl.gov> wrote:
>
> >
> >> Unfortunately, for reasons out of [my] control, I did not manage to
> >> get the time to update the wiki and I doubt I will find any time
> >> before the call tomorrow. I'll have time to get back to this starting
> >> from tomorrow morning.
> >>
> >> I second your idea to cancel the call tomorrow.
> >
> > I have a protocol worked out to do micro-rollbacks that will work
> > well if we add to the API some kind of asynchronous event
> > notification mechanism like active messages. It will not work as well
> > without the extension. I'll update George's document once it's posted
> > so that we have a unified document that describes the problem and the
> > proposed solutions.
> >
> > Greg Bronevetsky
> > Post-Doctoral Researcher
> > 1028 Building 451
> > Lawrence Livermore National Lab
> > (925) 424-5756
> > bronevetsky1 at llnl.gov
> >
> > _______________________________________________
> > mpi3-ft mailing list
> > mpi3-ft at lists.mpi-forum.org
> > http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-ft
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Support for Developing Fault Tolerant Modular MPI Applications.pdf
Type: application/pdf
Size: 177816 bytes
Desc: not available
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20090120/2b59fc3d/attachment-0001.pdf>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: Support for Developing Fault Tolerant Modular MPI Applications.txt
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20090120/2b59fc3d/attachment-0001.txt>