[MPI3-IO] [Mpi3-ft] Fault Tolerance & I/O Discussion

George Bosilca bosilca at eecs.utk.edu
Wed Feb 22 23:32:00 CST 2012


The promised file.



  george.

On Feb 23, 2012, at 00:28 , George Bosilca wrote:

> Based on the discussion and opinions expressed during this call, the new FT proposal has been extended to include wording about I/O. The I/O model is very synchronous and provides a single I/O-related function. This function can be used to invalidate a file, preempting any subsequent operation on it. The file then has to be closed and, if necessary, reopened on a new communicator (a smaller version of the original one).
> 
> The current version of the FT proposal is attached to this email, and can be accessed in the working group wiki at https://svn.mpi-forum.org/trac/mpi-forum-web/wiki/User_Level_Failure_Mitigation.
> 
> We expect to have a final version of the proposal by Monday noon EST, as requested by the MPI Forum, so that it can have a first reading at the next meeting. We are looking forward to your comments.
> 
>  george.
> 
> On Feb 14, 2012, at 21:56 , Josh Hursey wrote:
> 
>> Just a reminder that we are going to meet at 11 am Eastern on Wednesday to continue our discussion.
>> 
>> Thanks,
>> Josh
>> 
>> On Wed, Feb 8, 2012 at 1:24 PM, Josh Hursey <jjhursey at open-mpi.org> wrote:
>> We will meet Wednesday, Feb. 15 at 11 am Eastern to continue our discussion of fault tolerance in the I/O chapter.
>> 
>> We can use the following teleconf information:
>>   US Toll Free number: 877-801-8130
>>   Toll number: 1-203-692-8690
>>   Access Code: 1044056
>> 
>> Thanks,
>> Josh
>> 
>> On Mon, Feb 6, 2012 at 2:40 PM, Josh Hursey <jjhursey at open-mpi.org> wrote:
>> We had a good discussion today. Attached are some notes that I took from the call.
>> 
>> There were a few questions that we were still discussing at the end of the call. As a result, we are going to try to set up another teleconf.
>> 
>> Below is a doodle poll to pick a date/time:
>>    http://www.doodle.com/zw83eht6hh8mwmic
>> 
>> If you are interested in attending this teleconf, please fill out the poll by 2 pm Eastern on Wednesday, Feb. 8.
>> 
>> In the meantime, let us keep discussing these issues on the FT and I/O mailing lists. My preference would be to discuss them only on the FT mailing list, as there are more FT folks over there, and then we would not distract from the discussions about the other I/O tickets.
>> 
>> Thanks,
>> Josh
>> 
>> On Mon, Feb 6, 2012 at 10:55 AM, Josh Hursey <jjhursey at open-mpi.org> wrote:
>> Just a reminder that we are meeting today at Noon Eastern. Enclosed are the call-in details and a link to the FT stabilization proposal.
>> 
>> Thanks,
>> Josh
>> 
>> 
>> On Thu, Feb 2, 2012 at 2:46 PM, Josh Hursey <jjhursey at open-mpi.org> wrote:
>> We will meet Monday, Feb. 6 from 12-1 pm EST/New York to discuss I/O in the context of the fault tolerance proposal (or the Super Bowl if we get bored).
>> 
>> We can use the following teleconf information:
>>   US Toll Free number: 877-801-8130
>>   Toll number: 1-203-692-8690
>>   Access Code: 1044056
>> 
>> The Run-Through Stabilization proposal can be found attached to the ticket:
>>   https://svn.mpi-forum.org/trac/mpi-forum-web/ticket/276
>>   https://svn.mpi-forum.org/trac/mpi-forum-web/attachment/ticket/276/FTWG-Process-FT-Draft-2011-12-20.pdf
>> 
>> We will be primarily focusing on section 17.12 of that document. I will try to send out a reminder the day of.
>> 
>> Thanks,
>> Josh
>> 
>> On Tue, Jan 31, 2012 at 5:04 PM, Josh Hursey <jjhursey at open-mpi.org> wrote:
>> Try the new link (I had to close the other poll due to a timezone problem):
>>   http://www.doodle.com/yifhpi5emyyzrspa
>> 
>> -- Josh
>> 
>> 
>> On Tue, Jan 31, 2012 at 4:41 PM, Mohamad Chaarawi <chaarawi at hdfgroup.org> wrote:
>> Hi Josh,
>> 
>> When I click the link, it says the poll has been deleted.
>> 
>> Thanks,
>> Mohamad
>> 
>> 
>> On 01/31/2012 11:52 AM, Josh Hursey wrote:
>>> (Cross posted to both the I/O and FT MPI-3 listservs)
>>> 
>>> During the FT plenary session at the Jan. MPI Forum meeting there were some concerns about fault tolerance semantics in the I/O chapter. We did not have much time to fully discuss the additional semantics during the meeting. To make sure that we push towards a complete set of semantics useful for the I/O community in the next draft, I would like to have a teleconf to discuss the I/O chapter of the FT proposal, preferably in the next week and a half (so we have time to fine-tune things before the next forum meeting).
>>> 
>>> Below is a link to a doodle poll to find a good time for a teleconf. If you are interested in participating in this discussion, please fill this poll out by 2 PM Eastern on Thurs. Feb. 2 so we can set the date/time.
>>>    http://www.doodle.com/s3hz9daeh8pn483m
>>> 
>>> Thanks,
>>> Josh
>>> 
>>> -- 
>>> Joshua Hursey
>>> Postdoctoral Research Associate
>>> Oak Ridge National Laboratory
>>> http://users.nccs.gov/~jjhursey
>>> 
>>> 
>>> _______________________________________________
>>> MPI3-IO mailing list
>>> MPI3-IO at lists.mpi-forum.org
>>> http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-io
>> 
>> _______________________________________________
>> mpi3-ft mailing list
>> mpi3-ft at lists.mpi-forum.org
>> http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-ft
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: mpi3ft.pdf
Type: application/pdf
Size: 156070 bytes
Desc: not available
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-io/attachments/20120223/f83a9ffd/attachment-0001.pdf>