[Mpi3-ft] FTWG conference call today
Aurélien Bouteiller
bouteill at icl.utk.edu
Wed Jan 23 10:33:47 CST 2013
Yes,
you have not received the email yet ?
Aurelien
Le 23 janv. 2013 à 11:10, "Sur, Sayantan" <sayantan.sur at intel.com> a écrit :
> Hi,
>
> Is there a meeting today?
>
> Thanks,
> Sayantan
>
>> -----Original Message-----
>> From: mpi3-ft-bounces at lists.mpi-forum.org [mailto:mpi3-ft-
>> bounces at lists.mpi-forum.org] On Behalf Of Aurélien Bouteiller
>> Sent: Wednesday, January 09, 2013 8:10 AM
>> To: MPI 3.0 Fault Tolerance and Dynamic Process Control working Group
>> Subject: Re: [Mpi3-ft] FTWG conference call today
>>
>> Dear WG members,
>>
>> This is a reminder that according to our planning, we are having our regular
>> phone meeting.
>>
>> Agenda:
>> - Followup on object state discussions
>>
>>
>> Date: Jan. 9, 2012
>> Time: Noon EDT/New York
>> Dial-in information: 218-339-4600
>> Code: 623998#
>>
>>
>> Next Meeting:
>> * Jan. 23, 2013
>>
>> Le 12 déc. 2012 à 13:31, "Sur, Sayantan" <sayantan.sur at intel.com> a écrit :
>>
>>> Hello WG members,
>>>
>>> Josh, Darius and I were on the call. We discussed our assignment to define
>> what happens to objects upon failure. Specifically, what happens to objects
>> that are created locally (i.e. do not require any remote processes to call MPI),
>> but the MPI implementation can store them in a distributed fashion.
>>>
>>> We had a short brainstorming session. The thoughts that were discussed
>> were:
>>>
>>> - We could require of the implementation that after failure and when such
>> objects are accessed, the implementation provides either SUCCESS or
>> FAILURE, i.e. there are no corrupted or partially available objects.
>>> - It could be that some alive ranks can read their objects, whereas others
>> cannot.
>>> - The app could use MPI_Comm_agree to reach consensus on whether all
>> required objects are able to be read on ranks that are alive.
>>> - For some objects, such as Datatype, there are no accessor functions other
>> than when it is used (e.g. Send/recv). It is possible that an MPI
>> implementation could return error when a datatype is used by app, but the
>> internal representation is not available to the implementation. However, this
>> is not very useful as the app then needs a way to discern why a send failed.
>>> - Would it make sense to add *_Check functions to objects to see if they
>> are still available (after failure)?
>>>
>>> Please let me know if I missed something in the notes.
>>>
>>> Sayantan
>>>
>>>
>>>> -----Original Message-----
>>>> From: mpi3-ft-bounces at lists.mpi-forum.org [mailto:mpi3-ft-
>>>> bounces at lists.mpi-forum.org] On Behalf Of Aurélien Bouteiller
>>>> Sent: Wednesday, December 12, 2012 6:33 AM
>>>> To: MPI 3.0 Fault Tolerance and Dynamic Process Control working Group
>>>> Subject: [Mpi3-ft] FTWG conference call today
>>>>
>>>> Dear working group members,
>>>>
>>>> We have our usual biweekly conference call planned for today.
>>>> Unfortunately, nobody from UT is available to attend, but the
>>>> conference call will be setup and available to the group anyway.
>>>>
>>>> We would appreciate if somebody could keep a summary of discussions.
>>>>
>>>>
>>>> Agenda:
>>>> - Followup items from the Meeting
>>>>
>>>>
>>>> Date: Dec 12, 2012
>>>> Time: Noon EDT/New York
>>>> Dial-in information: 218-339-4600
>>>> Code: 623998#
>>>>
>>>>
>>>> Next Meeting:
>>>> * Jan. 9, 2013
>>>>
>>>>
>>>> Please note: Dec. 26 date has been cancelled.
>>>>
>>>>
>>>> --
>>>> * Dr. Aurélien Bouteiller
>>>> * Researcher at Innovative Computing Laboratory
>>>> * University of Tennessee
>>>> * 1122 Volunteer Boulevard, suite 309b
>>>> * Knoxville, TN 37996
>>>> * 865 974 9375
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> mpi3-ft mailing list
>>>> mpi3-ft at lists.mpi-forum.org
>>>> http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-ft
>>>
>>> _______________________________________________
>>> mpi3-ft mailing list
>>> mpi3-ft at lists.mpi-forum.org
>>> http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-ft
>>
>> --
>> * Dr. Aurélien Bouteiller
>> * Researcher at Innovative Computing Laboratory
>> * University of Tennessee
>> * 1122 Volunteer Boulevard, suite 309b
>> * Knoxville, TN 37996
>> * 865 974 9375
>>
>>
>>
>>
>>
>>
>>
>>
>> _______________________________________________
>> mpi3-ft mailing list
>> mpi3-ft at lists.mpi-forum.org
>> http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-ft
>
> _______________________________________________
> mpi3-ft mailing list
> mpi3-ft at lists.mpi-forum.org
> http://lists.mpi-forum.org/mailman/listinfo.cgi/mpi3-ft
--
* Dr. Aurélien Bouteiller
* Researcher at Innovative Computing Laboratory
* University of Tennessee
* 1122 Volunteer Boulevard, suite 309b
* Knoxville, TN 37996
* 865 974 9375
More information about the mpiwg-ft
mailing list