From wesley.bland at intel.com Mon Mar 18 08:13:10 2019 From: wesley.bland at intel.com (Bland, Wesley) Date: Mon, 18 Mar 2019 13:13:10 +0000 Subject: [mpiwg-ft] FTWG Con Call - 2019-03-18 Message-ID: The Fault Tolerance Working Group?s weekly con call is today at 12:00 PM Eastern. Today's agenda: * Upcoming virtual meeting If there's something else that people would like to discuss, please just send an email to the WG so we can get it on the agenda. Thanks, Wesley ......................................................................................................................................... Join from PC, Mac, Linux, iOS or Android: https://tennessee.zoom.us/j/632356722?pwd=lI4_169CGcewIumekTziMw Password: mpiforum Or iPhone one-tap (US Toll): +16468769923,632356722# or +16699006833,632356722# Or Telephone: Dial: +1 646 876 9923 (US Toll) +1 669 900 6833 (US Toll) Meeting ID: 632 356 722 International numbers available: https://zoom.us/u/6uINe Or an H.323/SIP room system: H.323: 162.255.37.11 (US West) or 162.255.36.11 (US East) Meeting ID: 632 356 722 Password: 364216 SIP: 632356722 at zoomcrc.com Password: 364216 ......................................................................................................................................... From lagunaperalt1 at llnl.gov Mon Mar 18 09:53:08 2019 From: lagunaperalt1 at llnl.gov (Laguna Peralta, Ignacio) Date: Mon, 18 Mar 2019 14:53:08 +0000 Subject: [mpiwg-ft] FTWG Con Call - 2019-03-18 In-Reply-To: References: Message-ID: <41ef0ec7-aac5-3f76-49f7-03520cd1e8b7@llnl.gov> I would like to discuss how to get started with writing text for the alternative proposals (e.g., global restart) and respective tickets. This shouldn't take much time, e.g., 10-15 min. Ignacio On 3/18/19 6:13 AM, Bland, Wesley via mpiwg-ft wrote: > The Fault Tolerance Working Group?s weekly con call is today at 12:00 PM Eastern. Today's agenda: > > * Upcoming virtual meeting > > If there's something else that people would like to discuss, please just send an email to the WG so we can get it on the agenda. > > Thanks, > Wesley > > ......................................................................................................................................... > Join from PC, Mac, Linux, iOS or Android: https://tennessee.zoom.us/j/632356722?pwd=lI4_169CGcewIumekTziMw > Password: mpiforum > > Or iPhone one-tap (US Toll): +16468769923,632356722# or +16699006833,632356722# > > Or Telephone: > Dial: > +1 646 876 9923 (US Toll) > +1 669 900 6833 (US Toll) > Meeting ID: 632 356 722 > International numbers available: https://zoom.us/u/6uINe > > Or an H.323/SIP room system: > H.323: 162.255.37.11 (US West) or 162.255.36.11 (US East) > Meeting ID: 632 356 722 > Password: 364216 > > SIP: 632356722 at zoomcrc.com > Password: 364216 > ......................................................................................................................................... > > _______________________________________________ > mpiwg-ft mailing list > mpiwg-ft at lists.mpi-forum.org > https://lists.mpi-forum.org/mailman/listinfo/mpiwg-ft > From wesley.bland at intel.com Mon Mar 18 11:31:54 2019 From: wesley.bland at intel.com (Bland, Wesley) Date: Mon, 18 Mar 2019 16:31:54 +0000 Subject: [mpiwg-ft] FTWG Con Call - 2019-03-18 In-Reply-To: <41ef0ec7-aac5-3f76-49f7-03520cd1e8b7@llnl.gov> References: <41ef0ec7-aac5-3f76-49f7-03520cd1e8b7@llnl.gov> Message-ID: So we had a short discussion of how to move forward with what we see as the two remaining pieces for the larger FT story that we discussed back in December/January (unfortunately it appears all formatting was lost in the archived version): https://lists.mpi-forum.org/pipermail/mpiwg-ft/2019-January/003624.html We agreed that it makes sense to move forward with two people leading two efforts: Aurelien - Function to get failed processes, failure acknowledgement, and communicator-based resilient broadcast that triggers error handling Ignacio - Checkpoint MPI state, Return to previous MPI state X Ignacio will work on a few slides for the next WG meeting that we can discuss and hopefully Aurelien can do the same for his pieces, and we can use that to figure out our larger story to present at the upcoming Virtual meeting on April 10. Then we can move forward with starting on text/implementations/plenaries/etc. One thing we want to try to accomplish is to update the external story that people discuss about the status and progress of Fault Tolerance in the MPI Standard. We want to make sure people understand that we?re making progress in a number of areas and that there are various levels of FT support that people can use in MPI 3.1, MPI 4.0, and future MPI versions. Right now, everyone outside of our group just assumes that it?s ULFM or bust, which isn?t accurate. Thanks, Wesley On Mar 18, 2019, at 9:53 AM, Laguna Peralta, Ignacio > wrote: I would like to discuss how to get started with writing text for the alternative proposals (e.g., global restart) and respective tickets. This shouldn't take much time, e.g., 10-15 min. Ignacio On 3/18/19 6:13 AM, Bland, Wesley via mpiwg-ft wrote: The Fault Tolerance Working Group?s weekly con call is today at 12:00 PM Eastern. Today's agenda: * Upcoming virtual meeting If there's something else that people would like to discuss, please just send an email to the WG so we can get it on the agenda. Thanks, Wesley ......................................................................................................................................... Join from PC, Mac, Linux, iOS or Android: https://tennessee.zoom.us/j/632356722?pwd=lI4_169CGcewIumekTziMw Password: mpiforum Or iPhone one-tap (US Toll): +16468769923,632356722# or +16699006833,632356722# Or Telephone: Dial: +1 646 876 9923 (US Toll) +1 669 900 6833 (US Toll) Meeting ID: 632 356 722 International numbers available: https://zoom.us/u/6uINe Or an H.323/SIP room system: H.323: 162.255.37.11 (US West) or 162.255.36.11 (US East) Meeting ID: 632 356 722 Password: 364216 SIP: 632356722 at zoomcrc.com Password: 364216 ......................................................................................................................................... _______________________________________________ mpiwg-ft mailing list mpiwg-ft at lists.mpi-forum.org https://lists.mpi-forum.org/mailman/listinfo/mpiwg-ft -------------- next part -------------- An HTML attachment was scrubbed... URL: