[mpiwg-ft] FTWG Con Call Today

Aurelien Bouteiller bouteill at icl.utk.edu
Wed Aug 2 12:48:22 CDT 2017


Rob, 

Here is the pointer to the dissertation about a production grade memory database using ULFM as a communication substrate that can deal with different types of faults. The approach taken here is to add a layer of resilient collective to supplement the rest of the API, and to facilitate the integration of this code in the rest of the software infrastructure. The result section at the end shown very good results, basically 1/2 the time of the restart approach (even for small problems with 1GB / process).



Best, 
Aurelien 


> On Jul 19, 2017, at 15:43, Van Der Wijngaart, Rob F <rob.f.van.der.wijngaart at intel.com> wrote:
> 
> Hi Aurélien,
>  
> Please don't forget to pass around the pointer to the PhD dissertation about ULFM experiences in the context of SAP's workloads.
> Meanwhile, following up on our brief discussion during today's meeting, I came across this paper <http://delivery.acm.org/10.1145/2650000/2642776/p63-hassani.pdf?ip=134.134.139.76&id=2642776&acc=ACTIVE%20SERVICE&key=AC116DD66AAF555C%2EAC116DD66AAF555C%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&CFID=787494528&CFTOKEN=40139878&__acm__=1500493397_2a08667a6d3e8d9d5d395a3a713ca7c5> by Tony et al. that addresses common ground between ULFM and FA_MPI. Probably already known by most of you (especially Tony J), but new to me. 
>  
> Rob
>   <>
> -----Original Message-----
> From: mpiwg-ft [mailto:mpiwg-ft-bounces at lists.mpi-forum.org] On Behalf Of Bland, Wesley
> Sent: Wednesday, July 19, 2017 7:00 AM
> To: FTWG <mpiwg-ft at lists.mpi-forum.org>
> Subject: [mpiwg-ft] FTWG Con Call Today
>  
> The Fault Tolerance Working Group’s biweekly con call is today at 3:00 PM Eastern. Today's agenda:
>  
> * Continuing FA-MPI discussion from previous call.
>  
> If there's something else that people would like to discuss, please just send an email to the WG so we can get it on the agenda.
>  
> Thanks, 
> Wesley 
>  
> .........................................................................................................................................
> Join from PC, Mac, Linux, iOS or Android: https://tennessee.zoom.us/j/632356722?pwd=lI4%2F169CGcewIumekTziMw%3D%3D <https://tennessee.zoom.us/j/632356722?pwd=lI4%2F169CGcewIumekTziMw%3D%3D>
>    Password: mpiforum
>  
> Or iPhone one-tap (US Toll):  +14086380968,632356722# or +16465588656,632356722#
>  
> Or Telephone:
>    Dial: +1 408 638 0968 (US Toll) or +1 646 558 8656 (US Toll)
>    Meeting ID: 632 356 722
>    International numbers available: https://tennessee.zoom.us/zoomconference?m=GscM59o_Qoig8v4aJl1OrsnXL-7Blrke <https://tennessee.zoom.us/zoomconference?m=GscM59o_Qoig8v4aJl1OrsnXL-7Blrke>
>  
> Or an H.323/SIP room system:
>    H.323: 162.255.37.11 (US West) or 162.255.36.11 (US East)
>    Meeting ID: 632 356 722
>    Password: 366244
>  
>    SIP: 632356722 at zoomcrc.com <mailto:632356722 at zoomcrc.com>
>    Password: 366244
> .........................................................................................................................................
>  
> _______________________________________________
> mpiwg-ft mailing list
> mpiwg-ft at lists.mpi-forum.org <mailto:mpiwg-ft at lists.mpi-forum.org>
> https://lists.mpi-forum.org/mailman/listinfo/mpiwg-ft <https://lists.mpi-forum.org/mailman/listinfo/mpiwg-ft>_______________________________________________
> mpiwg-ft mailing list
> mpiwg-ft at lists.mpi-forum.org <mailto:mpiwg-ft at lists.mpi-forum.org>
> https://lists.mpi-forum.org/mailman/listinfo/mpiwg-ft <https://lists.mpi-forum.org/mailman/listinfo/mpiwg-ft>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20170802/9241a143/attachment-0002.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Jan Stengler - SAP - FaultTolerantCollectiveCommunicationAlgorithms.pdf
Type: application/pdf
Size: 4472405 bytes
Desc: not available
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20170802/9241a143/attachment-0001.pdf>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpi-forum.org/pipermail/mpiwg-ft/attachments/20170802/9241a143/attachment-0003.html>


More information about the mpiwg-ft mailing list