[mpiwg-ft] ULFM 2.0 release candidate

Aurelien Bouteiller bouteill at icl.utk.edu
Fri Nov 3 18:57:31 CDT 2017


All, 

It is with great pleasure that the Open MPI ULFM team announces the new release candidate for ULFM 2.0.

[https://bitbucket.org/icldistcomp/ulfm2/get/ulfm2.0rc.tar.bz2]

The focus for ULFM 2.0 has been on integration with the current Open MPI master, performance, and stability.

- ULFM is now based on the Open MPI master branch (#689f1be9).
- Fault Tolerance is enabled by default and is controlled with MCA variables.
- Added support for multithreaded modes (MPI_THREAD_MULTIPLE, etc.).
- Added support for non-blocking collective operations (NBC).
- Added support for CMA shared memory transport (Vader).
- Added support for advanced failure detection at the MPI level.
  Implements the algorithm described in "Failure detection and
  propagation in HPC systems" <https://doi.org/10.1109/SC.2016.26>.
- Removed the need for special handling of CID allocation.
- Non-usable components are automatically removed from the build during configure.
- RMA, FILES, and TOPO components are enabled by default; using them in a fault-tolerant execution triggers a warning that they may cause undefined behavior after a failure.
- Bugfixes, bugfixes, bugfixes
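
For readers curious about the failure detection item above, here is a minimal sketch of the general idea of ring-based heartbeat observation. It is not the ULFM implementation: the function names, the direction of observation (each rank watches its nearest live predecessor), and the shared `suspected` set (standing in for failure propagation, assumed instantaneous here) are all assumptions made for illustration. Please refer to the cited paper for the actual algorithm.

```python
def predecessor(rank, suspected, n):
    """Return the nearest rank before `rank` on the ring that is not suspected.

    Returns None when every other rank is suspected.
    (Hypothetical helper, for illustration only.)
    """
    for step in range(1, n):
        candidate = (rank - step) % n
        if candidate not in suspected:
            return candidate
    return None


def is_silent(rank, last_heartbeat, now, timeout):
    """A rank is silent when its last heartbeat is older than `timeout`."""
    return now - last_heartbeat[rank] > timeout


def detect_failures(suspected, last_heartbeat, now, timeout):
    """Run one detection round over a simulated ring.

    suspected      -- set of ranks already reported as failed (updated in place);
                      sharing it models propagation, assumed instantaneous
    last_heartbeat -- list: time of the last heartbeat emitted by each rank
    Returns a list of (observer, failed_rank) pairs found in this round.
    """
    n = len(last_heartbeat)
    reported = []
    for observer in range(n):
        # In this simulation, a crashed (silent) rank observes nobody.
        if observer in suspected or is_silent(observer, last_heartbeat, now, timeout):
            continue
        observed = predecessor(observer, suspected, n)
        # On a timeout, report the failure and reconnect to the next
        # live predecessor, so the ring stays closed.
        while observed is not None and is_silent(observed, last_heartbeat, now, timeout):
            suspected.add(observed)
            reported.append((observer, observed))
            observed = predecessor(observer, suspected, n)
    return reported


if __name__ == "__main__":
    # Five ranks; ranks 2 and 3 stopped emitting heartbeats some time ago.
    heartbeats = [10, 10, 3, 2, 10]
    suspected = set()
    print(detect_failures(suspected, heartbeats, now=10, timeout=5))
    print(sorted(suspected))
```

In this toy run, rank 4 is the one that notices both failures, because it walks back along the ring until it finds a predecessor that still emits heartbeats.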


As usual, you can find more information at [http://fault-tolerance.org/2017/11/03/ulfm-2-0/] or in the (new*) project repository [https://bitbucket.org/icldistcomp/ulfm2/overview].


Happy Hacking,
The Open MPI ULFM team. 


* The ULFM1 repository will go into archival mode.

