<html>


<head>


<meta http-equiv="Content-Type" content="text/html; charset=utf-8">


</head>


<body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; color: rgb(0, 0, 0); font-size: 14px; font-family: Calibri, sans-serif;">


<div>Hello, </div>


<div><br>


</div>


<div>I want to propose an extension to persistent API to allow a blocking MPI_Start call. Currently, MPI_Start calls are non-blocking. So, proposal is something like MPI_Start (for blocking) and MPI_Istart (for non-blocking). Of course, to maintain backward


 compatibility we may have to think of an alternative API. I am not proposing the exact API here. </div>


<div><br>


</div>


<div>The motivation behind the proposal is that having the knowledge whether the corresponding MPI call is blocking or not can give better performance. For example, MPI_Isend followed by MPI_Wait is slower than the MPI_Send because internally MPI_Isend->MPI_Wait


 has to allocate additional data structures (for example, request pointer) and do more work. Similarly, lets look at an example of a bcast collective operation. </div>


<div><br>


</div>


<div>Tree based broadcast can be implemented in two ways:</div>


<ol>


<li>MPI_Recv (recv data from parent) -> FOREACHCHILD – MPI_Send (send data to children)</li><li>MPI_Irecv (recv data from  parent) -> MPI_Wait(wait for recv to complete) -> FOREACHCHILD – MPI_Isend (send data to childrent) -> MPI_WaitAll (wait for sends to complete)</li></ol>


<div>Having only a non-blocking MPI_Start call forces only implementation 2 as implementation 1 has blocking MPI calls. However, implementation 1 can be significantly faster that implementation 2 for small message sizes.</div>


<div><br>


</div>


<div>Looking forward to hear your feedback.</div>


<div><br>


</div>


<div>Thanks,</div>


<div>Akhil </div>


<div><br>


</div>


<div>


<div id="MAC_OUTLOOK_SIGNATURE"></div>


</div>


</body>


</html>