[Mpi-22] [Mpi-forum] MPI 2.2 proposal:resolving MPI_Request_free issues

Jeff Squyres jsquyres at [hidden]
Wed Jul 16 20:59:23 CDT 2008



On Jul 16, 2008, at 7:31 PM, Underwood, Keith D wrote:

>>> [erezh] Correct; this is the scenario I was describing with the RDMA
> write.
>
> It would be interesting to see exactly what the error mode here is.
> Retransmitting corrupted data should be ok, since a correctly  
> delivered
> message means that the retransmit must be dropped.  I suppose that if
> the NIC speaks virtual addresses and the free actually results in a  
> trap
> to the kernel that unmaps the pages, then the NIC could retransmit and
> find that there isn't a valid page table entry...

'zactly.  And then the local completion entry would be a failure -- so  
the sender would [erroneously] think that the message had failed to be  
delivered.

> Solution #5:  Change the advice to users - "...the arrival of the  
> reply
> informs the sender that the send has completed and the send buffer can
> be overwritten.  If the buffer will ever be freed, the application
> should call MPI_Wait or MPI_Cancel instead of MPI_Request_free."

That seems like a good compromise.


-- 
Jeff Squyres
Cisco Systems




More information about the Mpi-22 mailing list