[mpiwg-rma] same_op_no_op
Balaji, Pavan
balaji at anl.gov
Thu Mar 13 12:21:34 CDT 2014
Well, I think they are not allowed. Since the implementation can assume that the operations are the same, if a single process does two accumulate operations, the implementation is allowed to add an assert saying first_op == second_op. That'll blow up the program if they are not the same.
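For illustration, a minimal sketch of what an implementation could do internally under that assumption (win->last_op is an invented internal field, purely hypothetical, not code from any real implementation):

    /* hypothetical implementation-side check in the accumulate path, under same_op */
    if (win->last_op == MPI_OP_NULL)
        win->last_op = op;           /* remember the first operation seen on this window */
    else
        assert(op == win->last_op);  /* blows up if the user mixes operations */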
— Pavan
On Mar 13, 2014, at 12:16 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
> I don't think MPI-3 says they are not allowed, just that the
> implementation "will assume" that more than one operation (other than
> MPI_NO_OP) is never used concurrently. Since the effect of the
> implementation's assumptions cannot be known, I would like to assume
> that concurrent different operations are not necessarily atomic, i.e.,
> implementation-defined, and most certainly not erroneous.
>
> As to screwing this up, yes, I agree on multiple levels :-)
>
> Jeff
>
> On Thu, Mar 13, 2014 at 12:02 PM, Balaji, Pavan <balaji at anl.gov> wrote:
>>
>> MPI-2.2 says that accumulate with different ops are not atomic.
>>
>> MPI-3 says that accumulate with different ops are not allowed (since same_op_no_op is default).
>>
>> I think we screwed that up?
>>
>> — Pavan
>>
>> On Mar 13, 2014, at 11:48 AM, Jeff Hammond <jeff.science at gmail.com> wrote:
>>
>>> It is extremely difficult to see that this is what the MPI-3 standard says.
>>>
>>> First we have this:
>>>
>>> "The outcome of concurrent accumulate operations to the same location
>>> with the same predefined datatype is as if the accumulates were done
>>> at that location in some serial order. Additional restrictions on the
>>> operation apply; see the info key accumulate_ops in Section 11.2.1.
>>> Concurrent accumulate operations with different origin and target
>>> pairs are not ordered. Thus, there is no guarantee that the entire
>>> call to an accumulate operation is executed atomically. The effect of
>>> this lack of atomicity is limited: The previous correctness conditions
>>> imply that a location updated by a call to an accumulate operation
>>> cannot be accessed by a load or an RMA call other than accumulate
>>> until the accumulate operation has completed (at the target).
>>> Different interleavings can lead to different results only to the
>>> extent that computer arithmetics are not truly associative or
>>> commutative. The outcome of accumulate operations with overlapping
>>> types of different sizes or target displacements is undefined."
>>> [11.7.1 Atomicity]
>>>
>>> Then we have this:
>>>
>>> "accumulate_ops — if set to same_op, the implementation will assume
>>> that all concurrent accumulate calls to the same target address will
>>> use the same operation. If set to same_op_no_op, then the
>>> implementation will assume that all concurrent accumulate calls to the
>>> same target address will use the same operation or MPI_NO_OP. This can
>>> eliminate the need to protect access for certain operation types where
>>> the hardware can guarantee atomicity. The default is same_op_no_op."
>>> [11.2.1 Window Creation]
>>>
>>> I was not aware that the definition of info keys was normative, given
>>> that implementations are free to ignore them. Even if the info key text
>>> is normative, one has to infer from the fact that same_op_no_op is the
>>> default info behavior (and thus the default RMA semantic) that accumulate
>>> atomicity is restricted to the case where one uses the same op or
>>> MPI_NO_OP, but not replace.
>>>
>>> The MPI-2.2 spec is unambiguous because it explicitly requires the
>>> same operation in 11.7.1 Atomicity. This text was removed in MPI-3.0
>>> in favor of the info key text.
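>>>
>>> For concreteness, a minimal sketch (error handling omitted; size and
>>> disp_unit are placeholders) of how a user states the stronger
>>> assumption explicitly at window creation:
>>>
>>>     MPI_Info info;
>>>     MPI_Win win;
>>>     void *baseptr;
>>>     MPI_Info_create(&info);
>>>     /* promise: all concurrent accumulates to a given target address
>>>        use one and the same operation */
>>>     MPI_Info_set(info, "accumulate_ops", "same_op");
>>>     MPI_Win_allocate(size, disp_unit, info, MPI_COMM_WORLD, &baseptr, &win);
>>>     MPI_Info_free(&info);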
>>>
>>> Best,
>>>
>>> Jeff
>>>
>>> On Tue, Mar 11, 2014 at 12:04 AM, Balaji, Pavan <balaji at anl.gov> wrote:
>>>>
>>>> MPI-2 defines atomicity for MPI_ACCUMULATE only when the same operation is used, not for any combination of operations.
>>>>
>>>> — Pavan
>>>>
>>>> On Mar 10, 2014, at 11:22 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
>>>>
>>>>> So MPI-2 denied compatibility between replace and non-replace?
>>>>>
>>>>> Jeff
>>>>>
>>>>>> On Mar 11, 2014, at 12:06 AM, "Balaji, Pavan" <balaji at anl.gov> wrote:
>>>>>>
>>>>>>
>>>>>> It doesn’t break backward compatibility. The info argument is still useful when you don’t want to use replace. I don’t see anything wrong with it.
>>>>>>
>>>>>>> On Mar 10, 2014, at 11:01 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
>>>>>>>
>>>>>>> Does this or does this not break backward compatibility w.r.t. MPI-2.2,
>>>>>>> and did we do it intentionally? Unless we did so intentionally and
>>>>>>> explicitly, I will argue that the WG screwed up and the info key+val
>>>>>>> is invalid.
>>>>>>>
>>>>>>> Jeff
>>>>>>>
>>>>>>>> On Mon, Mar 10, 2014 at 11:03 PM, Balaji, Pavan <balaji at anl.gov> wrote:
>>>>>>>>
>>>>>>>> If hardware can implement MPI_SUM, it should be able to implement MPI_SUM with 0 as well.
>>>>>>>>
>>>>>>>> But that’s not a generic solution.
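>>>>>>>>
>>>>>>>> E.g., a sketch of that trick, an atomic read done two ways (target,
>>>>>>>> disp, and win assumed already set up):
>>>>>>>>
>>>>>>>>     int result, zero = 0;
>>>>>>>>     /* atomic read as written: the origin buffer is ignored for MPI_NO_OP */
>>>>>>>>     MPI_Fetch_and_op(&zero, &result, MPI_INT, target, disp, MPI_NO_OP, win);
>>>>>>>>     /* same effect on sum-only hardware: atomic fetch-and-add of zero */
>>>>>>>>     MPI_Fetch_and_op(&zero, &result, MPI_INT, target, disp, MPI_SUM, win);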
>>>>>>>>
>>>>>>>> Jeff: at some point you were planning to bring in a ticket that allows more combinations of operations than just same_op and no_op. Maybe it's worthwhile bringing that up again?
>>>>>>>>
>>>>>>>> — Pavan
>>>>>>>>
>>>>>>>>> On Mar 10, 2014, at 9:26 PM, Jim Dinan <james.dinan at gmail.com> wrote:
>>>>>>>>>
>>>>>>>>> Maybe there's a loophole that I'm forgetting?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Mon, Mar 10, 2014 at 9:43 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
>>>>>>>>> How the hell can I do GA or SHMEM then? Roll my own mutexes and commit perf-suicide?
>>>>>>>>>
>>>>>>>>> Jeff
>>>>>>>>>
>>>>>>>>>> On Mar 10, 2014, at 8:32 PM, Jim Dinan <james.dinan at gmail.com> wrote:
>>>>>>>>>>
>>>>>>>>>> You can't use replace and sum concurrently at a given target address.
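>>>>>>>>>>
>>>>>>>>>> Under the default same_op_no_op, the compliant pattern is to separate
>>>>>>>>>> the two operations in time, roughly like this sketch (inside an
>>>>>>>>>> MPI_Win_lock_all epoch; val, rank, disp, and win assumed set up):
>>>>>>>>>>
>>>>>>>>>>     /* phase 1: MPI_SUM (plus MPI_NO_OP reads) may run concurrently */
>>>>>>>>>>     MPI_Accumulate(&val, 1, MPI_INT, rank, disp, 1, MPI_INT, MPI_SUM, win);
>>>>>>>>>>     MPI_Win_flush_all(win);
>>>>>>>>>>     MPI_Barrier(MPI_COMM_WORLD);  /* ensure no sum is still in flight anywhere */
>>>>>>>>>>     /* phase 2: MPI_REPLACE is now the only operation in use */
>>>>>>>>>>     MPI_Accumulate(&val, 1, MPI_INT, rank, disp, 1, MPI_INT, MPI_REPLACE, win);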
>>>>>>>>>>
>>>>>>>>>> ~Jim.
>>>>>>>>>>
>>>>>>>>>> On Mon, Mar 10, 2014 at 4:30 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
>>>>>>>>>> Given the following, how do I use MPI_NO_OP, MPI_REPLACE and MPI_SUM
>>>>>>>>>> in accumulate/atomic operations in a standard-compliant way?
>>>>>>>>>>
>>>>>>>>>> accumulate_ops — if set to same_op, the implementation will assume
>>>>>>>>>> that all concurrent accumulate calls to the same target address will
>>>>>>>>>> use the same operation. If set to same_op_no_op, then the
>>>>>>>>>> implementation will assume that all concurrent accumulate calls to the
>>>>>>>>>> same target address will use the same operation or MPI_NO_OP. This can
>>>>>>>>>> eliminate the need to protect access for certain operation types where
>>>>>>>>>> the hardware can guarantee atomicity. The default is same_op_no_op.
>>>>>>>>>>
>>>>>>>>>> We discussed this before, and the resolution was not satisfying to me.
>>>>>>>>>>
>>>>>>>>>> Thanks,
>>>>>>>>>>
>>>>>>>>>> Jeff
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> Jeff Hammond
>>>>>>>>>> jeff.science at gmail.com
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> Jeff Hammond
>>>>>>> jeff.science at gmail.com
>>>>>>
>>>>
>>>
>>>
>>>
>>> --
>>> Jeff Hammond
>>> jeff.science at gmail.com
>>
>
>
>
> --
> Jeff Hammond
> jeff.science at gmail.com