The current inline assembly implementation of the atomics lacks "memory" in the clobber lists. This may lead to undesired compiler optimizations. We have never observed an issue, but it should be fixed.
An other bug is in the assembler encoding of the mvh* instructions which take and Address parameter.
An other bug is in the assembler encoding of the mvh* instructions which take and Address parameter.