Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-6448185

Consider adding StoreStore barriers to all constructor ends

XMLWordPrintable

    • Icon: Enhancement Enhancement
    • Resolution: Won't Fix
    • Icon: P4 P4
    • None
    • 6
    • specification
    • generic
    • generic

      The Java Memory Model is a tradeoff between performance and reliability.
      We do not promise sequential consistency, and will not do so, but
      providing more guarantees that we do at present while preserving performance
      is well worth considering.

      For final fields, the JMM guarantees that another thread will never see the field
      in its uninitialized state, even in the absence of a happens-before relationship.

      Many programs make the stronger assumption that this is the case for any field
      initialized in a constructor. It is certainly an easy "mistake" to make,
      and it is difficult to fix without adding much stronger restrictions
      (I'm thinking of making the field volatile).

      The canonical example is a caching int field
      in an otherwise immutable object initialized in a constructor to a -1
      value and then lazily initialized when needed.
      Even without synchronization, this is thread-safe, except that another
      thread might see the field set to zero.

      C2 currently inserts a StoreStore barrier at the end of a constructor
      if it contains a final field. Since there is no actual code emitted
      (on current JDK-targeted hardware)
      and merely some reorderings are no longer permitted if the constructor
      is inlined, it should be very cheap to do this.
      We should run a performance experiment.

      From email:

      > I'm not sure, but I think the memory barrier in c2 ends up emitting no
      > code on sparc and x86,

      Right.

      > but merely prevents further reordering
      > if inlined, which would mean the cost is very low.

      Always worth measuring to find out for sure though. Adding too many
      reordering constraints can make the instruction scheduler unhappy
      and in turn generate code that encounters a lot of memory stalls
      (which would likely be most apparent on P4/xeons).

      (Also: I don't think the escape-analysis stuff is smart enough
      to suppress reordering constraints when objects are proven
      thread-local. This probably needs attention, as does the lack
      of smarts about reloading final fields across various barriers.)

            abuckley Alex Buckley
            martin Martin Buchholz
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved:
              Imported:
              Indexed: