Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-8178811

Minimize the AVX <-> SSE transition penalty through generation of vzeroupper instruction on x86

XMLWordPrintable

    • Icon: Enhancement Enhancement
    • Resolution: Fixed
    • Icon: P2 P2
    • 10
    • 9, 10
    • hotspot
    • b21
    • x86

      This fix minimizes the AVX to SSE and SSE to AVX transition penalty through generation of vzeroupper instruction. With this patch we see zero transitions with penalty per SPECjbb2015 jOPS on BDW and a significant reduction on SKX CPU event vector width mismatch from 65 to 0.01 per SPECjbb2015 jOPS. We have also implemented an enhancement to disable vzeroupper generation for Knights family where the instruction has high penalty and is not recommended. The option UseVzeroupper is used to control generation of vzeroupper instruction and gets set to false on the Knights family.
      We observed ~3% gain on SPECJvm2008 composite result on Skylake.

            vdeshpande Vivek Deshpande (Inactive)
            vdeshpande Vivek Deshpande (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved:

                Estimated:
                Original Estimate - 6 weeks
                6w
                Remaining:
                Remaining Estimate - 2 days
                2d
                Logged:
                Time Spent - Not Specified Time Not Required
                Not Specified