Improve performance of floating point reduction kernels

XMLWordPrintable

    • Type: Enhancement
    • Resolution: Unresolved
    • Priority: P4
    • tbd
    • Affects Version/s: 25
    • Component/s: hotspot

      Our recent runs on different leading edge platforms show that Floating point reduction kernels are better off SLP vectorization than with SLP.

      While ADD MUL shows degraded performance on Granite Rapids AP.
      Min/Max shows performance degradation on Sierra Forest SP (AVX2 only Xeon)

      Please find attached the data.

        1. test_reduction_perf.java
          13 kB
          Jatin Bhateja
        2. reduction_kernel_performance.png
          226 kB
          Jatin Bhateja

            Assignee:
            Jatin Bhateja
            Reporter:
            Jatin Bhateja
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: