JDK-8350463

AArch64: Add vector rearrange support for small lane count vectors


    • b16
    • aarch64

      Currently the AArch64 implementation of vector rearrange is incomplete for vector types with a lane count smaller than 4 (see [1]). As a result, some benchmarks using Long/Double vector types show a large performance gap on NVIDIA Grace, an SVE2 machine with a 128-bit vector size, compared with other SVE and x86 machines.

      Vector rearrange relies on a vector shuffle input, whose payload was previously a byte array. Because the minimum supported vector lane count for byte is 4 on AArch64, the same lane count limitation was also applied to rearrange. However, the payload of a vector shuffle was recently changed to the data type of each vector (e.g. `int` for `IntVector`) [2], so this lane count limitation can be removed for rearrange.
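
      As an illustration of the operation discussed above, here is a minimal scalar sketch of what rearrange computes for a 2-lane long vector (the small lane count case this issue targets). It is a plain-Java model, not the HotSpot or Vector API implementation: the class name `RearrangeSketch`, the method `rearrange`, and the bounds check are hypothetical and exist only for this example. The shuffle indices are stored as `long`, mirroring the idea that the shuffle payload has the same element type as the vector.

```java
import java.util.Arrays;

public class RearrangeSketch {
    // Scalar model of rearrange: result[i] = src[shuffle[i]].
    // The shuffle is held in the same element type as the data (long),
    // which is an assumption made to mirror the description above.
    static long[] rearrange(long[] src, long[] shuffle) {
        if (src.length != shuffle.length) {
            throw new IllegalArgumentException("lane count mismatch");
        }
        long[] result = new long[src.length];
        for (int i = 0; i < src.length; i++) {
            long idx = shuffle[i];
            // Simplified handling: reject any index outside the lane range.
            if (idx < 0 || idx >= src.length) {
                throw new IndexOutOfBoundsException("bad shuffle index: " + idx);
            }
            result[i] = src[(int) idx];
        }
        return result;
    }

    public static void main(String[] args) {
        long[] v = {10L, 20L};     // a 2-lane "long vector"
        long[] swap = {1L, 0L};    // shuffle that swaps the two lanes
        System.out.println(Arrays.toString(rearrange(v, swap))); // prints [20, 10]
    }
}
```

      With only two lanes, the sole non-trivial shuffles are the swap and the two broadcasts, yet without rearrange support for this lane count such operations cannot use the vectorized path on AArch64.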
       
      [1] https://github.com/openjdk/jdk/blob/master/src/hotspot/cpu/aarch64/aarch64_vector.ad#L209
      [2] https://bugs.openjdk.org/browse/JDK-8310691

            xgong Xiaohong Gong