-
Enhancement
-
Resolution: Not an Issue
-
P4
-
21
Currently MulReductionVL is only allowed with avx512dq support.
case Op_MulReductionVL:
if (VM_Version::supports_avx512dq() == false) {
return false;
}
https://github.com/openjdk/jdk/blob/3dfadeebd023efb03a400f2b2656567a4154421a/src/hotspot/cpu/x86/x86.ad#L1437-L1440
The relevant vpmullq instruction exists also on earlier AVX versions. (Ah no, it actually only exists for AVX512, my bad.)
https://www.felixcloutier.com/x86/pmulld:pmullq
See attached Test.java for an example.
Does not vectorize:
./java -XX:-TieredCompilation -Xbatch -XX:CompileCommand=compileonly,Test::test -XX:+TraceSuperWord -XX:UseAVX=2 Test.java
Vectorizes, provided your machine has avx512dq support.
./java -XX:-TieredCompilation -Xbatch -XX:CompileCommand=compileonly,Test::test -XX:+TraceSuperWord -XX:UseAVX=3 Test.java
Code snippet:
static long test(long[] data) {
long mul = 1;
for (int i = 2; i < N-2; i++) {
long v = data[i] + 5;
mul *= v; // MulReductionVL currently requires avx512dq support
}
return mul;
}
I discovered this during the work of JDK-8302139.
case Op_MulReductionVL:
if (VM_Version::supports_avx512dq() == false) {
return false;
}
https://github.com/openjdk/jdk/blob/3dfadeebd023efb03a400f2b2656567a4154421a/src/hotspot/cpu/x86/x86.ad#L1437-L1440
The relevant vpmullq instruction exists also on earlier AVX versions. (Ah no, it actually only exists for AVX512, my bad.)
https://www.felixcloutier.com/x86/pmulld:pmullq
See attached Test.java for an example.
Does not vectorize:
./java -XX:-TieredCompilation -Xbatch -XX:CompileCommand=compileonly,Test::test -XX:+TraceSuperWord -XX:UseAVX=2 Test.java
Vectorizes, provided your machine has avx512dq support.
./java -XX:-TieredCompilation -Xbatch -XX:CompileCommand=compileonly,Test::test -XX:+TraceSuperWord -XX:UseAVX=3 Test.java
Code snippet:
static long test(long[] data) {
long mul = 1;
for (int i = 2; i < N-2; i++) {
long v = data[i] + 5;
mul *= v; // MulReductionVL currently requires avx512dq support
}
return mul;
}
I discovered this during the work of JDK-8302139.
- relates to
-
JDK-8302139 Speed up SuperWord reduction tests
- In Progress