-
Enhancement
-
Resolution: Fixed
-
P3
-
16
-
b27
Vector broadcast operations are inefficient for float and double elements due to the use of Float.floatToIntBits and Double.doubleToLongBits. The raw variants (Float.floatToRawIntBits and Double.doubleToRawLongBits) should be used to avoid NaN checks.
Further, this is an opportunity to review the code generation for broadcast on x86, specifically the choice between broadcast and shuffle instructions.
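To illustrate the difference between the two conversion variants named in the description, here is a minimal sketch using only java.lang.Float and java.lang.Double. The class name RawBitsDemo and its helper methods are hypothetical and are not taken from the Vector API sources. The non-raw methods collapse every NaN to a single canonical bit pattern, which requires a NaN check on each call; the raw methods are specified to skip that step.

```java
// Hypothetical demo class; not part of the JDK or the Vector API implementation.
final class RawBitsDemo {
    // Quiet NaN carrying a non-zero payload (the canonical float NaN is 0x7fc00000).
    static final int FLOAT_PAYLOAD_NAN = 0x7fc00001;
    // Quiet NaN carrying a non-zero payload (the canonical double NaN is 0x7ff8000000000000L).
    static final long DOUBLE_PAYLOAD_NAN = 0x7ff8000000000001L;

    // Canonicalizing conversion: every NaN input maps to the canonical NaN bits.
    static int canonicalBits(float f) {
        return Float.floatToIntBits(f);
    }

    // Raw conversion: no NaN canonicalization is performed.
    static int rawBits(float f) {
        return Float.floatToRawIntBits(f);
    }

    static long canonicalBits(double d) {
        return Double.doubleToLongBits(d);
    }

    static long rawBits(double d) {
        return Double.doubleToRawLongBits(d);
    }

    public static void main(String[] args) {
        float fNaN = Float.intBitsToFloat(FLOAT_PAYLOAD_NAN);
        double dNaN = Double.longBitsToDouble(DOUBLE_PAYLOAD_NAN);

        // Print both conversions side by side for NaN inputs.
        System.out.println("float  canonical: 0x" + Integer.toHexString(canonicalBits(fNaN)));
        System.out.println("float  raw:       0x" + Integer.toHexString(rawBits(fNaN)));
        System.out.println("double canonical: 0x" + Long.toHexString(canonicalBits(dNaN)));
        System.out.println("double raw:       0x" + Long.toHexString(rawBits(dNaN)));

        // For non-NaN values the two variants return identical bits.
        System.out.println("1.5f agrees: " + (canonicalBits(1.5f) == rawBits(1.5f)));
    }
}
```

For a broadcast, the scalar is only being reinterpreted as bits before being replicated across lanes, so the canonicalization performed by the non-raw variants is extra work with no benefit for non-NaN inputs.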