Type: Enhancement
Resolution: Unresolved
Priority: P4
Fix Version/s: tbd
Affects Version/s: 27
Component/s: hotspot
Labels: performance
Subcomponent: compiler
CPU: aarch64
OS: generic

On Neoverse-V1/V2, the SVE `CPY (immediate, merging)` instruction performs better than the SVE `CPY (immediate, zeroing)` instruction. Replacing `CPY (immediate, zeroing)` with `MOVI + CPY (immediate, merging)` yields a performance uplift of **12%** to **100%** in specific Java Vector API micro-benchmarks, depending on the operation and data types involved.
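The equivalence behind this rewrite can be illustrated with a small scalar model. This is only a sketch, not HotSpot code: the class `CpyModel` and its methods are hypothetical names, lanes are modeled as `int` elements, and it assumes that the `MOVI` in the replacement sequence zeroes the destination register before the merging `CPY` runs.

```java
import java.util.Arrays;

public class CpyModel {
    // Models CPY (immediate, zeroing): active lanes receive the immediate,
    // inactive lanes are set to zero.
    static int[] cpyZeroing(boolean[] pred, int imm) {
        int[] dst = new int[pred.length];
        for (int i = 0; i < pred.length; i++) {
            dst[i] = pred[i] ? imm : 0;
        }
        return dst;
    }

    // Models MOVI with a zero immediate (assumption): every lane becomes zero.
    static int[] moviZero(int lanes) {
        return new int[lanes];
    }

    // Models CPY (immediate, merging): active lanes receive the immediate,
    // inactive lanes keep the previous destination value.
    static int[] cpyMerging(int[] dst, boolean[] pred, int imm) {
        int[] out = dst.clone();
        for (int i = 0; i < pred.length; i++) {
            if (pred[i]) {
                out[i] = imm;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        boolean[] pred = {true, false, true, true, false, false, true, false};
        int[] zeroing = cpyZeroing(pred, 1);
        int[] merged = cpyMerging(moviZero(pred.length), pred, 1);
        // Both sequences leave the same value in every lane.
        System.out.println(Arrays.equals(zeroing, merged));
    }
}
```

Because the destination is already zero when the merging form runs, the inactive lanes end up with the same value as in the zeroing form, so the two sequences are interchangeable in this model.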

Currently, the SVE `CPY (immediate, zeroing)` instruction is used in code generated for `VectorStoreMaskNode` and `VectorReinterpretNode`. This optimization benefits all Vector APIs that generate these two IR nodes, such as `VectorMask.intoArray()` and `Vector.toLong()`.

links to

Review(master) openjdk/jdk/29359

Assignee: Eric Fang
Reporter: Eric Fang
Votes: 0
Watchers: 2

Created: 2025-12-24 20:42
Updated: 2026-01-22 04:43
