Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Fixed
Priority: P4
Fix Version/s: 26
Affects Version/s: 26
Component/s: hotspot
Labels:
- aarch64
- c2
- performance
- sve
- vectorapi

Subcomponent:
compiler
Resolved In Build:
b24

Current implementation for these two APIs on AArch64 SVE is not efficient enough. SVE does not support naive predicate instructions for these two APIs. Instead, they are now implemented with pure vector instructions. However, the output of "fromLong()" and input of "toLong" are defined as the mask with predicate register on SVE architectures. Hence, for API "fromLong", it needs to generate a vector mask stored in a vector register, and then convert it to the predicate at the end. The opposite action is needed for "toLong" at the start of the backend code generation.

These conversions have higher cost and are implemented in the IR's backend codegen part, which is much more in-efficient and influences the performance of these two APIs.

Consider it has two IRs in C2 to do the conversion specially (e.g. VectorLoadMask/VectorStoreMask), we can move these part from backend and to IR-level. This also matches with current IR pattern for these two APIs on architectures that do not support the predicate feature. Additionally, some mid-end IR optimizations can also be shared.

relates to

JDK-8371446 VectorAPI: Add unit tests for masks from various long values

Resolved

JDK-8374043 C2: assert(_base >= VectorMask && _base <= VectorZ) failed: Not a Vector

Open

JDK-8370666 VectorAPI: Add clear comments for vector mask relative code in c2

Open

links to

Commit(master) openjdk/jdk/676e6fd8

Review(master) openjdk/jdk/27481

Assignee:: Xiaohong Gong
Reporter:: Xiaohong Gong
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: 2025-09-09 19:10
Updated:: 2025-12-19 09:10
Resolved:: 2025-11-12 17:39

Details

Description

Attachments

Issue Links

Activity

People

Dates