Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Unresolved
Priority: P4
Fix Version/s: tbd
Affects Version/s: 26
Component/s: hotspot
Labels:
- c2
- c2-igvn
- performance

Subcomponent:
compiler

ModI/LNode::Ideal implements an optimization that (repeatedly) applies And, RShift, and Add operations if the divisor is a constant of the form 2^k - 1. The number of repetitions needed (unroll_factor) depends on the divisor and is currently limited to 5 repetitions (above, the optimization isn't applied).

In a fixup step, the optimization also produces multiple comparisons and conditional moves.

Depending on the hardware, the optimization can yield worse performance, so we should evaluate whether the unroll limit of 5 is appropriate or if the optimization is worth to keep around in general.

I'm attaching a microbenchmark that covers all Mersenne numbers for int and long. The optimization can be disabled using -XX:ConditionalMoveLimit=0.
I'm also attaching the results measured on my machine (Ryzen 9 3900X) where the optimization is always worse than the more general div by constant optimization. On aarch64 (Neoverse-N1), the results are less conclusive, i.e., a low unroll factor provides better results.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

Modulo.java
19 kB
2025-10-17 12:05
results-base-aarch64.txt
5 kB
2025-10-17 12:27
results-base-x86_64.txt
5 kB
2025-10-17 12:09
results-cmov0-aarch64.txt
5 kB
2025-10-17 12:28
results-cmov0-x86_64.txt
5 kB
2025-10-17 12:09

relates to

JDK-8366815 C2: Delay Mod/Div by constant transformation

Open

Assignee:: Unassigned
Reporter:: Hannes Greule
Votes:: 0 Vote for this issue
Watchers:: 1 Start watching this issue

Created:: 2025-10-17 12:28
Updated:: 2025-10-19 22:15

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates