Loading...

XML

Word

Printable

Type: Bug
Resolution: Fixed
Priority: P4
Fix Version/s: 23
Affects Version/s: 21, 22
Component/s: hotspot
Labels:

Subcomponent:
runtime
Introduced In Version:

18
Resolved In Build:
b08
CPU:

generic
OS:

linux

Issue	Fix Version	Assignee	Priority	Status	Resolution	Resolved In Build
JDK-8336123	21.0.5	Liming Liu	P4	Resolved	Fixed	b01

Since ~~JDK-8272807~~ (version 18 b10), the implementation of os::pretouch_memory got updated from volatile-write-0 to each page in a range, to atomic-add-0, which was to "permit use of memory concurrently with pretouch".

However when we have options -XX:+AlwaysPreTouch -XX:+UseTransparentHugePages together for an app using huge amount of memory like >200GiB. According to logs of numastat -mnv, the transparent huge pages would be fragmented into regular pages unexpectedly. Later on kernel would try to assemble all regular pages (for example 4KB) to make up huge pages (in this case 2MB) gradually. This procedure would cause minutes or longer than half an hour, which impacted the performance on not only the startup phase, but also eventual scores of some key benchmarks especially those highly counting on 99th percentile response-time ones.

From the viewpoint of kernel, if we use “load” instruction in the first place (such as volatile-add-0, or atomic-add-0), kernel actually doesn’t allocate any page, but use “zero” page (it is a special page with all 0) instead. The following write (store instructions) will trigger COW (copy-on-write). But kernel will allocate small pages instead of huge pages for COW. Later on, the huge pages are installed by the dedicated kernel thread asynchronously. Kernel did allocate huge page for COW prior v5.8, but the behavior was changed due to commit https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=3917c80280c9 (“thp: change CoW semantics for anon-THP”).

Therefore, the issue behaves differently with various Linux versions, 4.18 is fine while almost all recent ones like 5.8+, 6.x could show same problem. We could not go back to use volatile-write-0, or revert kernels to the last-known-good point. Instead, madvise call with MADV_POPULATE_WRITE (since Linux 5.14) can be the right way of pretouching memory, and it works well with both regular and huge pages.

backported by

JDK-8336123 pretouch_memory by atomic-add-0 fragments huge pages unexpectedly

Resolved

relates to

JDK-8272807 Permit use of memory concurrent with pretouch

Resolved

JDK-8335865 Shenandoah: Improve THP pretouch after JDK-8315923

Resolved

JDK-8338305 Linux kernels 5.8...5.13 are still affected by huge pages fragmentation even after JDK-8315923

Open

JDK-8324776 runtime/os/TestTransparentHugePageUsage.java fails with The usage of THP is not enough

Resolved

JDK-8324781 runtime/Thread/TestAlwaysPreTouchStacks.java failed with Expected a higher ratio between stack committed and reserved

Resolved

JDK-8324817 Parallel GC does not pre-touch all heap pages when AlwaysPreTouch enabled and large page disabled

Resolved

JDK-8325218 gc/parallel/TestAlwaysPreTouchBehavior.java fails

Resolved

JDK-8327171 Fix more NULL usage backsliding

Resolved

links to

Commit openjdk/jdk/a65a8952

Commit(master) openjdk/jdk21u-dev/cb40bc22

Review openjdk/jdk/15781

Review(master) openjdk/jdk21u-dev/825

(4 relates to, 4 links to)

Assignee:: Liming Liu

Reporter:: Patrick Zhang

Votes:: 0 Vote for this issue

Watchers:: 12 Start watching this issue

Created:: 2023-09-08 03:20

Updated:: 2024-10-22 21:34

Resolved:: 2024-01-26 08:45

Details

Backports

Description

Attachments

Issue Links

Activity

People

Dates