Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Fixed
Priority: P4
Fix Version/s: 25
Affects Version/s: 17, 21, 25
Component/s: hotspot
Labels:

Subcomponent:
runtime
Resolved In Build:
b12

Issue	Fix Version	Assignee	Priority	Status	Resolution	Resolved In Build
JDK-8355058	24.0.2	Xiaolong Peng	P4	Resolved	Fixed	b04
JDK-8355316	21.0.8	Xiaolong Peng	P4	Resolved	Fixed	b01

We have a production use case where leaving the safepoint introduced a significant latency, mostly due to VMThread getting de-scheduled. There are mitigations for this, see JDK-8350324. This issue is about capturing such an event in safepoint logging more comprehensively.

In those cases we would see something like:

[3.664s][info][gc ] GC(7) Pause Young (Allocation Failure) 277M->3M(989M) 5.145ms
[3.694s][info][safepoint] Safepoint "SerialCollectForAllocation", Time since last: 374870126 ns, Reaching safepoint: 271019 ns, At safepoint: 35859436 ns, Total: 36130455 ns

...which is extremely confusing: the GC itself took ~5ms, yet "At safepoint" mentions 35ms! All that time is actually spent disarming the WaitBarrier, and eating the de/re-scheduling for VMThread at the same time.

I think a "Leaving safepoint" counter would be good to have. It comes with the symmetry advantage with "Reaching safepoint" counter. "Leaving safepoint" measures the time spent in safepoint machinery from the "finishing side". And, it more clearly captures the transitional state where some threads might be still at safepoint, and some have already unparked, like "Reaching safepoint".

So the hypothetical log for the excerpt above would look something like:

[3.664s][info][gc ] GC(7) Pause Young (Allocation Failure) 277M->3M(989M) 5.145ms
[3.694s][info][safepoint] Safepoint "SerialCollectForAllocation", Time since last: 374870126 ns, Reaching safepoint: 271019 ns, At safepoint: 5859436 ns, Leaving safepoint: 30000000 ns, Total: 36130455 ns

backported by

JDK-8355058 Include timings for leaving safepoint in safepoint logging

Resolved

JDK-8355316 Include timings for leaving safepoint in safepoint logging

Resolved

relates to

JDK-8350324 Prioritize VMThread for safepoint operations

Open

links to

Commit(master) openjdk/jdk21u-dev/732dca8c

Commit(master) openjdk/jdk24u/2bf1ce4e

Commit(master) openjdk/jdk/9ec46968

Review(master) openjdk/jdk21u-dev/1683

Review(master) openjdk/jdk24u/186

Review(master) openjdk/jdk/23756

(4 links to)

Assignee:: Xiaolong Peng

Reporter:: Aleksey Shipilev

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2025-02-19 00:44

Updated:: 2025-04-22 07:44

Resolved:: 2025-02-26 09:32

Details

Backports

Description

Attachments

Issue Links

Activity

People

Dates