Loading...

XML

Word

Printable

Type: Bug
Resolution: Fixed
Priority: P3
Fix Version/s: 15
Affects Version/s: 15, 16
Component/s: hotspot
Labels:
- oracle-triage-15

Subcomponent:
jfr
Resolved In Build:
b30

Issue	Fix Version	Assignee	Priority	Status	Resolution	Resolved In Build
JDK-8248592	16	Markus Grönlund	P3	Resolved	Fixed	b04
JDK-8250126	15.0.2	Markus Grönlund	P3	Resolved	Fixed	b01
JDK-8250425	15.0.1	Markus Grönlund	P3	Resolved	Fixed	b03

Although ~~JDK-8242088~~ improved scalability and performance in general for most subsystems, it had an unfortunate and overlooked side-effect on JfrCheckpointManager:

~~JDK-8242088~~ consolidated the mspace's to aggregate a 'free_list' and a 'live_list' (previously they were called 'free_list' and 'full_list') and usages have been streamlined to have the 'free_list' actually be a free list and the 'live_list' to be the active, in-use or live list, instead of the more poorly named 'full_list'.

Before ~~JDK-8242088~~, JfrCheckpointManager used the free_list for the two statically allocated buffers (512 Kb) and the full_list was used to hold transient allocated buffers (also 512 kb). This meant that a fetch attempt to lease a statically allocated buffer took at most O(2).

With ~~JDK-8242088~~, the statically allocated buffers are now stored in the live_list. But, the transient allocated buffers are also stored in the live_list.
This has caused the access time to become a function of the number of buffers, indirectly becoming a function of the number of concurrent threads.

On systems with a high number of parallel threads, this becomes problematic.

~~JDK-8242088~~ also revealed that JfrCheckpointManager is heavily over-provisioning memory for transient buffers: the size of a transient buffer is the minimum element size for the JfrCheckpointMspace, which is 512 kb by default. But on buffer release, a transient buffer will be retired, making most of the allocated space unavailable until the next chunk rotation (a flushpoint involving checkpoint data currently only writes contents, but do not deallocate transient buffers, which is post-poned until chunk rotation).

We should address both of these aspects by using two mspaces instead of a single global JfrCheckpointMspace. One mspace is to be specialized for threads and one is to be specialized for the global access. This is similar to the layout of JfrStorage.

backported by

JDK-8248592 Poor scalability in JfrCheckpointManager when using many threads after JDK-8242088

Resolved

JDK-8250126 Poor scalability in JfrCheckpointManager when using many threads after JDK-8242088

Resolved

JDK-8250425 Poor scalability in JfrCheckpointManager when using many threads after JDK-8242088

Resolved

relates to

JDK-8242088 Replace mutually exclusive lists with concurrent alternatives

Resolved

JDK-8234595 JfrBuffer::reinitialize failed "assert(!lease()) failed: invariant"

Closed

JDK-8247965 Two JFR tests failing in Loom repo

Closed

(1 relates to)

Assignee:: Markus Grönlund

Reporter:: Markus Grönlund

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Created:: 2020-06-29 07:50

Updated:: 2024-10-16 19:19

Resolved:: 2020-06-30 10:08

Details

Backports

Description

Attachments

Issue Links

Activity

People

Dates