Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Fixed
Priority: P4
Fix Version/s: 24
Affects Version/s: None
Component/s: hotspot
Labels:
None

Subcomponent:
gc
Resolved In Build:
b11

We currently have several mechanisms used by different GCs (or even within the same GC) for breaking up the processing of large arrays into chunks that can be dealt with in parallel. This feature is used in conjunction with task stealing to allow several worker threads to each work on a different chunk of a large array. This can help balance work, especially if there are very large arrays for which processing might happen to only start late in a phase.

G1 young/mixed collections and ParallelGC young collections both use PartialArrayScanTasks. These encode the remaining work in the length field of either the from-space array (ParallelGC) or the to-space array (G1). But this doesn't work for pinned array objects, so G1 can't use it for arrays that have failed evacuation. This approach also can't be used by concurrent GCs.

Shenandoah encodes the work state in the high address bits of the array oop in the task. That places limits on the available address space that might not even be valid now (see Linux Large Virtual address space, for example).

G1 concurrent marking and ZGC use segregated task queues for array chunks. This allows normal queues and array chunk queues to have different element sizes, allowing the array chunk queues to carry additional information. However, this complicates queue processing, work stealing, and termination detection.

~~JDK-8332455~~ proposed using the Shenandoah encoding more widely, with a fallback to fatter tasks (with unused fields in the common oop/narrowOop tasks) if the address space limitation was a problem (which is always, for 32bit platforms). That proposal received some pushback, and ended up being abandoned because the author found they didn't need this to solve a different problem they were working on.

During the review of ~~JDK-8332455~~, a different approach was proposed. The proposal is to allocate a state object, whose address is the same size as other values in the task queue (so avoids either segregated queues or larger queue elements), which carries the needed task information. Arena+free-list allocation is used to reduce the cost of such an approach.

This approach avoids the use-case limits of PartialArrayScanTasks, avoids the additional complexities of segregated queues, and avoids the address space limitation of the Shenandoah encoding. However, it has costs for allocation and management of state objects. The impact of those costs needs to be minimized and the result compared to existing mechanisms.

relates to

JDK-8340470 G1: Adopt PartialArrayState to consolidate marking stack in Full GC

Open

JDK-8341630 G1: Adopt PartialArrayState to consolidate marking stack in concurrent marking

Open

JDK-8340119 Remove oopDesc::size_might_change()

Resolved

JDK-8340573 Remove unused G1ParScanThreadState::_partial_objarray_chunk_size

Resolved

JDK-8341331 G1: Add array chunking statistics counters similar to parallel gc

Closed

JDK-8341332 Refactor array chunking statistics counters

Closed

JDK-8338248 PartialArrayStateAllocator::Impl leaks Arena array

Resolved

JDK-8271870 G1: Add objArray splitting when scanning object with evacuation failure

Resolved

JDK-8311163 Parallel: Improve large object handling during evacuation

Resolved

JDK-8339668 Parallel: Adopt PartialArrayState to consolidate marking stack in Full GC

Resolved

JDK-8332455 Improve G1/ParallelGC tasks to not override array lengths

Closed

JDK-8339097 Parallel: Compact GC to split array early for task stealing

Closed

links to

Commit(master) openjdk/jdk/6a3d0452

Review(master) openjdk/jdk/20445

(7 relates to, 2 links to)

Assignee:: Kim Barrett

Reporter:: Kim Barrett

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024-08-01 17:54

Updated:: 2024-10-07 02:57

Resolved:: 2024-08-11 11:36

Details

Description

Attachments

Issue Links

Activity

People

Dates