Type: Bug
Resolution: Fixed
Priority: P3
Fix Version/s: hs25
Affects Version/s: hs25
Component/s: hotspot
Labels:
- InternalURL-Comment
- sqe-8-noreglabel-backlog-startfresh

Subcomponent: gc
Resolved In Build: b02
CPU: generic
OS: generic

Issue	Fix Version	Assignee	Priority	Status	Resolution	Resolved In Build
JDK-8000126	8	John Coomes	P3	Resolved	Fixed	b58
JDK-8017981	7u45	Bengt Rutisson	P3	Closed	Fixed	b01
JDK-8002849	7u40	Bengt Rutisson	P3	Resolved	Fixed	b01
JDK-2230043	hs24	Bengt Rutisson	P3	Resolved	Fixed	team

Hal Mo reported the following issue on the hotspot-gc-dev alias:

http://mail.openjdk.java.net/pipermail/hotspot-gc-dev/2012-September/004978.html

Hi all,

This is Hal Mo <kungu.mjh at taobao.com> from Alibaba Group (with OCA).

Our Hadoop NameNode crashed when we set the heap size to 135G with the CMS
collector. The crash log (hs_err_pid.log) is attached.

I can reliably reproduce the crash on a test machine with 190G of physical
memory, using a simple command:
$ java -Xmx135g -XX:+UseConcMarkSweepGC

I then built a debug JVM and used gdb to debug the problem.

Call stack:

C [libc.so.6+0x7a9b0] memset+0x40
V [libjvm.so+0x2b6c42] BlockOffsetArray::set_remainder_to_point_to_start_incl(unsigned long, unsigned long, bool)+0xce
V [libjvm.so+0x2b7043] BlockOffsetArray::set_remainder_to_point_to_start(HeapWord*, HeapWord*, bool)+0x71
V [libjvm.so+0x2b728d] BlockOffsetArray::BlockOffsetArray(BlockOffsetSharedArray*, MemRegion, bool)+0x9f
V [libjvm.so+0x3c089f] BlockOffsetArrayNonContigSpace::BlockOffsetArrayNonContigSpace(BlockOffsetSharedArray*, MemRegion)+0x37
V [libjvm.so+0x3be56f] CompactibleFreeListSpace::CompactibleFreeListSpace(BlockOffsetSharedArray*, MemRegion, bool, FreeBlockDictionary::DictionaryChoice)+0x9b
V [libjvm.so+0x3fd2e1] ConcurrentMarkSweepGeneration::ConcurrentMarkSweepGeneration(ReservedSpace, unsigned long, int, CardTableRS*, bool, FreeBlockDictionary::DictionaryChoice)+0x1df
V [libjvm.so+0x4dc03e] GenerationSpec::init(ReservedSpace, int, GenRemSet*)+0x37c
V [libjvm.so+0x4ced40] GenCollectedHeap::initialize()+0x510
V [libjvm.so+0x7c23c3] Universe::initialize_heap()+0x31d
V [libjvm.so+0x7c27ec] universe_init()+0xa6
V [libjvm.so+0x5056e2] init_globals()+0x34
V [libjvm.so+0x7ac926] Threads::create_vm(JavaVMInitArgs*, bool*)+0x23a
V [libjvm.so+0x53f3d4] JNI_CreateJavaVM+0x7a

In the function BlockOffsetArray::set_remainder_to_point_to_start_incl, inside
the for loop:
    size_t reach = start_card - 1 + (power_to_cards_back(i+1) - 1);
when i = 7, the value of reach was 0. The loop therefore could not break, and
    _array->set_offset_array(start_card_for_region, reach, offset, reducing);
accessed the wrong address and crashed.

The root cause is:
static size_t power_to_cards_back(uint i) {
    return (size_t)(1 << (LogBase * i));
}
The literal 1 is a 32-bit int, so the shift is evaluated in 32 bits before the
cast to size_t, and 1 << 32 overflows.
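
To see why the failing iteration is only reached on very large heaps, the loop's break condition can be replayed with the powers computed in 64 bits. This is a rough sketch, not HotSpot code: `LogBase = 4` is inferred from the report (i + 1 = 8 producing a shift of 32), while `N_powers = 14`, the 512-byte card size, the `reach >= end_card` break condition, and the helper names `covering_index` and `last_card_for_gib` are assumptions made for illustration. It also treats the whole `-Xmx` value as a single space, which overstates the size of the CMS generation.

```cpp
#include <cassert>
#include <cstdint>

// Assumptions for this sketch (not stated in the report):
// LogBase = 4 is inferred from i + 1 = 8 giving a shift of 32;
// N_powers = 14 and 512-byte cards are assumed values.
static const unsigned LogBase = 4;
static const unsigned N_powers = 14;
static const std::uint64_t kCardBytes = 512;

// Intended value of the power function, computed in 64 bits.
static std::uint64_t power_to_cards_back(unsigned i) {
    return std::uint64_t(1) << (LogBase * i);
}

// Returns the loop index i at which reach first covers end_card
// (assumed break condition), or N_powers if no index covers it.
static unsigned covering_index(std::uint64_t start_card, std::uint64_t end_card) {
    for (unsigned i = 0; i < N_powers; ++i) {
        std::uint64_t reach = start_card - 1 + (power_to_cards_back(i + 1) - 1);
        if (reach >= end_card) {
            return i;
        }
    }
    return N_powers;
}

// Hypothetical helper: index of the last card of a space of the given size in GiB.
static std::uint64_t last_card_for_gib(std::uint64_t gib) {
    return (gib << 30) / kCardBytes - 1;
}
```

Under these assumptions, a space of up to 128 GiB is covered at i = 6, while a 135 GiB space is only covered at i = 7, which requires power_to_cards_back(8), that is, 2^32 cards.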

Here is my fix (tested), also in the attached file
cms_large_heap_crash.patch:

+++ b/src/share/vm/memory/blockOffsetTable.hpp
@@ -289,7 +289,7 @@
};

static size_t power_to_cards_back(uint i) {
- return (size_t)(1 << (LogBase * i));
+ return (size_t)1 << (LogBase * i);
}
static size_t power_to_words_back(uint i) {
return power_to_cards_back(i) * N_words;

Contributed-by: Hal Mo <kungu.mjh at taobao.com>
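
The effect of the patch can be checked in isolation. The following is a minimal sketch, assuming a 64-bit `size_t` and `LogBase = 4` (inferred from the report, not quoted from the source file); `shift_fits_in_int` is a hypothetical helper added only to show where the unpatched expression stops being valid. The unpatched expression is deliberately not executed, because shifting a 32-bit `int` by 32 is undefined behavior in C++.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>

static_assert(sizeof(std::size_t) == 8, "sketch assumes a 64-bit size_t");

// Assumption: inferred from the report, where i + 1 = 8 gives a shift of 32.
static const unsigned LogBase = 4;

// Patched form: the literal is widened to size_t before the shift,
// so the shift itself is carried out in 64 bits.
static std::size_t power_to_cards_back(unsigned i) {
    return (std::size_t)1 << (LogBase * i);
}

// Hypothetical helper: true when the unpatched `1 << (LogBase * i)`
// would still be a valid shift of a plain int (31 value bits for a 32-bit int).
static bool shift_fits_in_int(unsigned i) {
    return LogBase * i < static_cast<unsigned>(std::numeric_limits<int>::digits);
}
```

With these definitions, `power_to_cards_back(7)` is 2^28 and still within range of an `int` shift, whereas `power_to_cards_back(8)` is 2^32, a value that only the widened form can represent.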

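A similar pattern also exists in G1, but there the unit is a megabyte (2^20
bytes). A shift of 32 would correspond to 2^(32+20) bytes, which is too large
to occur in practice, so the overflow cannot be triggered there.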

backported by

JDK-2230043 BlockOffsetArray::power_to_cards_back() needs to handle > 32 bit shifts (Resolved)
JDK-8000126 BlockOffsetArray::power_to_cards_back() needs to handle > 32 bit shifts (Resolved)
JDK-8002849 BlockOffsetArray::power_to_cards_back() needs to handle > 32 bit shifts (Resolved)
JDK-8002850 BlockOffsetArray::power_to_cards_back() needs to handle > 32 bit shifts (Closed)
JDK-8002851 BlockOffsetArray::power_to_cards_back() needs to handle > 32 bit shifts (Closed)
JDK-8017981 BlockOffsetArray::power_to_cards_back() needs to handle > 32 bit shifts (Closed)

duplicates

JDK-8011190 JVM Cores dumped (Closed)
JDK-8022892 Java VM crashes with access violation when started with -XX:+UseConcMarkSweepGC (Closed)


Assignee: Bengt Rutisson (Inactive)
Reporter: Bengt Rutisson (Inactive)
Votes: 0
Watchers: 2

Created: 2012-09-12 00:44
Updated: 2013-09-18 05:30
Resolved: 2012-09-25 04:05
Imported: 26/Sep/12 10:16 AM
Indexed: 26/Sep/12 2:15 AM
