-
Bug
-
Resolution: Fixed
-
P3
-
1.4.2, 5.0u8
Issue | Fix Version | Assignee | Priority | Status | Resolution | Resolved In Build |
---|---|---|---|---|---|---|
JDK-2176734 | 7 | Y. Ramakrishna | P3 | Closed | Fixed | b03 |
JDK-2141970 | 6u1 | Y. Ramakrishna | P3 | Resolved | Fixed | b01 |
JDK-2141969 | 5.0u10 | Poonam Bajaj Parhar | P3 | Resolved | Fixed | b02 |
JDK-2141971 | 1.4.2_14 | Poonam Bajaj Parhar | P3 | Resolved | Fixed | b01 |
The ER is 1.5.0_06-erdist-2006-02-01. The bug addressed was 6367204
The hardware is:
Dell 4xDual core Xeon
32GB RAM
10x RAID 10 HDD
The OS is Server 2003.
The java version and configuration:
JRE v.1.5.0_06-erdist-20060201
28GB heap
GC options: -XX:+UseParNewGC -XX:+UseConcMarkSweepGC
-XX:ParallelGCThreads=7 -XX:NewSize=128M -XX:MaxNewSize=128M
-XX:+CMSIncrementalMode -XX:+CMSIncrementalPacing
-XX:CMSIncrementalDutyCycleMin=0 -XX:CMSIncrementalDutyCycle=10
-XX:CMSMarkStackSize=8M -XX:CMSMarkStackSizeMax=32M -XX:+UseLargePages
-XX:+DisableExplicitGC
The application is a custom distributed database server based on
TCP/IP and Sleepycat DBJE
The symptoms:
After running smoothly for ~1-4 days straight with
constant but light load, the ParNew GC's jump from ~150 ms every 30
seconds to 5-20 seconds out of every 30 seconds. The start of the
degenerate ParNew GCs seem to mostly (but not always) coincide with the
start of a new CMS mark phase. The general pattern is to spend 20-90%
of the time in young GC, which eventually quiesces down to acceptable
levels after ~4 hours of GC pain (frequently to re-start after the next
CMS sweep).
The load was constant and unvaried from our side, so we
don't see any application-level cause for the degenerate GC performance.
They ran a test with 5.0u8 and the problem seemed to be pushed out.
The time to failure went to 48 hours for the initial 5 second spike and
another day or so to hit the ~20 second spikes.
They were running with large pages so they ran a test without it and with their ER 5.0u6 and the problem seemed to have gone away, but returned many days later.
Turning off large pages seem to have also extended the running but eventually
they still see the problem
- backported by
-
JDK-2141969 ParNewGC times spiking, eventually taking up 20 out of every 30 seconds
- Resolved
-
JDK-2141970 ParNewGC times spiking, eventually taking up 20 out of every 30 seconds
- Resolved
-
JDK-2141971 ParNewGC times spiking, eventually taking up 20 out of every 30 seconds
- Resolved
-
JDK-2176734 ParNewGC times spiking, eventually taking up 20 out of every 30 seconds
- Closed
- duplicates
-
JDK-6413516 4 unusual very long ParNew GC
- Closed
- relates to
-
JDK-6237967 CMS: ability to tweak demand forecasting and coalition policy to control fragmentation
- Resolved
-
JDK-6459113 CMS+ParNew: wildly different ParNew pause times depending on heap shape caused by allocation spread
- Resolved
-
JDK-6446077 Long GC times after Concurrent Mode Failure
- Closed