Type: Enhancement
Resolution: Fixed
Priority: P4
Fix Version/s: 24
Affects Version/s: 23
Component/s: hotspot
Labels:
- jni

Subcomponent: runtime
Resolved In Build: b14

The GetStringUTFLength function returns the length as a jint (jsize) value, so it can return at most Integer.MAX_VALUE. But a Java string can itself consist of Integer.MAX_VALUE characters, each of which may require more than one byte to represent in modified UTF-8 format.** It follows that this function cannot return the correct answer for all String values, yet the specification makes no mention of this limitation, nor of any error to report if this situation is encountered.

**The modified UTF-8 format used by the VM can require up to six bytes to represent one Unicode character, but six-byte characters are stored as UTF-16 surrogate pairs. Hence each Java char needs at most 3 bytes, and so the maximum length is 3*Integer.MAX_VALUE. With compact strings this reduces to 2*Integer.MAX_VALUE.
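The arithmetic above can be illustrated with a minimal Java sketch. It is not the VM's implementation: the class and method names (`ModifiedUtf8Length`, `modifiedUtf8Length`, `bytesForChar`) are hypothetical, and the narrowing cast at the end only shows what happens when a worst-case length is forced into a 32-bit int; it does not claim to reproduce what GetStringUTFLength did before the fix.

```java
public class ModifiedUtf8Length {

    // Bytes needed to encode one Java char (UTF-16 code unit) in modified UTF-8.
    // U+0000 takes two bytes; surrogates are encoded individually as three bytes each.
    static int bytesForChar(char c) {
        if (c >= 0x0001 && c <= 0x007F) {
            return 1;
        }
        if (c <= 0x07FF) {
            return 2; // includes U+0000
        }
        return 3;
    }

    // Total modified UTF-8 length, accumulated in a long so it cannot overflow.
    static long modifiedUtf8Length(String s) {
        long bytes = 0;
        for (int i = 0; i < s.length(); i++) {
            bytes += bytesForChar(s.charAt(i));
        }
        return bytes;
    }

    public static void main(String[] args) {
        System.out.println(modifiedUtf8Length("abc"));          // 3
        System.out.println(modifiedUtf8Length("\u0000"));       // 2
        System.out.println(modifiedUtf8Length("\u20AC"));       // 3
        System.out.println(modifiedUtf8Length("\uD83D\uDE00")); // 6 (surrogate pair)

        // Worst case from the description: 3 bytes for each of Integer.MAX_VALUE chars.
        long worstCase = 3L * Integer.MAX_VALUE;
        System.out.println(worstCase);       // 6442450941
        System.out.println((int) worstCase); // 2147483645 after narrowing to 32 bits
    }
}
```

Note that the narrowed value is positive and looks plausible, so a caller given only a 32-bit result would have no way to tell that the true length had been lost.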

causes:
- JDK-8360255 runtime/jni/checked/TestLargeUTF8Length.java fails with -XX:-CompactStrings (Resolved)
- JDK-8370646 TestLargeUTF8Length.java needs lots of memory (Resolved)

csr for:
- JDK-8338709 [JNI] The JNI Specification needs to address the limitations of integer UTF-8 String lengths (Closed)

links to:
- Commit(master) openjdk/jdk/90f3f432
- Review(master) openjdk/jdk/20784

UTF8 lengths should be size_t not int (Resolved)

David Holmes

Assignee: David Holmes
Reporter: David Holmes
Votes: 1
Watchers: 5

Created: 2024-03-24 23:30
Updated: 2025-11-05 03:27
Resolved: 2024-09-03 20:41
