Loading...

XML

Word

Printable

Type: Bug
Resolution: Duplicate
Priority: P4
Fix Version/s: None
Affects Version/s: 1.1.6
Component/s: core-libs
Labels:
- EUCJP
- SJIS

Subcomponent:
java.io
CPU:

unknown
OS:

solaris_2.6

String objects creatd by the following String constructor are different by the encoding schemes used in the byte data when the buffer contains extra data which cannot be converted.

String(byte[] buffer, int offset, int count, String enc) will

Attached program will produce the following result, where buffer contains extra 1 byte data in each encoding.

length (via EUCJP) : 4
length (via SJIS) : 0

This shows that in EUCJP encoding, String() will ignore the extra byte and returns the String object created from the previous 8 byte data, whereas in
SJIS encoding, it ignores not only the extra byte but the whole data in the buffer and returns an empty string.

(The JDK1.1.7 and JDK1.2 spec does not exactly mention which result will be appropriate.)

public class StringTest {
     public static void main(String args[]) throws Exception {
         String str = "\u3042\u3044\u3046\u3048\u304a"; // "AIUEO" in Japanese

         String eucjp = new String(str.getBytes("EUCJP"), 0, 9, "EUCJP");
         String sjis = new String(str.getBytes("SJIS"), 0, 9, "SJIS");

         System.out.println("length (via EUCJP) : " + eucjp.length());
         System.out.println("length (via SJIS) : " + sjis.length());
     }
}

duplicates

JDK-4526769 java.nio specification needs clarifications from JSR-51 draft 0.60

Resolved

Assignee:: Michael Mccloskey (Inactive)

Reporter:: J. Duke

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Created:: 1999-02-24 03:21

Updated:: 2002-01-14 11:17

Resolved:: 2002-01-14 11:17

Imported:: 15/Sep/12 9:53 PM

Indexed:: 17/Jul/12 6:25 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates