Type: Bug
Resolution: Fixed
Priority: P3
Fix Version/s: 7
Affects Version/s: 6u1
Component/s: core-libs
Labels:
- licbug
- licdirect

Subcomponent: java.nio.charsets
Resolved In Build: b62
CPU: x86
OS: linux

I tried to run the attached testcase (Cp943Test), which checks the Cp943 code range.

Cp943Test returns either U+0000 or U+FFFD for unconvertible characters.

For example,
0x81ad: 0000
0x81ae: 0000
0x81af: 0000
0x81b0: 0000
...
0x8581: fffd
0x8587: fffd
0x85e0: fffd
0x85f0: fffd

What is the difference between these two kinds of unconvertible characters?
I think the converter should return U+FFFD instead of U+0000.

=====================================================================================
class Cp943Test {
    public static void main(String[] args) {
        byte[] test = new byte[2];
        char[] charBuffer = new char[1];
        // Internal sun.io converter under test (not a public API).
        sun.io.ByteToCharCp943 convertor = new sun.io.ByteToCharCp943();
        // Lead bytes: 0x81-0x9f and 0xe0-0xfc.
        for (int i = 0x81; i <= 0xfc; i++) {
            if (0x9f < i && i < 0xe0) continue;
            // Trail bytes: 0x40-0xfc, excluding 0x7f.
            for (int j = 0x40; j <= 0xfc; j++) {
                if (j == 0x7f) continue;
                test[0] = (byte) i;
                test[1] = (byte) j;
                try {
                    int nc = convertor.convert(test, 0, 2, charBuffer, 0, 1);
                    System.out.printf("0x%02x%02x: %04x\n", i, j, (int) charBuffer[0]);
                } catch (Exception e) {
                    // Report the failure on stderr instead of silently dropping it.
                    System.err.printf("0x%02x%02x: %s\n", i, j, e);
                }
            }
        }
    }
}
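
For comparison with the testcase above, here is a minimal sketch of the same kind of check written against the public java.nio.charset API rather than the internal sun.io converter. It is not the fix that went into the JDK. The class name ReplacementSketch and the helper decodeWithReplacement are hypothetical, and the charset name "x-IBM943" for Cp943 is an assumption, so the sketch checks that it is supported before using it. With CodingErrorAction.REPLACE, a CharsetDecoder substitutes its replacement string, which defaults to U+FFFD, for input it cannot convert.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
class ReplacementSketch {

    // Decodes bytes with the given charset, substituting the decoder's
    // replacement string (U+FFFD by default) for malformed or unmappable input.
    static String decodeWithReplacement(Charset cs, byte[] bytes) {
        CharsetDecoder decoder = cs.newDecoder()
                .onMalformedInput(CodingErrorAction.REPLACE)
                .onUnmappableCharacter(CodingErrorAction.REPLACE);
        try {
            return decoder.decode(ByteBuffer.wrap(bytes)).toString();
        } catch (CharacterCodingException e) {
            // Not expected when both actions are REPLACE; rethrow unchecked.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // US-ASCII is always available; 0x80 is not a valid US-ASCII byte.
        String ascii = decodeWithReplacement(
                StandardCharsets.US_ASCII, new byte[] {(byte) 0x80});
        System.out.printf("US-ASCII 0x80: %04x%n", (int) ascii.charAt(0));

        // Assumption: Cp943 is exposed under this name when the extended
        // charsets are installed.
        String name = "x-IBM943";
        if (Charset.isSupported(name)) {
            String s = decodeWithReplacement(
                    Charset.forName(name), new byte[] {(byte) 0x81, (byte) 0xad});
            for (char c : s.toCharArray()) {
                System.out.printf("%s 0x81ad: %04x%n", name, (int) c);
            }
        } else {
            System.out.println(name + " is not available in this runtime");
        }
    }
}
```

The US-ASCII line prints fffd on any runtime. What the Cp943 line prints depends on the JDK build in use, so no output is claimed for it here.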

Assignee: Xueming Shen
Reporter: Erik Larsen (Inactive)
Votes: 0
Watchers: 0
Created: 2007-06-13 07:24
Updated: 2010-04-02 15:29
Resolved: 2009-06-22 14:37
Imported: 15/Sep/12 1:24 PM
Indexed: 17/Jul/12 10:56 AM
