-
Enhancement
-
Resolution: Fixed
-
P4
-
1.4.1, 5.0
-
b32
-
generic, x86
-
generic, windows_2000
Name: pa48320 Date: 11/25/2002
[Problem description]
There are a coulpe of charset alias/canonical names that are supported
in java.io/java.lang are not supported in java.nio.charset.Charset.
From the compatibility point of view, it would be much better
to add these names as aliases in java.nio.charset.Charset
to support them in java.nio.
Actually, some of names such as "ISO8859_15" or "KOI8_R" are quite
commonly used in the existing system so that it will be a problem
when the system is migrated to use java.nio.
Unsupported charset names are following;
Unsupported charset name / To be mapped to
--------------------------------------------------
ansi-1251 / windows-1251
big5hk / Big5-HKSCS
big5-hkscs:unicode3.0 / Big5-HKSCS
unicode-1-1-utf-8 / UTF-8
default / US-ASCII
ascii7 / US-ASCII (*sun bug 4721369)
8859_2 / ISO-8859-2
iso_8859-2:1987 / ISO-8859-2
iso8859-2 / ISO-8859-2
ibm912 / ISO-8859-2
ibm-912 / ISO-8859-2
cp912 / ISO-8859-2
912 / ISO-8859-2
8859_3 / ISO-8859-3
iso_8859-3:1988 / ISO-8859-3
iso8859-3 / ISO-8859-3
ibm913 / ISO-8859-3
ibm-913 / ISO-8859-3
cp913 / ISO-8859-3
913 / ISO-8859-3
8859_4 / ISO-8859-4
iso_8859-4:1988 / ISO-8859-4
iso8859-4 / ISO-8859-4
ibm914 / ISO-8859-4
ibm-914 / ISO-8859-4
cp914 / ISO-8859-4
914 / ISO-8859-4
8859_5 / ISO-8859-5
iso_8859-5:1988 / ISO-8859-5
iso8859-5 / ISO-8859-5
ibm915 / ISO-8859-5
ibm-915 / ISO-8859-5
cp915 / ISO-8859-5
915 / ISO-8859-5
8859_6 / ISO-8859-6
iso_8859-6:1987 / ISO-8859-6
iso8859-6 / ISO-8859-6
ibm1089 / ISO-8859-6
ibm-1089 / ISO-8859-6
cp1089 / ISO-8859-6
1089 / ISO-8859-6
8859_7 / ISO-8859-7
iso_8859-7:1987 / ISO-8859-7
iso8859-7 / ISO-8859-7
ibm813 / ISO-8859-7
ibm-813 / ISO-8859-7
cp813 / ISO-8859-7
813 / ISO-8859-7
8859_8 / ISO-8859-8
iso_8859-8:1988 / ISO-8859-8
iso8859-8 / ISO-8859-8
ibm916 / ISO-8859-8
ibm-916 / ISO-8859-8
cp916 / ISO-8859-8
916 / ISO-8859-8
8859_9 / ISO-8859-9
iso8859-9 / ISO-8859-9
ibm920 / ISO-8859-9
ibm-920 / ISO-8859-9
cp920 / ISO-8859-9
920 / ISO-8859-9
8859_13 / ISO-8859-13
iso_8859-13 / ISO-8859-13
iso8859-13 / ISO-8859-13
ISO8859_15 / ISO-8859-15
KOI8_R / KOI8-R
UnicodeBigUnmarked / UTF-16BE
UnicodeLittleUnmarked / UTF-16LE
UTF16 / UTF-16
[Reproduce procedure]
Here's a simple tester code.
Usage: java test1 <charset name>
----- test1.java
import java.nio.charset.*;
public class test1 {
public static void main(String[] argv) {
try{
Charset c = Charset.forName(argv[0]);
System.out.println(argv[0] + ": supported (" + c.name() +")");
} catch(UnsupportedCharsetException uce) {
System.out.println(argv[0] + ": NOT supported");
}
}
}
----- end of test1.java
(Review ID: 167318)
======================================================================
###@###.### 2003-09-04
A CAP member reports the same problem:
RFC 1345 defines a lot of character sets (including Cp852). Could you
please add them to Tiger? The information in that RFC contains machine
readable character and character set definitions, which are easy to parse.
[Problem description]
There are a coulpe of charset alias/canonical names that are supported
in java.io/java.lang are not supported in java.nio.charset.Charset.
From the compatibility point of view, it would be much better
to add these names as aliases in java.nio.charset.Charset
to support them in java.nio.
Actually, some of names such as "ISO8859_15" or "KOI8_R" are quite
commonly used in the existing system so that it will be a problem
when the system is migrated to use java.nio.
Unsupported charset names are following;
Unsupported charset name / To be mapped to
--------------------------------------------------
ansi-1251 / windows-1251
big5hk / Big5-HKSCS
big5-hkscs:unicode3.0 / Big5-HKSCS
unicode-1-1-utf-8 / UTF-8
default / US-ASCII
ascii7 / US-ASCII (*sun bug 4721369)
8859_2 / ISO-8859-2
iso_8859-2:1987 / ISO-8859-2
iso8859-2 / ISO-8859-2
ibm912 / ISO-8859-2
ibm-912 / ISO-8859-2
cp912 / ISO-8859-2
912 / ISO-8859-2
8859_3 / ISO-8859-3
iso_8859-3:1988 / ISO-8859-3
iso8859-3 / ISO-8859-3
ibm913 / ISO-8859-3
ibm-913 / ISO-8859-3
cp913 / ISO-8859-3
913 / ISO-8859-3
8859_4 / ISO-8859-4
iso_8859-4:1988 / ISO-8859-4
iso8859-4 / ISO-8859-4
ibm914 / ISO-8859-4
ibm-914 / ISO-8859-4
cp914 / ISO-8859-4
914 / ISO-8859-4
8859_5 / ISO-8859-5
iso_8859-5:1988 / ISO-8859-5
iso8859-5 / ISO-8859-5
ibm915 / ISO-8859-5
ibm-915 / ISO-8859-5
cp915 / ISO-8859-5
915 / ISO-8859-5
8859_6 / ISO-8859-6
iso_8859-6:1987 / ISO-8859-6
iso8859-6 / ISO-8859-6
ibm1089 / ISO-8859-6
ibm-1089 / ISO-8859-6
cp1089 / ISO-8859-6
1089 / ISO-8859-6
8859_7 / ISO-8859-7
iso_8859-7:1987 / ISO-8859-7
iso8859-7 / ISO-8859-7
ibm813 / ISO-8859-7
ibm-813 / ISO-8859-7
cp813 / ISO-8859-7
813 / ISO-8859-7
8859_8 / ISO-8859-8
iso_8859-8:1988 / ISO-8859-8
iso8859-8 / ISO-8859-8
ibm916 / ISO-8859-8
ibm-916 / ISO-8859-8
cp916 / ISO-8859-8
916 / ISO-8859-8
8859_9 / ISO-8859-9
iso8859-9 / ISO-8859-9
ibm920 / ISO-8859-9
ibm-920 / ISO-8859-9
cp920 / ISO-8859-9
920 / ISO-8859-9
8859_13 / ISO-8859-13
iso_8859-13 / ISO-8859-13
iso8859-13 / ISO-8859-13
ISO8859_15 / ISO-8859-15
KOI8_R / KOI8-R
UnicodeBigUnmarked / UTF-16BE
UnicodeLittleUnmarked / UTF-16LE
UTF16 / UTF-16
[Reproduce procedure]
Here's a simple tester code.
Usage: java test1 <charset name>
----- test1.java
import java.nio.charset.*;
public class test1 {
public static void main(String[] argv) {
try{
Charset c = Charset.forName(argv[0]);
System.out.println(argv[0] + ": supported (" + c.name() +")");
} catch(UnsupportedCharsetException uce) {
System.out.println(argv[0] + ": NOT supported");
}
}
}
----- end of test1.java
(Review ID: 167318)
======================================================================
###@###.### 2003-09-04
A CAP member reports the same problem:
RFC 1345 defines a lot of character sets (including Cp852). Could you
please add them to Tiger? The information in that RFC contains machine
readable character and character set definitions, which are easy to parse.
- relates to
-
JDK-4966197 coulpe of charset alias/canonical names that are not supported in nio
-
- Resolved
-