Name: boT120536 Date: 02/14/2001
java version "1.3.0_01"
Java(TM) 2 Runtime Environment, Standard Edition (build 1.3.0_01)
Java HotSpot(TM) Client VM (build 1.3.0_01, mixed mode)
Unicode Standard Annex #13 contains guidelines for dealing with newlines in
Unicode text. It is available at http://www.unicode.org/unicode/reports/tr13.
The document explicitly states that the Unicode LS (\u2028) and PS (\u2029)
characters are to be converted to the value of the system
property "line.separator" when translating from Unicode to other character sets.
However, the java.io classes do not behave this way. For example:
public class Bug {
    public static void main(String[] args) {
        System.out.println("one\u2028two");
    }
}
This program would be expected to print something like:
one
two
by translating the \u2028 into the correct line separator. Instead it prints:
one?two
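For comparison, the translation described in the annex can be approximated by
hand before the text is written. The following is only a minimal sketch: the
class name SeparatorSketch and the helper translateSeparators are hypothetical
and not part of any Java API, and StringBuffer is used so the code stays within
what the 1.3 class library offers.

```java
public class SeparatorSketch {
    // Hypothetical helper (not a JDK method): replaces each LS (U+2028)
    // and PS (U+2029) character with the supplied separator string.
    static String translateSeparators(String text, String separator) {
        StringBuffer out = new StringBuffer(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c == '\u2028' || c == '\u2029') {
                out.append(separator);
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Use the platform separator, as the annex recommends.
        String separator = System.getProperty("line.separator");
        System.out.println(translateSeparators("one\u2028two", separator));
    }
}
```

With this manual step the two words appear on separate lines, which is the
output the report expects println to produce on its own.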
I have run the above program with file.encoding set to "ASCII", "8859_1",
and "Cp1252" (the actual default for my platform), and the results are always
the same.
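The '?' can also be observed without involving the console. The sketch below
encodes the string and decodes it again, so the encoder's substitution becomes
visible. It assumes a JDK newer than the 1.3.0_01 runtime in this report,
because it relies on String.getBytes(Charset), which is documented to replace
unmappable characters with the charset's default replacement bytes. The class
name EncodingSketch and the helper roundTrip are hypothetical.

```java
import java.nio.charset.Charset;

public class EncodingSketch {
    // Hypothetical helper: encodes the text in the named charset and decodes
    // the resulting bytes, exposing any replacement made by the encoder.
    static String roundTrip(String text, String charsetName) {
        Charset charset = Charset.forName(charsetName);
        byte[] encoded = text.getBytes(charset);
        return new String(encoded, charset);
    }

    public static void main(String[] args) {
        // Both charsets are required to exist on every Java platform.
        String[] charsets = { "US-ASCII", "ISO-8859-1" };
        for (int i = 0; i < charsets.length; i++) {
            System.out.println(charsets[i] + ": "
                    + roundTrip("one\u2028two", charsets[i]));
        }
    }
}
```

Neither charset can represent U+2028, so each line of output shows one?two,
matching the behavior reported above.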
(Review ID: 116892)
======================================================================