Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-4415272

\u2028 and \u2029 should map to line.separator per Unicode 4.0 Newline Guidelines

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: P4 P4
    • None
    • 1.3.0
    • core-libs
    • x86
    • windows_2000

      Name: boT120536 Date: 02/14/2001


      java version "1.3.0_01"
      Java(TM) 2 Runtime Environment, Standard Edition (build 1.3.0_01)
      Java HotSpot(TM) Client VM (build 1.3.0_01, mixed mode)

      Unicode Standard Annex #13 contains guidelines for dealing with newlines in
      Unicode text. It it available at http://www.unicode.org/unicode/reports/tr13.

      The document explicitly states that the Unicode LS (\u2028) and PS (\u2028)
      characters are to be converted to the valie of the system
      property "line.separator" when translating from Unicode to other character sets.

      However, this is not the behavior exhibited by the Java io classes. For example:

      public class Bug {
          public static void main(String[] args) {
              System.out.println("one\u2028two");
          }
      }

      This program would be expected to print something like:

      one
      two

      by translating the \u2028 into the correct line separator. Instead it prints:

      one?two

      I have run the above program with file.encoding set to "ASCII", "8859_1",
      and "Cp1252" (the actual default for my platform), and the results are always
      the same.
      (Review ID: 116892)
      ======================================================================

            Unassigned Unassigned
            bonealsunw Bret O'neal (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Imported:
              Indexed: