JDK-4199601

Cp942 and Cp942C converters have problems with ITAIJI characters

    • Type: Bug
    • Resolution: Fixed
    • Priority: P4
    • 1.3.0
    • 1.2.0
    • core-libs
    • kestrel
    • generic
    • generic



      Name: bb33257 Date: 12/23/98


      The converters for Cp942 and Cp942C (Japanese IBM PC) have a number of problems
      dealing with the "ITAIJI" characters.

      The broken conversions, and the corrections, are as follows:

      ByteToCharCp942, ByteToCharCp942C ITAIJI mapping
      IBM PC to Unicode
       Current mapping      Correct mapping
       x8BA0 -> u4FE0       x8BA0 -> u4FA0
       x948D -> u525D       x948D -> u5265
       x88A0 -> u555E       x88A0 -> u5516
       x8A9A -> u5699       x8A9A -> u565B
       x9458 -> u56CA       x9458 -> u56A2
       x9355 -> u5861       x9355 -> u586B
       x9BA0 -> u5C5B       x9BA0 -> u5C4F
       x8EC6 -> u5C62       x8EC6 -> u5C61
       x92CD -> u6451       x92CD -> u63B4
       x917E -> u6414       x917E -> u63BB
       x9DB7 -> u6522       x9DB7 -> u6505
       x9E94 -> u688E       x9E94 -> u688D
       x93C0 -> u7006       x93C0 -> u6D9C
       x94AC -> u6F51       x94AC -> u6E8C
       x898B -> u7130       x898B -> u7114
       x9189 -> u7626       x9189 -> u75E9
       x9398 -> u79B1       x9398 -> u7977
       x925C -> u7C1E       x925C -> u7BAA
       xE379 -> u7E48       xE379 -> u7E66
       x8C71 -> u7E6B       x8C71 -> u7E4B
       x8F4A -> u7E61       x8F4A -> u7E4D
       xE445 -> u8141       xE445 -> u80FC
       x9789 -> u840A       x9789 -> u83B1
       x8FD3 -> u8523       x8FD3 -> u848B
       x90E4 -> u87EC       x90E4 -> u8749
       x9858 -> u881F       x9858 -> u874B
       x8BEB -> u8EC0       x8BEB -> u8EAF
       x94AE -> u91B1       x94AE -> u9197
       x8FDD -> u91AC       x8FDD -> u91A4
       x966A -> u9830       x966A -> u982C
       xE8F6 -> u9839       xE8F6 -> u983D
       x935E -> u985A       x935E -> u985B
       x91CB -> u9A52       x91CB -> u9A28
       x89A8 -> u9DD7       x89A8 -> u9D0E
       x8CB2 -> u9E7C       x8CB2 -> u9E78
       x8D8D -> u9EB4       x8D8D -> u9EB9
       x96CB -> u9EB5       x96CB -> u9EBA
       xFA59 -> uF86F       xFA59 -> u2116
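
One way to spot-check the decode direction on a given runtime is sketched below. It is an illustration only, not the converter source. The class name Cp942DecodeCheck and its helpers are hypothetical, and only three rows of the "Correct mapping" column are included. It uses the java.nio.charset API, which postdates this report, and assumes the runtime exposes the converter under the name Cp942. If it does not, the program says so and exits.

```java
import java.nio.charset.Charset;
import java.util.LinkedHashMap;
import java.util.Map;

class Cp942DecodeCheck {
    // Sample rows from the "Correct mapping" column: IBM PC code -> Unicode.
    static final Map<Integer, Character> EXPECTED = new LinkedHashMap<>();
    static {
        EXPECTED.put(0x8BA0, '\u4FA0');
        EXPECTED.put(0x948D, '\u5265');
        EXPECTED.put(0xFA59, '\u2116');
    }

    // Split a double-byte code such as 0x8BA0 into its lead and trail bytes.
    static byte[] toBytes(int code) {
        return new byte[] { (byte) (code >> 8), (byte) code };
    }

    // Decode one double-byte code with the given charset.
    static String decode(Charset cs, int code) {
        return new String(toBytes(code), cs);
    }

    public static void main(String[] args) {
        String name = "Cp942"; // assumption: the converter is registered under this name
        if (!Charset.isSupported(name)) {
            System.out.println(name + " is not available in this runtime");
            return;
        }
        Charset cs = Charset.forName(name);
        for (Map.Entry<Integer, Character> e : EXPECTED.entrySet()) {
            char want = e.getValue();
            String got = decode(cs, e.getKey());
            boolean ok = got.length() == 1 && got.charAt(0) == want;
            System.out.printf("x%04X -> u%04X : %s%n",
                    e.getKey(), (int) want, ok ? "ok" : "MISMATCH");
        }
    }
}
```

Extending EXPECTED with the remaining rows of the table would turn this into a full regression check for the byte-to-char direction.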

      CharToByteCp942, CharToByteCp942C ITAIJI mapping
      Unicode to IBM PC
       Current mapping      Correct mapping
       u4FA0 -> x3F         u4FA0 -> x8BA0
       u5265 -> x3F         u5265 -> x948D
       u5516 -> x3F         u5516 -> x88A0
       u565B -> x3F         u565B -> x8A9A
       u56A2 -> x3F         u56A2 -> x9458
       u586B -> x3F         u586B -> x9355
       u5C4F -> x3F         u5C4F -> x9BA0
       u5C61 -> x3F         u5C61 -> x8EC6
       u63B4 -> x3F         u63B4 -> x92CD
       u63BB -> x3F         u63BB -> x917E
       u6505 -> x3F         u6505 -> x9DB7
       u6805 -> x3F         u6805 -> x8DF2
       u688D -> x3F         u688D -> x9E94
       u6D9C -> x3F         u6D9C -> x93C0
       u6E8C -> x3F         u6E8C -> x94AC
       u7114 -> x3F         u7114 -> x898B
       u75E9 -> x3F         u75E9 -> x9189
       u7977 -> x3F         u7977 -> x9398
       u7BAA -> x3F         u7BAA -> x925C
       u7E66 -> x3F         u7E66 -> xE379
       u7E4B -> x3F         u7E4B -> x8C71
       u7E4D -> x3F         u7E4D -> x8F4A
       u80FC -> x3F         u80FC -> xE445
       u8346 -> x3F         u8346 -> x8C74
       u83B1 -> x3F         u83B1 -> x9789
       u848B -> x3F         u848B -> x8FD3
       u8749 -> x3F         u8749 -> x90E4
       u874B -> x3F         u874B -> x9858
       u8EAF -> x3F         u8EAF -> x8BEB
       u9197 -> x3F         u9197 -> x94AE
       u91A4 -> x3F         u91A4 -> x8FDD
       u982C -> x3F         u982C -> x966A
       u983D -> x3F         u983D -> xE8F6
       u985B -> x3F         u985B -> x935E
       u9A28 -> x3F         u9A28 -> x91CB
       u9D0E -> x3F         u9D0E -> x89A8
       u9E78 -> x3F         u9E78 -> x8CB2
       u9EB9 -> x3F         u9EB9 -> x8D8D
       u9EBA -> x3F         u9EBA -> x96CB
       u2116 -> x3F         u2116 -> xFA59

      Additional chars for MS compatibility:
                              u2015 -> x815C
                              uFF5E -> x8160
                              u2225 -> x8161
                              uFF0D -> x817C
                              uFFE4 -> xFA55
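
The encode direction can be illustrated the same way. In the "Current mapping" column every listed character falls through to x3F, the ASCII '?' substitution byte. The sketch below is a hypothetical table-driven stand-in, not the CharToByteCp942 source. The class name ItaijiEncodeSketch is invented for illustration, and the table holds only a handful of the corrected and MS-compatibility entries listed above.

```java
import java.util.HashMap;
import java.util.Map;

class ItaijiEncodeSketch {
    // Byte emitted for characters with no table entry ('?').
    static final int SUBSTITUTE = 0x3F;

    // Sample rows only: Unicode -> IBM PC code, taken from the lists above.
    static final Map<Character, Integer> CHAR_TO_BYTES = new HashMap<>();
    static {
        CHAR_TO_BYTES.put('\u4FA0', 0x8BA0); // corrected ITAIJI entry
        CHAR_TO_BYTES.put('\u2116', 0xFA59); // corrected ITAIJI entry
        CHAR_TO_BYTES.put('\u2015', 0x815C); // MS compatibility addition
        CHAR_TO_BYTES.put('\uFF5E', 0x8160); // MS compatibility addition
    }

    // Return the IBM PC code for c, or the substitution byte if unmapped.
    static int encode(char c) {
        Integer code = CHAR_TO_BYTES.get(c);
        return code != null ? code : SUBSTITUTE;
    }

    public static void main(String[] args) {
        char[] samples = { '\u4FA0', '\uFF5E', '\u5516' };
        for (char c : samples) {
            System.out.printf("u%04X -> x%X%n", (int) c, encode(c));
        }
    }
}
```

With this partial table, u5516 still encodes to x3F. That is the symptom reported for every character in the list until its entry is added.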



      I'm filing this bug on behalf of IBM Japan's DBCS group. Since
      I don't really know that much about these issues myself, feel free
      to contact Masayuki Fuse at <###@###.###> for more info.
      ======================================================================

            Assignee: J. Duke (duke)
            Reporter: Brian Beck (bcbeck, inactive)