-
Bug
-
Resolution: Fixed
-
P4
-
1.2.0
-
kestrel
-
generic
-
generic
Name: bb33257 Date: 12/23/98
The converters for Cp942 and Cp942C (Japanese IBM PC) have a number of problems
dealing with the "ITAIJI" characters.
The broken conversions, and the corrections, are as follows:
ByteToCharCp942, ByteToCharCp942C ITAIJI mapping
IBM PC to Unicode
Current mapping Correct mapping
x8BA0 -> u4FE0 x8BA0 -> u4FA0
x948D -> u525D x948D -> u5265
x88A0 -> u555E x88A0 -> u5516
x8A9A -> u5699 x8A9A -> u565B
x9458 -> u56CA x9458 -> u56A2
x9355 -> u5861 x9355 -> u586B
x9BA0 -> u5C5B x9BA0 -> u5C4F
x8EC6 -> u5C62 x8EC6 -> u5C61
x92CD -> u6451 x92CD -> u63B4
x917E -> u6414 x917E -> u63BB
x9DB7 -> u6522 x9DB7 -> u6505
x9E94 -> u688E x9E94 -> u688D
x93C0 -> u7006 x93C0 -> u6D9C
x94AC -> u6F51 x94AC -> u6E8C
x898B -> u7130 x898B -> u7114
x9189 -> u7626 x9189 -> u75E9
x9398 -> u79B1 x9398 -> u7977
x925C -> u7C1E x925C -> u7BAA
xE379 -> u7E48 xE379 -> u7E66
x8C71 -> u7E6B x8C71 -> u7E4B
x8F4A -> u7E61 x8F4A -> u7E4D
xE445 -> u8141 xE445 -> u80FC
x9789 -> u840A x9789 -> u83B1
x8FD3 -> u8523 x8FD3 -> u848B
x90E4 -> u87EC x90E4 -> u8749
x9858 -> u881F x9858 -> u874B
x8BEB -> u8EC0 x8BEB -> u8EAF
x94AE -> u91B1 x94AE -> u9197
x8FDD -> u91AC x8FDD -> u91A4
x966A -> u9830 x966A -> u982C
xE8F6 -> u9839 xE8F6 -> u983D
x935E -> u985A x935E -> u985B
x91CB -> u9A52 x91CB -> u9A28
x89A8 -> u9DD7 x89A8 -> u9D0E
x8CB2 -> u9E7C x8CB2 -> u9E78
x8D8D -> u9EB4 x8D8D -> u9EB9
x96CB -> u9EB5 x96CB -> u9EBA
xFA59 -> uF86F xFA59 -> u2116
CharToByteCp942, CharToByteCp942C ITAIJI mapping
Unicode to IBM PC
Current mapping Correct mapping
u4FA0 -> x3F u4FA0 -> x8BA0
u5265 -> x3F u5265 -> x948D
u5516 -> x3F u5516 -> x88A0
u565B -> x3F u565B -> x8A9A
u56A2 -> x3F u56A2 -> x9458
u586B -> x3F u586B -> x9355
u5C4F -> x3F u5C4F -> x9BA0
u5C61 -> x3F u5C61 -> x8EC6
u63B4 -> x3F u63B4 -> x92CD
u63BB -> x3F u63BB -> x917E
u6505 -> x3F u6505 -> x9DB7
u6805 -> x3F u6805 -> x8DF2
u688D -> x3F u688D -> x9E94
u6D9C -> x3F u6D9C -> x93C0
u6E8C -> x3F u6E8C -> x94AC
u7114 -> x3F u7114 -> x898B
u75E9 -> x3F u75E9 -> x9189
u7977 -> x3F u7977 -> x9398
u7BAA -> x3F u7BAA -> x925C
u7E66 -> x3F u7E66 -> xE379
u7E4B -> x3F u7E4B -> x8C71
u7E4D -> x3F u7E4D -> x8F4A
u80FC -> x3F u80FC -> xE445
u8346 -> x3F u8346 -> x8C74
u83B1 -> x3F u83B1 -> x9789
u848B -> x3F u848B -> x8FD3
u8749 -> x3F u8749 -> x90E4
u874B -> x3F u874B -> x9858
u8EAF -> x3F u8EAF -> x8BEB
u9197 -> x3F u9197 -> x94AE
u91A4 -> x3F u91A4 -> x8FDD
u982C -> x3F u982C -> x966A
u983D -> x3F u983D -> xE8F6
u985B -> x3F u985B -> x935E
u9A28 -> x3F u9A28 -> x91CB
u9D0E -> x3F u9D0E -> x89A8
u9E78 -> x3F u9E78 -> x8CB2
u9EB9 -> x3F u9EB9 -> x8D8D
u9EBA -> x3F u9EBA -> x96CB
u2116 -> x3F u2116 -> xFA59
Additional chars for MS compatibility:
u2015 -> x815C
uFF5E -> x8160
u2225 -> x8161
uFF0D -> x817C
uFFE4 -> xFA55
I'm filing this bug on behalf of IBM Japan's DBCS group. Since
I don't really know that much about these issues myself, feel free
to contact Masayuki Fuse at <###@###.###> for more info.
======================================================================