Bug
Resolution: Won't Fix
P3
None
8u5
CP939 character encoder does not handle invalid characters properly.
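CP939 is an EBCDIC mixed converter. It has two conversion states (SBCS and DBCS),
which are switched by the SO (shift-out) and SI (shift-in) control characters.
CP939's replacement character appears to be \x6F.
If an invalid character is encountered in the DBCS state, it is replaced with \x6F
without a preceding SI, so the single replacement byte is written while the output
is still in the DBCS state. A decoder therefore pairs \x6F with the next byte, and
the valid characters that follow the invalid character are garbled.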
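
The failure mode can be illustrated with a small simulation. This is only a sketch of a generic SO/SI stateful encoder, not the JDK's CP939 implementation: the class name `ShiftStateSketch`, the `encode` method, its `shiftBeforeReplacement` flag, and the byte values in `TABLE` are all hypothetical placeholders. Only SO (\x0E), SI (\x0F), and the replacement byte \x6F are taken from the report and from EBCDIC convention.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class ShiftStateSketch {
    static final int SO = 0x0E;          // shift-out: switch to DBCS state
    static final int SI = 0x0F;          // shift-in: switch back to SBCS state
    static final int REPLACEMENT = 0x6F; // single-byte replacement from the report

    // Hypothetical mapping table; the byte values are placeholders,
    // not the real CP939 assignments.
    static final Map<Character, int[]> TABLE = Map.of(
            'A', new int[] {0xC1},
            'B', new int[] {0xC2},
            '\u3042', new int[] {0x44, 0x81},
            '\u3044', new int[] {0x44, 0x82});

    // Encodes s. When shiftBeforeReplacement is false, the replacement byte
    // is written without leaving DBCS state (the behavior described above).
    // When true, SI is written first so the replacement lands in SBCS state.
    static List<Integer> encode(String s, boolean shiftBeforeReplacement) {
        List<Integer> out = new ArrayList<>();
        boolean dbcs = false;
        for (char c : s.toCharArray()) {
            int[] bytes = TABLE.get(c);
            if (bytes == null) { // unmappable character
                if (dbcs && shiftBeforeReplacement) {
                    out.add(SI);
                    dbcs = false;
                }
                out.add(REPLACEMENT);
                continue;
            }
            boolean wantDbcs = bytes.length == 2;
            if (wantDbcs != dbcs) { // emit a shift only on a state change
                out.add(wantDbcs ? SO : SI);
                dbcs = wantDbcs;
            }
            for (int b : bytes) {
                out.add(b);
            }
        }
        if (dbcs) {
            out.add(SI); // always finish in SBCS state
        }
        return out;
    }

    static String hex(List<Integer> bytes) {
        StringBuilder sb = new StringBuilder();
        for (int b : bytes) {
            sb.append(String.format("%02X ", b));
        }
        return sb.toString().trim();
    }

    public static void main(String[] args) {
        String input = "\u3042\u0100\u3044"; // U+0100 is not in TABLE
        System.out.println("without SI: " + hex(encode(input, false)));
        System.out.println("with SI:    " + hex(encode(input, true)));
    }
}
```

In the first output, \x6F sits between two double-byte characters with no shift, so a decoder reads it as the first half of a pair and every later byte is misaligned. In the second, the SI before \x6F and the SO after it keep the double-byte pairs intact.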