JDK-8277442

Supporting charset GB18030-2005

    • Type: Enhancement
    • Resolution: Won't Fix
    • Priority: P3
    • None
    • 8, 11, 17, 18
    • core-libs
    • None

      The GB18030 charset has two versions, GB18030-2000 and GB18030-2005. The character mappings for the Unicode code points '\u1E3F' and '\uE7C7' were swapped in GB18030-2005 relative to GB18030-2000, reportedly to correct a mistake. That is the only difference between the two versions.
      From an operating system standpoint, Linux has moved to a GB18030-2005 base via glibc, whereas Windows 10 and AIX 7.2 are still on GB18030-2000. We have users who want to use GB18030-2005, so supporting both versions of the charset would be a good interim solution.
      From a brief discussion on the i18n-dev mailing list, we seem to have reached a consensus that GB18030 as it is defined today will continue to be GB18030-2000. We propose no changes to it. Alongside it, we propose that a new Charset with the name "GB18030-2005" be implemented. This is the scope of this enhancement request.
      In future feature releases, we might want to introduce "GB18030-2000" as an alias of the existing "GB18030" and provide a system property to switch between the two versions. However, this is out of the scope of this enhancement request.
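
      To see which mapping a given runtime applies to the two code points discussed above, a small probe along the following lines may help. This is a minimal sketch, not part of the proposal: the class name Gb18030Probe and its helper methods are hypothetical, and the charset name "GB18030-2005" is only the name proposed in this request, so the probe checks for it with Charset.isSupported and skips it when it is absent. The bytes printed depend on the JDK version and platform, so no expected output is given.

```java
import java.nio.charset.Charset;

// Hypothetical probe class; the names here are illustrative only.
class Gb18030Probe {

    // Formats bytes as space-separated uppercase hex, e.g. "A8 BC".
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    // Encodes a single code point with the named charset and returns the hex form.
    static String encode(String charsetName, int codePoint) {
        Charset cs = Charset.forName(charsetName);
        String s = new String(Character.toChars(codePoint));
        return hex(s.getBytes(cs));
    }

    public static void main(String[] args) {
        // "GB18030-2005" is the name proposed in this request; it is an
        // assumption that a runtime may or may not provide it.
        String[] names = {"GB18030", "GB18030-2005"};
        int[] codePoints = {0x1E3F, 0xE7C7};

        for (String name : names) {
            if (!Charset.isSupported(name)) {
                System.out.println(name + ": not supported by this runtime");
                continue;
            }
            for (int cp : codePoints) {
                System.out.printf("%s: U+%04X -> %s%n", name, cp, encode(name, cp));
            }
        }
    }
}
```

      Comparing the output on different platforms or JDK releases shows whether the two code points map to the byte sequences of the 2000 or the 2005 version of the standard.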

            naoto Naoto Sato
            rpatil Ramanand Patil (Inactive)
