Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-4499288

(cs spec) Charset terminology problems

XMLWordPrintable

    • b25
    • generic
    • generic
    • Not verified

      In the java.nio.charset.Charset description, there are some inaccurate
      definitions.

        - "a charset is defined as the combination of a coded character set and
          a character-encoding scheme" assumes one coded character set. But it
          should be one or multiple coded character sets.

        - "full Unicode, which is the same as ISO 10646-1" is not accurate. They
          are not the same thing as Unicode.org says. If it means only in the
          context of a coded character set, the description should clarify the
          revisions.

        - In "A character-encoding scheme is a mapping between a coded character
          set and a set of octet (eight-bit byte) sequences.", "a coded character
          set" should be "a coded character set or several coded character
          sets" as defined in RFC 2278. The examples clarify it, though.

        - "Some schemes, however, are associated with multiple character sets;
          EUC, for example, can be used to encode characters in a variety of
          Asian character sets." should be "Some schemes, however, are
          associated with multiple coded character sets; EUC, for example, can
          be used to encode characters in a variety of Asian coded character
          sets."

            sherman Xueming Shen
            okutsu Masayoshi Okutsu
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Resolved:
              Imported:
              Indexed: