JDK-8339284

Invalid character in OtherCommonLocales.properties


    • Type: Bug
    • Resolution: Fixed
    • Priority: P4
    • Fix Version: 22
    • Affects Version: 21
    • Component: core-libs
    • None

      In JDK 21, the contents of make/jdk/src/classes/build/tools/cldrconverter/OtherCommonLocales.properties are invalid UTF-8:

      $ grep --text mgo ./make/jdk/src/classes/build/tools/cldrconverter/OtherCommonLocales.properties
      mgo=Meta��
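
      Rather than grepping for a single key, the whole file can be validated by round-tripping it through a decoder. The following is a minimal sketch, assuming `iconv` is installed; the helper name `is_valid_utf8` and the temporary sample file are hypothetical stand-ins, not part of the JDK build.

```shell
# Succeeds only if the file decodes cleanly as UTF-8.
# (Hypothetical helper; relies on iconv exiting non-zero on bad input.)
is_valid_utf8() {
  iconv -f UTF-8 -t UTF-8 "$1" > /dev/null 2>&1
}

# Stand-in for the checked-in file: the key followed by the two
# bytes reported in this issue (0xA0 0xBA, written in octal for printf).
sample=$(mktemp)
printf 'mgo=Meta\240\272\n' > "$sample"

if is_valid_utf8 "$sample"; then
  echo "valid UTF-8"
else
  echo "invalid UTF-8"
fi
rm -f "$sample"
```

      To check the real file, the same helper could be pointed at make/jdk/src/classes/build/tools/cldrconverter/OtherCommonLocales.properties.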

      The file was adapted from CLDR data that I think is now located here: https://raw.githubusercontent.com/unicode-org/cldr-staging/release-42/docs/charts/42/tsv/locale-coverage.tsv

      From

      curl https://raw.githubusercontent.com/unicode-org/cldr-staging/release-42/docs/charts/42/tsv/locale-coverage.tsv | grep Meta | xxd

      it looks like the character was supposed to be U+02BC (`cabc` in the UTF-8 encoding). The bytes at the corresponding position in the checked-in file are `a0ba`, which is not valid UTF-8: 0xA0 is a continuation byte and cannot start a sequence. The contents of OtherCommonLocales.properties in the latest version of the JDK repo match the CLDR data.
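
      The expected bytes can be derived by hand from the code point. This is a small sketch of the two-byte UTF-8 rule (code points U+0080 through U+07FF); the function name `utf8_two_byte_hex` is made up for illustration.

```shell
# Encode a code point in the range U+0080..U+07FF as two UTF-8 bytes.
# Layout: 110xxxxx 10xxxxxx, where the x bits are the 11-bit code point.
utf8_two_byte_hex() {
  cp=$(( $1 ))
  b1=$(( 0xC0 | (cp >> 6) ))    # lead byte carries the top 5 bits
  b2=$(( 0x80 | (cp & 0x3F) ))  # continuation byte carries the low 6 bits
  printf '%02x%02x\n' "$b1" "$b2"
}

utf8_two_byte_hex 0x02BC   # prints cabc
```

      A lead byte produced this way always falls in 0xC2..0xDF, which is why a sequence starting with 0xA0 cannot be a valid encoding of any character.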

            Assignee: Naoto Sato (naoto)
            Reporter: Liam Miller-Cushon (cushon)