Big string constants duplicated between encodings

XMLWordPrintable

    • Type: Bug
    • Resolution: Cannot Reproduce
    • Priority: P4
    • None
    • Affects Version/s: 1.2.0
    • Component/s: core-libs
    • None

      There are a number of replicated strings in the encodings. For example,
      CharToByteCp949 and CharToByteCp949C are almost identical (diff them to
      see). This representation is bad for footprint on disk and in memory.
      (I discovered the redundancy while looking at compression of constant pool
      redundancies.)

      Other giant byte-for-byte duplications are in ByteToCharCp930 and
      CharToByteEUC_KR. In addition to byte-for-byte equal arrays, it would
      also make sense to represent arrays that differ only in one element
      differently, for example, by testing for the differing index instead
      of replicating the array.

            Assignee:
            Xueming Shen
            Reporter:
            David Stoutamire (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Resolved:
              Imported:
              Indexed: