Type: Enhancement
Resolution: Duplicate
Priority: P4
Fix Version/s: None
Affects Version/s: 1.2.0
Component/s: core-libs
Labels:
- converterspi
- licbug

Subcomponent: java.nio.charsets
CPU: generic
OS: generic

Name: bb33257 Date: 08/14/98

The Java API for character set conversion is deficient in ways
that seriously compromise its usefulness.

No Converter Names. There is no way to get a human-readable name for a converter, so if you have
no idea what Cp964 is, for example, you are stuck. For an example of how such names are used, see
Netscape's Encoding menu.
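For comparison, here is a minimal sketch of a name lookup using the java.nio.charset API that later shipped in the JDK (it did not exist in 1.2.0). The class name CharsetNames and its helper methods are made up for illustration. Note that displayName() falls back to the canonical name unless a provider supplies a localized one, so it may not be any more readable.

```java
import java.nio.charset.Charset;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper class, not part of the JDK.
final class CharsetNames {
    // Resolves a canonical name or alias (case-insensitive) to the canonical name.
    static String canonicalName(String nameOrAlias) {
        return Charset.forName(nameOrAlias).name();
    }

    // Returns the aliases the runtime knows for the charset, sorted for stable output.
    static Set<String> aliasesOf(String nameOrAlias) {
        return new TreeSet<>(Charset.forName(nameOrAlias).aliases());
    }

    public static void main(String[] args) {
        String query = args.length > 0 ? args[0] : "UTF-16BE";
        Charset cs = Charset.forName(query);
        System.out.println("canonical: " + cs.name());
        System.out.println("display:   " + cs.displayName());
        System.out.println("aliases:   " + aliasesOf(query));
    }
}
```

Charset.forName throws UnsupportedCharsetException when the runtime has no charset of that name, so a name such as Cp964 resolves only on installations that include it.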

Moreover, many of the names are misleading, such as the ones listed in the table below. If you
are converting a series of strings to be concatenated later, for example, you want a signature only on
the very first of them (if at all). So you need to know to start with UnicodeBig but continue with
UnicodeBigUnmarked (or, for little endian, start with UnicodeLittle and continue with
UnicodeLittleUnmarked).

  Java Name               Description
  Unicode, UnicodeBig     Unicode (UTF-16BE), prefaced by FEFF signature
  UnicodeLittle           Little Endian Unicode (UTF-16LE), prefaced by FFFE signature
  UnicodeBigUnmarked      Unicode (UTF-16BE)
  UnicodeLittleUnmarked   Little Endian Unicode (UTF-16LE)
  JIS                     JIS with ISO 2022 announcers
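The concatenation case can be sketched as follows. This is an illustration only: it uses the StandardCharsets constants from later JDK releases rather than the converter names in the table, on the assumption that UTF_16 encodes big-endian with a leading FE FF signature and UTF_16BE encodes without one. The class name SignatureOnce is hypothetical, and the sketch assumes the first part is non-empty.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of the JDK.
final class SignatureOnce {
    // Encodes each part separately and concatenates the bytes.
    // Only the first part is encoded with the signature-writing charset.
    static byte[] encodeAll(String... parts) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < parts.length; i++) {
            Charset cs = (i == 0) ? StandardCharsets.UTF_16      // writes FE FF first
                                  : StandardCharsets.UTF_16BE;   // no signature
            byte[] encoded = parts[i].getBytes(cs);
            out.write(encoded, 0, encoded.length);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        StringBuilder hex = new StringBuilder();
        for (byte b : encodeAll("A", "B")) {
            hex.append(String.format("%02X ", b & 0xFF));
        }
        System.out.println(hex.toString().trim());
    }
}
```

Encoding every part with the signature-writing charset instead would embed FE FF between the strings, which is the problem the unmarked variants avoid.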

Illegal Codes. There is no way to control what happens with illegal byte sequences. They are
usually just skipped, with no warning.
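As a sketch of the kind of control being asked for, the java.nio.charset API in later JDK releases lets the caller choose what a decoder does with bad input. The example below assumes UTF-8 input and asks the decoder to report malformed bytes instead of skipping them. The class name StrictDecode and its methods are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of the JDK.
final class StrictDecode {
    // Decodes UTF-8 and throws instead of silently dropping illegal sequences.
    static String decodeStrict(byte[] bytes) throws CharacterCodingException {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        return decoder.decode(ByteBuffer.wrap(bytes)).toString();
    }

    // Convenience check: true if the bytes decode without any error.
    static boolean isWellFormed(byte[] bytes) {
        try {
            decodeStrict(bytes);
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        byte[] good = {0x41, 0x42};
        byte[] bad = {0x41, (byte) 0xFF, 0x42}; // 0xFF never occurs in valid UTF-8
        System.out.println(isWellFormed(good));
        System.out.println(isWellFormed(bad));
    }
}
```

CodingErrorAction.IGNORE and CodingErrorAction.REPLACE are the other available actions, for callers that do want illegal sequences skipped or substituted.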

======================================================================

Duplicates: JDK-4313884 New I/O: Character-set API (Resolved)

Assignee: Unassigned
Reporter: Brian Beck (Inactive)
Votes: 0
Watchers: 0

Created: 1998-08-14 13:04
Updated: 2001-07-16 15:49
Resolved: 2001-07-16 15:49
Imported: 15/Sep/12 1:11 PM
Indexed: 17/Jul/12 10:46 AM
