Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Fixed
Priority: P3
Fix Version/s: 7
Affects Version/s: 5.0, 6, 7
Component/s: core-libs
Labels:
- unicode
- verify-pit

Subcomponent:
java.lang
Resolved In Build:
b55
CPU:

generic
OS:

generic
Verification:
Verified

Unicode 5.0 will be released in CY06Q3. Refer to the following URL:

http://www.unicode.org/versions/Unicode5.0.0/

Major changes from 4.0.0 to 4.1.0:

    * Addition of 1273 new characters to the standard, including those to complete roundtrip mapping of the HKSCS and GB 18030 standards, five new currency signs, some characters for Indic and Korean, and eight new scripts. (The exact list of additions can be seen in DerivedAge.txt, in the age=4.1 section.)
    * Change in the end of the CJK Unified Ideographs range from U+9FA5 to U+9FBB, with the addition of some Han characters. The boundaries of such ranges are sometimes hardcoded in software, in which case the hardcoded value needs to be changed.
    * New Unicode Standard Annexes: UAX #31, Identifier and Pattern Syntax and UAX #34, Unicode Named Character Sequences, and significant changes to other Unicode Standard Annexes.

Unicode 5.0 (beta) includes (diffs from 4.1.0):

    * addition of more than 1300 new characters (5 new scripts)
    * full interoperability with the most recent versions of GB 18030, JIS X 0213, and HKSCS
    * case mappings update
    * improvements on text segmation
    * tightens conformance requirements on Bidi implementations
Unicode 5.1 was released on April 4, 2008. We should support it.
For details, see http://www.unicode.org/press/pr-5.1.html

duplicates

JDK-5034599 RFE: Upgrade to Unicode 4.1

Closed

JDK-6562766 [Nor] Uncode 5.1 support in java.text.Normalizer

Closed

JDK-6565620 [Ch, Str] Unicode 5.1 support in Character and String

Closed

relates to

JDK-4640853 Support latest Unicode version

Resolved

JDK-6866243 Javadoc for java.lang.Character still refers to Unicode 4 instead of 5

Closed

JDK-8020037 String.toLowerCase incorrectly increases length, if string contains \u0130 char

Closed

JDK-8041791 String.toLowerCase regression - violates Unicode standard

Closed

(2 relates to)

Assignee:: Yuka Kamiya (Inactive)
Reporter:: Masayoshi Okutsu
Votes:: 0 Vote for this issue
Watchers:: 1 Start watching this issue

Created:: 2006-03-26 20:10
Updated:: 2020-02-19 15:20
Resolved:: 2011-03-07 14:21
Imported:: 15/Sep/12 11:23 PM
Indexed:: 17/Jul/12 7:42 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates