Details
-
Enhancement
-
Resolution: Fixed
-
P3
-
5.0, 6, 7
-
b55
-
generic
-
generic
-
Verified
Description
http://www.unicode.org/versions/Unicode5.0.0/
Major changes from 4.0.0 to 4.1.0:
* Addition of 1273 new characters to the standard, including those to complete roundtrip mapping of the HKSCS and GB 18030 standards, five new currency signs, some characters for Indic and Korean, and eight new scripts. (The exact list of additions can be seen in DerivedAge.txt, in the age=4.1 section.)
* Change in the end of the CJK Unified Ideographs range from U+9FA5 to U+9FBB, with the addition of some Han characters. The boundaries of such ranges are sometimes hardcoded in software, in which case the hardcoded value needs to be changed.
* New Unicode Standard Annexes: UAX #31, Identifier and Pattern Syntax and UAX #34, Unicode Named Character Sequences, and significant changes to other Unicode Standard Annexes.
Unicode 5.0 (beta) includes (diffs from 4.1.0):
* addition of more than 1300 new characters (5 new scripts)
* full interoperability with the most recent versions of GB 18030, JIS X 0213, and HKSCS
* case mappings update
* improvements on text segmation
* tightens conformance requirements on Bidi implementations
Unicode 5.1 was released on April 4, 2008. We should support it.
For details, see http://www.unicode.org/press/pr-5.1.html
Attachments
Issue Links
- duplicates
-
JDK-5034599 RFE: Upgrade to Unicode 4.1
- Closed
-
JDK-6562766 [Nor] Uncode 5.1 support in java.text.Normalizer
- Closed
-
JDK-6565620 [Ch, Str] Unicode 5.1 support in Character and String
- Closed
- relates to
-
JDK-4640853 Support latest Unicode version
- Resolved
-
JDK-6866243 Javadoc for java.lang.Character still refers to Unicode 4 instead of 5
- Closed
-
JDK-8020037 String.toLowerCase incorrectly increases length, if string contains \u0130 char
- Closed
-
JDK-8041791 String.toLowerCase regression - violates Unicode standard
- Closed