Issue | Fix Version | Assignee | Priority | Status | Resolution | Resolved In Build |
---|---|---|---|---|---|---|
JDK-2018282 | 1.2.0 | John Oconner | P2 | Resolved | Fixed | 1.2beta3 |
Name: bb33257 Date: 12/10/97
Korean text made up to conjoining Hangul jamo (rather than
precomposed Hangul syllables) isn't treated properly by the
iterator returned by BreakIterator.getCharacterInstance().
It treats each jamo element as a character, rather than treating
whole syllables as characters.
Try the following test text:
\u1109\u1161\u11bc
\u1112\u1161\u11bc
<space>
\u1112\u1161\u11ab
\u110b\u1175\u11ab
<space>
\u110b\u1167\u11ab
\u1112\u1161\u11b8
<space>
\u110c\u1161\u11bc
\u1105\u1169
\u1100\u116d
\u1112\u116c
The breaks should be where the line divisions in the preceding
example are ("<space>" denotes an ASCII space). Right now, there's
a character break after every Unicode character.
======================================================================
- backported by
-
JDK-2018282 BreakIterator doesn't handle conjoining Hangul jamo
- Resolved