-
Bug
-
Resolution: Fixed
-
P4
-
17
-
b04
-
generic
-
generic
-
Verified
Issue | Fix Version | Assignee | Priority | Status | Resolution | Resolved In Build |
---|---|---|---|---|---|---|
JDK-8276874 | 17.0.3-oracle | Alex Kasko | P4 | Resolved | Fixed | b01 |
JDK-8276719 | 17.0.2 | Alex Kasko | P4 | Resolved | Fixed | b04 |
JDK-8310038 | 11.0.21 | Paul Hohensee | P4 | Resolved | Fixed | b01 |
1. Using OpenJDK bundled XML Transformer.
2. Input XML contains Unicode supplementary character.
3. Output file format is HTML.
In this case, generated HTML contains incorrect character.
If output file format is TEXT, generated TEXT is fine.
Transformer has serializer class ToHTMLStream and ToTextStream.
In ToTextStream, surrogate pair character is converted #&xxxx; format,
but ToHTMLStream does not convert to #&xxxx; format.
I think ToHTMLStream should implement like as ToTextStream.
To reproduce,
1. Compile and run attached SurrogateTest.java. Result file is stored in testdata directory.
2. Comfirm the value of the input tag on the case01out.html.
Expected: <input id="tag1" value="𠮟">
Actual: <input id="tag1" value="𠮟�">
3. ToTextStream result is fine (case02out.txt)
- backported by
-
JDK-8276719 XML Transformer outputs Unicode supplementary character incorrectly to HTML
- Resolved
-
JDK-8276874 XML Transformer outputs Unicode supplementary character incorrectly to HTML
- Resolved
-
JDK-8310038 XML Transformer outputs Unicode supplementary character incorrectly to HTML
- Resolved
- relates to
-
JDK-8274606 Fix jaxp/javax/xml/jaxp/unittest/transform/SurrogateTest.java test
- Resolved
- links to
-
Commit openjdk/jdk11u-dev/ad283ae8
-
Commit openjdk/jdk17u/0ab99ed0
-
Commit openjdk/jdk/83bce94c
-
Review openjdk/jdk11u-dev/1907
-
Review openjdk/jdk17u/162
-
Review openjdk/jdk/4474
-
Review(master) openjdk/jdk8u-dev/517