Following negative JCK test currently passes when being run against RI (JDK8b132) however it should fail:
xml_schema/msData/regex/jaxp/reU6.html#reU6.i
This negative test verifies that unicode symbol ȿ doesn't match \w regex pattern (XSD and XML are attached). But actually it should because ȿ according to [1] belongs to Ll general category (not P, Z and C) while XSD standard [2] states:
\w [#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}] (all characters except the set of "punctuation", "separator" and "other" characters)
The minimized testcase is attached. In order to reproduce the bug:
1. Compile attached Test32.java;
2. Run resulted Test32.class with two arguments: path to attached reU6.xsd and path to attached reU6.xml;
[1] http://www.unicode.org/Public/6.2.0/ucd/UnicodeData.txt
[2] http://www.w3.org/TR/xmlschema-2/
xml_schema/msData/regex/jaxp/reU6.html#reU6.i
This negative test verifies that unicode symbol ȿ doesn't match \w regex pattern (XSD and XML are attached). But actually it should because ȿ according to [1] belongs to Ll general category (not P, Z and C) while XSD standard [2] states:
\w [#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}] (all characters except the set of "punctuation", "separator" and "other" characters)
The minimized testcase is attached. In order to reproduce the bug:
1. Compile attached Test32.java;
2. Run resulted Test32.class with two arguments: path to attached reU6.xsd and path to attached reU6.xml;
[1] http://www.unicode.org/Public/6.2.0/ucd/UnicodeData.txt
[2] http://www.w3.org/TR/xmlschema-2/