Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-8069054

reU6 JCK test: 23F unicode character should match \w

XMLWordPrintable

    • Verified

      Following negative JCK test currently passes when being run against RI (JDK8b132) however it should fail:

      xml_schema/msData/regex/jaxp/reU6.html#reU6.i

      This negative test verifies that unicode symbol ȿ doesn't match \w regex pattern (XSD and XML are attached). But actually it should because ȿ according to [1] belongs to Ll general category (not P, Z and C) while XSD standard [2] states:

          \w [#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}] (all characters except the set of "punctuation", "separator" and "other" characters)

      The minimized testcase is attached. In order to reproduce the bug:

      1. Compile attached Test32.java;
      2. Run resulted Test32.class with two arguments: path to attached reU6.xsd and path to attached reU6.xml;

      [1] http://www.unicode.org/Public/6.2.0/ucd/UnicodeData.txt
      [2] http://www.w3.org/TR/xmlschema-2/

        1. reU6.xml
          0.3 kB
        2. reU6.xsd
          0.6 kB
        3. Test32.java
          3 kB

            joehw Joe Wang
            grakov Georgiy Rakov (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: