JDK-8303056

Improve support for Unicode characters and digits in JavaDoc search

    • Bug
    • Resolution: Fixed
    • P4
    • 22
    • 20
    • tools
    • None
    • b10
    • generic
    • generic

      There are a few problems in JavaDoc search related to Unicode (non-ASCII) letters and digits.

       - Unicode (non-ASCII) letters confuse the word boundary algorithm, so they can be found even when they occur in the middle or at the end of a word.
       - Unicode (non-ASCII) letters break the camel-case detection when they immediately follow an upper-case letter in a camel-case signature.
       - (Groups of) digits are not recognized as words by the search algorithm unless they follow a separator or non-letter character.
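
      The first problem can be illustrated with a minimal sketch. It is not the JavaDoc search code; the class and method names are hypothetical, and it assumes that a word boundary is any position where a word character meets a non-word character. With an ASCII-only definition of "word character", non-ASCII letters count as non-word characters, so boundaries appear in the middle of a word.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

// Hypothetical illustration only; not taken from the JavaDoc search implementation.
public class WordBoundarySketch {

    // ASCII-only notion of a word character (letters, digits, underscore).
    static boolean isAsciiWordChar(int c) {
        return (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z')
                || (c >= '0' && c <= '9') || c == '_';
    }

    // Unicode-aware notion of a word character.
    static boolean isUnicodeWordChar(int c) {
        return Character.isLetterOrDigit(c) || c == '_';
    }

    // Returns every index where a word character meets a non-word character
    // (or the start/end of the string). Works on UTF-16 units for simplicity,
    // so characters outside the Basic Multilingual Plane are not handled.
    static List<Integer> boundaries(String s, IntPredicate isWordChar) {
        List<Integer> result = new ArrayList<>();
        for (int i = 0; i <= s.length(); i++) {
            boolean before = i > 0 && isWordChar.test(s.charAt(i - 1));
            boolean after = i < s.length() && isWordChar.test(s.charAt(i));
            if (before != after) {
                result.add(i);
            }
        }
        return result;
    }
}
```

      For the word "Größe", `boundaries("Gr\u00f6\u00dfe", WordBoundarySketch::isAsciiWordChar)` yields `[0, 2, 4, 5]`, so a matcher anchored at boundaries could match text starting inside the word. With `WordBoundarySketch::isUnicodeWordChar` the result is `[0, 5]`, the start and end of the word only.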

      Some of these issues are implementation bugs, but the JavaDoc search specification should also be updated to be more explicit about how these cases are handled. Other areas of the search spec could use an update as well. For instance, JavaDoc tags other than `{@index}` also create search items and should be mentioned or referenced in the spec.
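
      As a small illustration of tags that create search items, the sketch below uses `{@index}` alongside `{@systemProperty}`. The issue does not name which other tags it has in mind, so `{@systemProperty}` is chosen here as one example that is known to add entries to the search index. The class name and the property name `sketch.search.limit` are hypothetical.

```java
/**
 * Settings for a hypothetical search demo.
 * The phrase {@index "result limit" maximum number of results shown}
 * is added to the search index by the inline index tag.
 */
public class SearchTagsSketch {

    /**
     * Returns the value of the {@systemProperty sketch.search.limit}
     * system property (a made-up name for this sketch), or 10 if it is unset.
     * The inline tag above also makes the property name searchable.
     */
    static int searchLimit() {
        return Integer.getInteger("sketch.search.limit", 10);
    }
}
```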

            hannesw Hannes Wallnoefer
            hannesw Hannes Wallnoefer
            Votes: 0
            Watchers: 3
