Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-4120119

Parcing problem if html has long plain text between body tags

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: P4 P4
    • None
    • 1.2.0
    • client-libs
    • None

      A) Introduction
      I feel there is a parcing problem when HTML file has only plain text file between <body> and </body>.
      I attached 3 test html files:
      test-big.html(68KB)
      test-fail.html(39KB)
      test-ok.html(27K)

      test-fail.html and test-ok.html has the following format. Only difference is their plain text file size between <body> and </body>:
      <html><head><title>some title</title></head>
      <body>
      [Just plain text file]
      </body>
      </html>

      test-big.html is much more big compare to the test-fail.html and test-ok.html ,and it has lots of html tags between <body> and </body>.

      B) PROBLEM
      I used the attached HtmlReader to read those 3 files. First of all I noticed ParagraphView can read test-big.html which is bigger that other two files.

      Reading test-ok.html was fine while reading test-fail.html was failed:
      com.sun.java.swing.text.StateInvariantError: infinite loop in formatting
              at com.sun.java.swing.text.ParagraphView.rebuildRows(ParagraphView.java:163)
              at com.sun.java.swing.text.ParagraphView.layout(ParagraphView.java:127)
              at com.sun.java.swing.text.BoxView.setSize(BoxView.java:129)
              at com.sun.java.swing.text.BoxView.layout(BoxView.java:560)
              at com.sun.java.swing.text.BoxView.setSize(BoxView.java:129)
              at com.sun.java.swing.text.DefaultTextUI$RootView.paint(DefaultTextUI.java:740)
              at com.sun.java.swing.text.DefaultTextUI.paintSafely(DefaultTextUI.java:335)
              at com.sun.java.swing.text.DefaultTextUI$SafePainter.run(DefaultTextUI.java:999)
              at com.sun.java.swing.text.AbstractDocument.render(AbstractDocument.java:249)
              at com.sun.java.swing.text.DefaultTextUI.paint(DefaultTextUI.java:442)
              at com.sun.java.swing.plaf.ComponentUI.update(ComponentUI.java:47)
              at com.sun.java.swing.JComponent.paintComponent(JComponent.java:374)
              at com.sun.java.swing.JComponent.paint(JComponent.java:579)
              at com.sun.java.swing.JComponent.paintChildren(JComponent.java:434)
              at com.sun.java.swing.JComponent.paint(JComponent.java:582)
              at com.sun.java.swing.JViewport.paint(JViewport.java:266)
              at com.sun.java.swing.JComponent.paintChildren(JComponent.java:434)
              at com.sun.java.swing.JComponent.paint(JComponent.java:582)
              at com.sun.java.swing.JComponent.paintChildren(JComponent.java:434)
              at com.sun.java.swing.JComponent.paint(JComponent.java:582)
              at com.sun.java.swing.JComponent.paintChildren(JComponent.java:434)
              at com.sun.java.swing.JComponent.paint(JComponent.java:582)
              at com.sun.java.swing.JLayeredPane.paint(JLayeredPane.java:396)
              at com.sun.java.swing.JComponent.paintChildren(JComponent.java:434)
              at com.sun.java.swing.JComponent.paint(JComponent.java:567)
              at java.awt.Container.paint(Container.java:702)
              at java.awt.Component.dispatchEventImpl(Component.java:1723)
              at java.awt.Container.dispatchEventImpl(Container.java:939)
              at java.awt.Window.dispatchEventImpl(Window.java:443)
              at java.awt.Component.dispatchEvent(Component.java:1704)
              at java.awt.EventDispatchThread.run(EventDispatchThread.java:63)

      The fact that decides fail or succeed in this format is the SIZE of plain text between <body> tags because:
      if I add more plain text into test-ok.html(27KB) so that its size is about the size of test-fail.html(39KB) then test-ok.html is also failed. But I can't tell the maximum size that ParagraphView can display is about 27KB because even test-big.html(68KB) was fine to display. One interesting fact is 'parcing performance compare' as below.

      test-big.html(68KB) : 25 sec to display
      test-ok.html(27KB): over 45 sec to display

      C) Summary
      There is an inefficience in parcing html file that has plain text between body tags. As a result of this I got exceptions if the file size is bigger than around 27KB.
      bae-chul.kim@Eng 1998-03-16

            tprinzing Tim Prinzing (Inactive)
            bkimsunw Bae-chul Kim (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Resolved:
              Imported:
              Indexed: