Tika / TIKA-2555

Text with [underline] + [another format] in a Word document generates overlapping HTML tags.

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.17
    • Fix Version/s: 2.0.0, 1.21
    • Component/s: None
    • Labels: None

    Description

      I have a sample .docx document that contains a single line of text.

      Formatting that text as underlined AND at least one of the following two:

      • italic
      • bold

      will cause the generated .xhtml file to contain overlapping tags.

       

      Example:

      The quick brown fox jumps over the lazy dog.

      will result in

      <b><u>The quick brown fox jumps over the lazy dog.</b></u>

      which causes some browsers (Firefox, Chrome) to report an error and not display the content of the file.
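
To make the overlap concrete, here is a minimal sketch of a stack-based nesting check. It is not part of Tika; the function name `first_nesting_error` and the simplified tag regex are assumptions made for illustration only, and the regex is not a full XML parser.

```python
import re

# Simplified tag matcher (an assumption for this sketch, not a real XML parser):
# group 1 = "/" for a closing tag, group 2 = tag name, group 3 = "/" if self-closing.
TAG = re.compile(r"<(/?)([A-Za-z][A-Za-z0-9]*)[^>]*?(/?)>")


def first_nesting_error(markup):
    """Return a description of the first mis-nested tag, or None if all tags nest properly."""
    stack = []  # names of the elements that are currently open
    for match in TAG.finditer(markup):
        closing, name, self_closing = match.group(1), match.group(2).lower(), match.group(3)
        if self_closing:
            continue  # <br/> opens and closes in one step
        if not closing:
            stack.append(name)
        elif not stack:
            return f"unexpected </{name}>"
        elif stack[-1] != name:
            # A closing tag must match the most recently opened element.
            return f"expected </{stack[-1]}> but found </{name}>"
        else:
            stack.pop()
    if stack:
        return f"unclosed <{stack[-1]}>"
    return None


reported = "<b><u>The quick brown fox jumps over the lazy dog.</b></u>"
print(first_nesting_error(reported))
```

Run on the output quoted above, the check reports that `</b>` arrives while `<u>` is still open. The properly nested form `<b><u>…</u></b>` passes the same check.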

       

      Attachments

        1. Clipboard02.jpg
          81 kB
          Serban Alexe

            People

              Assignee: Konstantin Gribov (grossws)
              Reporter: Serban Alexe (serban83)
              Votes: 0
              Watchers: 2
