Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2391

Extract <script> elements in html as "attachment" type MACRO like we do in the PDFParser

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 1.16
    • None
    • None

    Attachments

      1. TIKA_2391____first_draft.patch
        8 kB
        Tim Allison
      2. testScripts.htm
        53 kB
        Tim Allison
      3. proposed_output.txt
        28 kB
        Tim Allison

      Issue Links

        Activity

          People

            tallison Tim Allison
            tallison Tim Allison
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: