Tika / TIKA-2104

Upgrade to a version of POI that fixes common bugs in macro extraction, when available

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.15, 2.0.0
    • Component/s: None
    • Labels: None

    Description

      On TIKA-2069, we found two bugs in POI that prevented the extraction of macros from Microsoft Office files. Let's use this issue to track the fixes in POI.

      Currently known POI bugs:
      60162 (duplicate of 59302)
      60158
      59830
      59858
      60273

      After we release Tika 1.14, let's remove the catch blocks in Tika and rerun against our regression corpus to help identify the most common bugs and find new ones.
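
As a rough illustration of the kind of catch block discussed above, here is a minimal sketch of a defensive wrapper that swallows macro-extraction failures and tallies them by exception class, so the most common failures stand out after a corpus run. This is not Tika's actual code: `MacroFailureTally`, `MacroSource`, and `extractQuietly` are hypothetical names, and `MacroSource` only stands in for whatever reader is used to pull macros out of a file.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.TreeMap;

public class MacroFailureTally {

    /** Hypothetical stand-in for a macro reader; not a Tika or POI type. */
    public interface MacroSource {
        Map<String, String> readMacros() throws Exception;
    }

    // Failure counts keyed by exception class name, kept sorted by name.
    private final Map<String, Integer> failures = new TreeMap<>();

    /**
     * Defensive variant: record the failure and return an empty map so that
     * extraction of the rest of the document can continue.
     */
    public Map<String, String> extractQuietly(MacroSource source) {
        try {
            return source.readMacros();
        } catch (Exception e) {
            failures.merge(e.getClass().getName(), 1, Integer::sum);
            return new LinkedHashMap<>();
        }
    }

    /** Read-only view of the tally collected so far. */
    public Map<String, Integer> failureCounts() {
        return Collections.unmodifiableMap(failures);
    }

    public static void main(String[] args) {
        MacroFailureTally tally = new MacroFailureTally();

        // One source that succeeds and three that fail in two different ways.
        tally.extractQuietly(() -> Collections.singletonMap("Module1", "Sub A()\nEnd Sub"));
        tally.extractQuietly(() -> { throw new IllegalStateException("bad offset"); });
        tally.extractQuietly(() -> { throw new IllegalStateException("bad offset"); });
        tally.extractQuietly(() -> { throw new java.io.IOException("truncated stream"); });

        System.out.println(tally.failureCounts());
    }
}
```

Removing the catch block, as proposed above, would amount to calling `readMacros()` directly and letting the exception propagate, so that each failure shows up with its full stack trace in the regression run.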

      As always, patches are welcome on POI!

      Attachments

        1. newExceptionsInBDetails.xlsx
          119 kB
          Tim Allison
        2. newExceptionsInBByMimeTypeByStackTrace.xlsx
          7 kB
          Tim Allison

            People

              Assignee: Unassigned
              Reporter: Tim Allison (tallison)
              Votes: 0
              Watchers: 1
