Tika / TIKA-1673

Don't include source file name in embedded path with RecursiveParserWrapper

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels: None

      Description

      The RecursiveParserWrapper has been including the source file name in the embedded file path ("X-TIKA:embedded_resource_path"), as in "test_recursive.docx/embed1.zip/embed2.zip/embed3.zip/embed3.txt". If the client does not send in a file name, or if no file name exists, the RecursiveParserWrapper defaults to "embed-1", which is wrong.

      Let's drop the source file name from the path so that the above will be "/embed1.zip/embed2.zip/embed3.zip/embed3.txt".
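
      To make the proposed change concrete, here is a minimal sketch in plain Java of how an old-style path maps to the new form. It is an illustration only, not the RecursiveParserWrapper implementation: the class EmbeddedPathSketch and the method dropSourceName are hypothetical names, and returning an empty string for a path with no embedded segments is an assumption.

```java
// Hypothetical illustration; not part of Tika.
class EmbeddedPathSketch {

    // Removes the leading source-file segment (for example "test_recursive.docx"
    // or the fallback "embed-1") from an old-style embedded resource path,
    // keeping the leading slash of the remainder.
    static String dropSourceName(String oldStylePath) {
        int firstSlash = oldStylePath.indexOf('/');
        if (firstSlash < 0) {
            // Assumption: a path with no embedded segments refers to the
            // container document itself, so nothing remains.
            return "";
        }
        return oldStylePath.substring(firstSlash);
    }

    public static void main(String[] args) {
        String before = "test_recursive.docx/embed1.zip/embed2.zip/embed3.zip/embed3.txt";
        // prints "/embed1.zip/embed2.zip/embed3.zip/embed3.txt"
        System.out.println(dropSourceName(before));
    }
}
```

      With this mapping, the resulting path is the same whether the first segment was a real file name or the "embed-1" fallback.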

            People

            • Assignee:
              Tim Allison (tallison@apache.org)
            • Reporter:
              Tim Allison (tallison@apache.org)