Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-7189

Allow DIH to extract content from embedded documents via Tika

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

      Description

      DIH's TikaEntityProcessor doesn't currently extract content from embedded documents/attachments within a file. It might be useful if users could configure whether or not to include extraction of content from embedded documents.

        Attachments

        Issue Links

          Activity

            People

            • Assignee:
              shalin Shalin Shekhar Mangar
              Reporter:
              tallison Tim Allison

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment