Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-2416

Solr Cell fails to index Zip file contents

    XMLWordPrintableJSON

Details

    Description

      Working with the latest Solr Trunk code and seems the Tika handlers for Solr Cell (ExtractingDocumentLoader.java) and Data Import handler (TikaEntityProcessor.java) fails to index the zip file contents again.
      It just indexes the file names again.
      This issue was addressed some time back, late last year, but seems to have reappeared with the latest code.

      Jira for the Data Import handler part with the patch and the testcase - https://issues.apache.org/jira/browse/SOLR-2332.

      Attachments

        1. SOLR-2416_ExtractingDocumentLoader.patch
          0.6 kB
          Jayendra Patil
        2. SOLR-4216.patch
          1 kB
          Steve Molloy

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jayendra patil Jayendra Patil
              Votes:
              4 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: