ManifoldCF / CONNECTORS-1563

SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes

Details

    • Type: Task
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Lucene/SOLR connector
    • Labels: None

    Description

      I am encountering the following problem.

      When I check the "Use the Extract Update Handler:" parameter, Solr returns this error: null:org.apache.solr.common.SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes

      If I ignore the Tika exception, my documents get indexed but have no content field in Solr.

      I am using Solr 7.3.1 and ManifoldCF 2.8.1.
      I am using Solr Cell, so I have not configured an external Tika extractor in the ManifoldCF pipeline.

      Please help me with this problem.

      Thanks in advance
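
      For reference, the exception message says that Tika was handed an input stream with no bytes in it. The following is a minimal sketch, using only the Java standard library, of how an empty stream can be detected before it is passed to a parser. The class name EmptyStreamCheck and the method nonEmptyOrNull are hypothetical and are not part of ManifoldCF, Solr, or Tika; this is only an illustration of the condition, not how either product handles it.

      ```java
      import java.io.ByteArrayInputStream;
      import java.io.IOException;
      import java.io.InputStream;
      import java.io.PushbackInputStream;

      // Hypothetical helper: not part of ManifoldCF, Solr, or Tika.
      public class EmptyStreamCheck {

          // Returns null when the stream has no bytes; otherwise returns a
          // stream that still yields every byte of the original content.
          public static InputStream nonEmptyOrNull(InputStream in) throws IOException {
              PushbackInputStream pushback = new PushbackInputStream(in, 1);
              int first = pushback.read();   // -1 means end of stream, i.e. zero bytes
              if (first == -1) {
                  return null;
              }
              pushback.unread(first);        // put the probed byte back
              return pushback;
          }

          public static void main(String[] args) throws IOException {
              InputStream empty = nonEmptyOrNull(new ByteArrayInputStream(new byte[0]));
              InputStream full = nonEmptyOrNull(new ByteArrayInputStream(new byte[] {42}));
              System.out.println("empty stream detected: " + (empty == null));
              System.out.println("non-empty stream kept: " + (full != null));
          }
      }
      ```

      A check of this kind only shows whether the content stream is empty at the point where it is inspected; it does not explain why no content arrived there.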

      Attachments

        1. managed-schema (33 kB, Sneha)
        2. solrconfig.xml (54 kB, Sneha)
        3. manifold settings.docx (248 kB, Subasini Rath)
        4. solr.log (2.46 MB, Subasini Rath)
        5. manifoldcf.log (989 kB, Subasini Rath)
        6. Document simple history.docx (407 kB, Subasini Rath)
        7. Manifold and Solr settings_CustomField.docx (344 kB, Subasini Rath)
        8. schema.png (21 kB, Subasini Rath)
        9. path.png (16 kB, Subasini Rath)

          People

            Assignee: Karl Wright (kwright@metacarta.com)
            Reporter: Sneha (snehasingh684@gmail.com)