Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-681

Crawling JCIFS to ElasticSearch does not index due to 'tmp' file extensions

    XMLWordPrintableJSON

Details

    Description

      When crawling a JCIFS repository to ElasticSearch, SharedDriveConnector.java#checkIngest() always returns false. The result of that is, that nothing will be indexed in ElasticSearch.

      This seems to happen, since the localFile's extension is 'tmp', rather than the original extension. The line

      //SharedDriveConnector.java[1842]
      isIndexable = activities.checkDocumentIndexable(localFile); 
      

      always evaluates 'false'. Adding 'tmp' to the list of the ElasticSearch Connectors 'allowed file extensions' changes that.

      Attachments

        Activity

          People

            kwright@metacarta.com Karl Wright
            konradkonrad konrad
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: