ManifoldCF / CONNECTORS-681

Crawling JCIFS to ElasticSearch does not index due to 'tmp' file extensions


Details

    Description

      When crawling a JCIFS repository to ElasticSearch, SharedDriveConnector.java#checkIngest() always returns false. As a result, nothing is indexed in ElasticSearch.

      This seems to happen because localFile's extension is 'tmp' rather than the original file's extension. The line

      //SharedDriveConnector.java[1842]
      isIndexable = activities.checkDocumentIndexable(localFile); 
      

      always evaluates to 'false'. Adding 'tmp' to the ElasticSearch connector's list of 'allowed file extensions' works around the problem.
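The effect described above can be reproduced in isolation. The following is only a minimal sketch, not ManifoldCF code: the class `ExtensionCheckSketch`, the method `isIndexable`, the file names, and the allow-list contents are all hypothetical stand-ins. It assumes the output connector decides indexability purely from the extension of the file name it is handed.

```java
import java.util.Locale;
import java.util.Set;

public class ExtensionCheckSketch {

    // Hypothetical stand-in for an extension allow-list check.
    // Returns true only if the file name's extension is in the allowed set.
    static boolean isIndexable(String fileName, Set<String> allowedExtensions) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return false; // no extension at all
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return allowedExtensions.contains(ext);
    }

    public static void main(String[] args) {
        // Assumed allow-list; the real connector's list is configurable.
        Set<String> allowed = Set.of("pdf", "doc", "txt");

        String originalName = "report.pdf";            // name on the share (assumed)
        String localCopyName = "report1234567890.tmp"; // name of the local temp copy (assumed)

        // Checking the temp copy fails, because its extension is 'tmp'.
        System.out.println(isIndexable(localCopyName, allowed)); // false
        // Checking the original name succeeds.
        System.out.println(isIndexable(originalName, allowed));  // true
    }
}
```

Under these assumptions, any check made against the temporary copy's name is rejected regardless of the document's real type. That would explain why adding 'tmp' to the allow-list changes the behavior.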


          People

            kwright@metacarta.com Karl Wright
            konradkonrad konrad