Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-1120

ElasticSearch connector document deletion fails when document already deleted

    XMLWordPrintableJSON

Details

    Description

      This is the exception that is thrown:

      ERROR 2014-12-12 14:45:24,915 (Document cleanup thread '2') - Exception tossed: Server/page not found
      org.apache.manifoldcf.core.interfaces.ManifoldCFException: Server/page not found
              at org.apache.manifoldcf.agents.output.elasticsearch.ElasticSearchConnection.handleResultCode(ElasticSearchConnection.java:234)
              at org.apache.manifoldcf.agents.output.elasticsearch.ElasticSearchConnection.call(ElasticSearchConnection.java:203)
              at org.apache.manifoldcf.agents.output.elasticsearch.ElasticSearchDelete.execute(ElasticSearchDelete.java:45)
              at org.apache.manifoldcf.agents.output.elasticsearch.ElasticSearchConnector.removeDocument(ElasticSearchConnector.java:578)
              at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.removeDocument(IncrementalIngester.java:2350)
              at org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.documentDeleteMultiple(IncrementalIngester.java:1059)
              at org.apache.manifoldcf.crawler.system.DocumentCleanupThread.run(DocumentCleanupThread.java:189)
      

      The problem is that a 404 error is returned, and it is treated as fatal by the ES connector, but that's wrong according to the REST API spec:

      http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/delete-doc.html

      Attachments

        1. CONNECTORS-1120.patch
          2 kB
          Karl Wright

        Activity

          People

            kwright@metacarta.com Karl Wright
            kwright@metacarta.com Karl Wright
            Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: