Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2751

nutch clean does not work with secured solr cloud

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 1.16
    • 1.17
    • indexer
    • None

    Description

      I am calling nutch clean to remove 404 entries from Solr, but fail with exception below.

      Adding and updating entries is working fine. Hence, index-writer config seems to be correct in general.

      Identical behaviour in 1.15 and 1.16, although SolrIndexWriter.java has been modified for delete case.

      No more ideas, where to look at....

       

      2019-11-01 14:45:55,664 INFO solr.SolrIndexWriter - SolrIndexer: deleting 14/14 documents
      2019-11-01 14:45:55,768 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      , retry? 0
      2019-11-01 14:45:55,780 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      , retry? 1
      2019-11-01 14:45:55,858 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      , retry? 2
      2019-11-01 14:45:55,887 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      , retry? 3
      2019-11-01 14:45:55,903 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      , retry? 4
      2019-11-01 14:45:55,938 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      , retry? 5
      2019-11-01 14:45:55,938 DEBUG concurrent.ExecutorHelper - afterExecute in thread: pool-4-thread-1, runnable type: java.util.concurrent.FutureTask
      2019-11-01 14:45:55,940 INFO mapred.LocalJobRunner - reduce task executor complete.
      2019-11-01 14:45:55,941 WARN mapred.LocalJobRunner - job_local2086525572_0001
      java.lang.Exception: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
      at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:491)
      at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:558)
      Caused by: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
      at org.apache.solr.client.solrj.impl.CloudSolrClient.directUpdate(CloudSolrClient.java:553)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.sendRequest(CloudSolrClient.java:1014)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:885)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.request(CloudSolrClient.java:818)
      at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1219)
      at org.apache.nutch.indexwriter.solr.SolrIndexWriter.push(SolrIndexWriter.java:270)
      at org.apache.nutch.indexwriter.solr.SolrIndexWriter.commit(SolrIndexWriter.java:214)
      at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:205)
      at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:257)
      at org.apache.nutch.indexer.CleaningJob$DeleterReducer.cleanup(CleaningJob.java:115)
      at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:179)
      at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:627)
      at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:389)
      at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:346)
      at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      at java.lang.Thread.run(Thread.java:748)
      Caused by: org.apache.solr.client.solrj.SolrServerException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
      at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:657)
      at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:255)
      at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:244)
      at org.apache.solr.client.solrj.impl.LBHttpSolrClient.doRequest(LBHttpSolrClient.java:483)
      at org.apache.solr.client.solrj.impl.LBHttpSolrClient.request(LBHttpSolrClient.java:413)
      at org.apache.solr.client.solrj.impl.CloudSolrClient.lambda$directUpdate$0(CloudSolrClient.java:528)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:188)
      ... 3 more
      Caused by: org.apache.http.client.ClientProtocolException
      at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:187)
      at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:83)
      at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:56)
      at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:542)
      ... 10 more
      Caused by: org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
      at org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:226)
      at org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:185)
      at org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:89)
      at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:111)
      at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:185)
      ... 13 more

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              dhammling Daniel Hammling
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: