Nutch / NUTCH-2124

Fetcher keeps following a redirect to the same link until the max redirect count is exceeded, and the URL ends up as db_gone


Details

    • Patch Available
    • Important

    Description

      Hello, following redirects is not working in trunk. Please see the log below.

      Fetcher: throughput threshold retries: 5
      fetcher.maxNum.threads can't be < than 50 : using 50 instead
      -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=1
      -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=1

      fetching http://www.wikipedia.com/wiki/URL_redirection (queue crawl delay=5000ms)
      fetching http://www.wikipedia.com/wiki/URL_redirection (queue crawl delay=5000ms)
      -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=2
      fetching http://www.wikipedia.com/wiki/URL_redirection (queue crawl delay=5000ms)
      -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=2
      fetching http://www.wikipedia.com/wiki/URL_redirection (queue crawl delay=5000ms)
      fetching http://www.wikipedia.com/wiki/URL_redirection (queue crawl delay=5000ms)
      -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=2
      -activeThreads=1, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=2
      - redirect count exceeded http://www.wikipedia.com/wiki/URL_redirection

      Thread FetcherThread has no more work available
      -finishing thread FetcherThread, activeThreads=0
      -activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0, fetchQueues.getQueueCount=2
      -activeThreads=0
      Fetcher: finished at 2015-09-28 19:32:05, elapsed: 00:00:09
      Parsing : 20150928193153
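
      The log above shows the same URL being fetched repeatedly until the redirect count is exceeded. As a rough illustration only (this is not Nutch's fetcher code; `RedirectSketch`, `follow`, `Outcome`, and the example.com URLs are hypothetical names invented for this sketch), a redirect follower can tell a redirect back to an already seen URL apart from a genuinely long chain by remembering which URLs it has visited:

```java
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

public class RedirectSketch {

    /** Possible outcomes of following a redirect chain (hypothetical names). */
    enum Outcome { FETCHED, REDIRECT_LOOP, REDIRECT_COUNT_EXCEEDED }

    /**
     * Follows redirects starting at startUrl.
     *
     * The redirects map is an assumed stand-in for the network:
     * key = URL, value = the URL it redirects to. A URL with no entry
     * is treated as a page that was fetched successfully.
     */
    static Outcome follow(String startUrl, Map<String, String> redirects, int maxRedirects) {
        Set<String> seen = new HashSet<>();
        String url = startUrl;
        int redirectCount = 0;

        while (true) {
            // add() returns false if the URL was already visited in this chain.
            if (!seen.add(url)) {
                return Outcome.REDIRECT_LOOP;
            }
            String target = redirects.get(url);
            if (target == null) {
                return Outcome.FETCHED; // no redirect: content reached
            }
            if (++redirectCount > maxRedirects) {
                return Outcome.REDIRECT_COUNT_EXCEEDED;
            }
            url = target; // continue with the redirect target, not the original URL
        }
    }

    public static void main(String[] args) {
        String a = "http://example.com/a";
        String b = "http://example.com/b";
        String c = "http://example.com/c";
        String d = "http://example.com/d";
        String e = "http://example.com/e";

        // A URL that redirects to itself is reported as a loop right away.
        System.out.println(follow(a, Map.of(a, a), 3));                    // REDIRECT_LOOP

        // Four distinct redirects with a limit of three exceed the count.
        System.out.println(follow(a, Map.of(a, b, b, c, c, d, d, e), 3));  // REDIRECT_COUNT_EXCEEDED

        // A single redirect to a normal page succeeds.
        System.out.println(follow(a, Map.of(a, b), 3));                    // FETCHED
    }
}
```

      Separating the two failure outcomes is a design choice of this sketch: a "redirect count exceeded" message alone, as in the log, does not say whether the fetcher walked a long chain or kept requesting the same URL.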

      Attachments

        1. NUTCH-2124.patch
          2 kB
          Sebastian Nagel

          People

            snagel Sebastian Nagel
            soniyk40 Yogendra Kumar Soni
            Votes: 0
            Watchers: 4

            Dates

              Created:
              Updated:
              Resolved: