Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-69

fetcher.threads.per.host ignored

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Invalid
    • None
    • None
    • fetcher
    • None

    Description

      Fetcher ignores 'maximum threads per host'.
      If you fetch less domains with multiple threads, some webservers feel attacked or could not serve you any more.
      So you loose lots of existing pages in your segments.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jaekle Matthias Jaekle
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: