Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-293

support for Crawl-delay in Robots.txt

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.8
    • 0.8
    • fetcher
    • None

    Description

      Nutch need support for Crawl-delay defined in robots.txt, it is not a standard but a de-facto standard.
      See:
      http://help.yahoo.com/help/us/ysearch/slurp/slurp-03.html
      Webmasters start blocking nutch since we do not support it.

      Attachments

        1. crawlDelayv1.patch
          5 kB
          Stefan Groschupf

        Activity

          People

            Unassigned Unassigned
            joa23 Stefan Groschupf
            Votes:
            1 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: