Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1544

Nutch crawls only first site from seed list

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Invalid
    • Affects Version/s: 1.6
    • Fix Version/s: 1.6
    • Component/s: None
    • Labels:
      None
    • Environment:

      Ubuntu 12.04.

      Description

      Nutch crawls only first site from seed list and then finish. It doesn't give any error or something else. I'm leaving my log and regex urlfilter.

      Regards

        Attachments

        1. regex-urlfilter.txt
          2 kB
          Adam89
        2. hadoop.log
          432 kB
          Adam89

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              adam89 Adam89

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment