Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1544

Nutch crawls only first site from seed list

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Invalid
    • 1.6
    • 1.6
    • None
    • None
    • Ubuntu 12.04.

    Description

      Nutch crawls only first site from seed list and then finish. It doesn't give any error or something else. I'm leaving my log and regex urlfilter.

      Regards

      Attachments

        1. hadoop.log
          432 kB
          Adam89
        2. regex-urlfilter.txt
          2 kB
          Adam89

        Activity

          People

            Unassigned Unassigned
            adam89 Adam89
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: