Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2341

bin/crawl do not fetch batchId generated by bash script

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Auto Closed
    • 2.3.1
    • 2.5
    • bin
    • None
    • Nutch2.3.1 + mongodb

    Description

      I use bin/crawl to crawl url with no data returned, however use bin/nutch step by step fetch, it successed.

      bin/nutch generate -topN 10 -crawlId nutch -batchId  "12345-123"
      bin/nutch fetch "1482737147-29630548" -crawlId nutch -threads 20 # "1482737147-29630548" is generated by 'bin/nutch geneate', here if use batchId "12345-123" as bin/crawl do, then no data returned.
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            WallaceXia YunXia
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: