Nutch / NUTCH-2341

bin/crawl does not fetch the batchId generated by the bash script

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Auto Closed
    • Affects Version/s: 2.3.1
    • Fix Version/s: 2.5
    • Component/s: bin
    • Labels:
      None
    • Environment:

      Nutch2.3.1 + mongodb

      Description

      When I use bin/crawl to crawl URLs, no data is returned. However, when I run the steps one by one with bin/nutch, the fetch succeeds.

      bin/nutch generate -topN 10 -crawlId nutch -batchId "12345-123"
      bin/nutch fetch "1482737147-29630548" -crawlId nutch -threads 20 # "1482737147-29630548" is the batch ID generated by 'bin/nutch generate'; if the batchId "12345-123" is used here instead, as bin/crawl does, no data is returned.
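
      The mismatch reported above can be illustrated with a minimal sketch. It assumes a bash shell; `make_batch_id` and `batch_id` are hypothetical names that are not part of Nutch, and the ID format is only modeled on the generated ID quoted in this report. The script prints the two commands rather than running them, so the shared ID passed to both steps is easy to inspect. According to this report, that pairing is the one that returns no data.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of Nutch): builds an ID shaped like
# <epoch-seconds>-<random number>, modeled on the ID quoted in the report.
make_batch_id() {
  printf '%s-%s\n' "$(date +%s)" "$RANDOM"
}

batch_id="$(make_batch_id)"

# Print (do not run) the two steps with the same script-chosen ID.
# Per the report, fetch finds nothing when generate assigns its own ID instead.
echo "bin/nutch generate -topN 10 -crawlId nutch -batchId \"$batch_id\""
echo "bin/nutch fetch \"$batch_id\" -crawlId nutch -threads 20"
```

      Comparing the printed ID with the batch ID that 'bin/nutch generate' actually reports should show whether the -batchId argument was honored.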
      

            People

            • Assignee:
              Unassigned
            • Reporter:
              WallaceXia YunXia
            • Votes:
              0
            • Watchers:
              2

              Dates

              • Created:
                Updated:
                Resolved: