  1. Nutch
  2. NUTCH-2664

Nutch WebApp running in deploy mode creates seed directory in local filesystem


    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Auto Closed
    • Affects Version/s: 2.3.1
    • Fix Version/s: 2.5
    • Component/s: REST_api, web gui
    • Environment:

      Nutch-2.3.1

      Hbase-1.2.3

      Hadoop-2.5.2

       

      Description

      When a crawl job is created through the Nutch webapp, the seed directory is created in the temp directory (/tmp on Linux) of the local filesystem. This prevents the crawl job from injecting URLs, and because injection fails, none of the later crawl phases can run. When Nutch runs in deploy mode, the seed directory needs to be created on HDFS instead.
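
      The requested behaviour can be sketched as a small decision on the default filesystem URI. This is only an illustration, not the Nutch webapp code: the class SeedDirResolver, the method resolveSeedDir, the host namenode:8020 and the /tmp/<name> layout are all hypothetical, and the defaultFs argument merely stands in for the value of Hadoop's fs.defaultFS setting. A real fix would presumably create the directory through Hadoop's FileSystem API rather than only computing a URI.

```java
import java.io.File;
import java.net.URI;

/** Hypothetical helper for illustration; not part of Nutch. */
final class SeedDirResolver {

    /**
     * Returns the URI of the directory that should hold the seed list.
     * defaultFs stands in for the configured default filesystem
     * (assumption: the value of fs.defaultFS in a Hadoop deployment).
     */
    static URI resolveSeedDir(URI defaultFs, String seedName) {
        String scheme = defaultFs.getScheme();
        if (scheme == null || scheme.equals("file")) {
            // Local mode: a directory under the JVM temp dir is visible to the job.
            String tmp = System.getProperty("java.io.tmpdir");
            return new File(tmp, seedName).toURI();
        }
        // Deploy mode: keep the scheme and authority of the default filesystem
        // so the seed directory lands on the shared filesystem, not on local disk.
        return defaultFs.resolve("/tmp/" + seedName);
    }

    public static void main(String[] args) {
        // Hypothetical NameNode address, used only for the demonstration.
        System.out.println(resolveSeedDir(URI.create("hdfs://namenode:8020"), "seedlist-1"));
        System.out.println(resolveSeedDir(URI.create("file:///"), "seedlist-1"));
    }
}
```

      With an hdfs:// default filesystem the helper returns a location on that filesystem; with a file:// one it falls back to the local temp directory, which is the behaviour the report describes as correct only for local mode.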


            People

            • Assignee: Unassigned
            • Reporter: Gajanan Watkar (gajananwatkar)
            • Votes: 0
            • Watchers: 2
