Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Auto Closed
-
2.3.1
-
Nutch-2.3.1
Hbase-1.2.3
Hadoo- 2.5.2
Description
When creating crawl jobs using nutch webapp, seed directory gets created in temp (/tmp on linux) directory in local filesystem. This prevents crawl job to inject urls. As injection of url fails, no further phases of crawl can be executed. Seed Directory needs to be created on HDFS in case of Nutch running in deploy mode.