Description
In crawl script you can find something likeĀ
if [[ -d "$CRAWL_PATH"/hostdb ]]; then
echo "Processing sitemaps based on hosts in HostDB"
__bin_nutch sitemap "$CRAWL_PATH"/crawldb -hostdb "$CRAWL_PATH"/hostdb -threads $NUM_THREADS
fi
if [[ -d "$CRAWL_PATH"/hostdb ]]; doesnt work for HDFS only for local mode.
Attachments
Issue Links
- blocks
-
NUTCH-2510 Crawl script modification. HostDb : generate, optional usage and description
- Closed
- links to