Description
I configured both nutch-0.9 and nutch-1.0 to crawl an intranet website. The machine accesses the internet/intranet through a proxy, and I set this up in nutch-default.xml.
Everything works well until I run the script. When the fetcher tries to access the URLs from the seed list, it gives this error:
unable to resolve www.urladdres.com , skipping
QueueFeeder finished: total 1 records.
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-finishing thread FetcherThread, activeThreads=0
-activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=0
Fetcher: done
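
For reference, the proxy setup mentioned above is along these lines. This is only a sketch: `http.proxy.host` and `http.proxy.port` are the proxy properties in nutch-default.xml, but the host name and port shown here are placeholders, not the real values from my machine.

```xml
<!-- Proxy settings as placed in nutch-default.xml.
     proxy.example.com and 8080 are placeholder values. -->
<property>
  <name>http.proxy.host</name>
  <value>proxy.example.com</value>
</property>
<property>
  <name>http.proxy.port</name>
  <value>8080</value>
</property>
```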