Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-805

Unable to resolve the url-blah-blah, skipping

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: 0.9.0
    • Fix Version/s: nutchgora
    • Component/s: fetcher
    • Labels:
    • Environment:

      CentOS, Nutch -0.9, jdk1.6.0_18

      Description

      I configured the nutch-0.9 as well as nutch-1.0 to crawl intranet website. The machine access the internet/intranet using proxy i had made this setup in nutch-default.xml

      everything works well untill i run script, when fetcher tries to access the urls from seed gives error as

      unable to resolve www.urladdres.com , skipping
      QueueFeeder finished: total 1 records.
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0
      -activeThreads=0
      Fetcher: done

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              patkaustubh86 P Kaustubh
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: