Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-805

Unable to resolve the url-blah-blah, skipping

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Incomplete
    • 0.9.0
    • nutchgora
    • fetcher
    • CentOS, Nutch -0.9, jdk1.6.0_18

    Description

      I configured the nutch-0.9 as well as nutch-1.0 to crawl intranet website. The machine access the internet/intranet using proxy i had made this setup in nutch-default.xml

      everything works well untill i run script, when fetcher tries to access the urls from seed gives error as

      unable to resolve www.urladdres.com , skipping
      QueueFeeder finished: total 1 records.
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -finishing thread FetcherThread, activeThreads=0
      -activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0
      -activeThreads=0
      Fetcher: done

      Attachments

        Activity

          People

            Unassigned Unassigned
            patkaustubh86 P Kaustubh
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: