Nutch / NUTCH-2131

Problem running Nutch crawl with Selenium

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Do
    • Affects Version/s: 1.10
    • Fix Version/s: None
    • Component/s: nutch server
    • Labels: None
    • Environment: Ubuntu 12.04 32-bit

    Description

      Hello,

      I ran into issues running Selenium with Nutch on Ubuntu.
      I am following the instructions for installing the Nutch Selenium plugin at https://github.com/apache/nutch/tree/trunk/src/plugin/protocol-selenium
      I was able to include the plugin and rebuild Nutch successfully.

      During the crawl, however, I get the error "Unable to connect to host 127.0.0.1 on port 7055 after 45000 ms".
      From what I have read, I suspect that my Firefox version and the Selenium jars are incompatible, but I am not sure that this is the cause.
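
      One way to narrow this down is to check, while the crawl is waiting, whether anything is listening on the port named in the error. The sketch below is a generic TCP probe and is not part of Nutch or Selenium; the function name `port_is_open` and the 2-second timeout are illustrative choices. It assumes that 7055 is the port the Selenium 2 Firefox driver extension listens on, which matches the error message. If the port stays closed, the likely explanations are that Firefox did not start at all (for example, no display on a server) or that it started without loading the driver extension.

```python
import socket


def port_is_open(host: str = "127.0.0.1", port: int = 7055, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection raises OSError (refused, timed out, unreachable) on failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Run this in a second terminal while the Nutch fetch step is waiting on Firefox.
    print("127.0.0.1:7055 reachable:", port_is_open())
```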

      I downgraded Firefox from version 41 to version 33, but I still get the same error.
      Is there a specific Firefox version that I need to install, or is the problem something else?

      I am using Nutch 1.10 with the Selenium libraries that are integrated in it.

      I have also used the standalone Selenium 2.44.0 release with Firefox 33, and there everything works fine.
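
      Since the standalone 2.44.0 release works with Firefox 33, it may help to compare that version with the Selenium jars that the Nutch build actually uses. The sketch below only parses jar file names of the form `selenium-<module>-<version>.jar`; the function name `selenium_jar_versions` is hypothetical, and the default directory `runtime/local/plugins` is an assumption about where a local Nutch 1.x build places its plugin jars, so pass the real path if it differs.

```python
import re
import sys
from pathlib import Path

# Matches e.g. "selenium-java-2.44.0.jar" -> ("selenium-java", "2.44.0").
JAR_PATTERN = re.compile(r"^(selenium-[a-z-]+?)-(\d+(?:\.\d+)*)\.jar$")


def selenium_jar_versions(root):
    """Map each Selenium module found under `root` to the version in its jar name."""
    found = {}
    for jar in Path(root).rglob("selenium-*.jar"):
        match = JAR_PATTERN.match(jar.name)
        if match:
            found[match.group(1)] = match.group(2)
    return found


if __name__ == "__main__":
    # Assumed default location; override with the first command-line argument.
    plugin_dir = sys.argv[1] if len(sys.argv) > 1 else "runtime/local/plugins"
    for module, version in sorted(selenium_jar_versions(plugin_dir).items()):
        print(f"{module}: {version}")
```

      If the versions reported here are older than 2.44.0, that would be consistent with the suspected Firefox and Selenium mismatch, though it does not prove it.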

      Please help me with this.

          People

            Assignee: Unassigned
            Reporter: Ashwini (ashwiniramesha)
            Votes: 1
            Watchers: 4
