  1. Nutch
  2. NUTCH-2312

Support PhantomJS as a WebDriver in protocol-selenium


Details

    • Type: Bug
    • Status: Closed
    • Priority: Trivial
    • Resolution: Incomplete
    • Affects Version/s: 1.12
    • Fix Version/s: None
    • Component/s: protocol

    Description

      PhantomJS is a headless browser that parallelizes well, which makes it a good fit for use with Nutch via protocol-selenium. The phantomjs JAR already appears to be among the dependencies, and the protocol-selenium source code contains an empty initialization for PhantomJSDriver.

      However, in its current state, protocol-selenium will not fetch any URLs with phantomjs. Configuration must be passed in via a DesiredCapabilities object, and a parameter must be added so that users can set the path to their phantomjs binary in nutch-site.xml.
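
      The configuration step described above could look roughly like the following. This is a minimal sketch, not the plugin's actual code: the property name selenium.phantomjs.binary.path is hypothetical (it is not an existing Nutch setting), and a plain Map stands in for the Hadoop Configuration that Nutch loads from nutch-site.xml. The capability keys are the string values Selenium and GhostDriver use for the browser name, JavaScript support, and the PhantomJS binary location.

      ```java
      import java.util.HashMap;
      import java.util.Map;

      public class PhantomJsCapabilities {

          // Hypothetical nutch-site.xml property name (an assumption, not an existing Nutch setting).
          static final String BINARY_PATH_PROPERTY = "selenium.phantomjs.binary.path";

          // Capability key that GhostDriver reads to locate the phantomjs executable.
          static final String PHANTOMJS_BINARY_CAPABILITY = "phantomjs.binary.path";

          // Builds the raw capability map from configuration values.
          // The Map argument stands in for Nutch's Configuration object.
          static Map<String, Object> fromConfig(Map<String, String> conf) {
              String path = conf.get(BINARY_PATH_PROPERTY);
              if (path == null || path.trim().isEmpty()) {
                  throw new IllegalArgumentException(
                      "Missing property " + BINARY_PATH_PROPERTY + " (path to the phantomjs binary)");
              }
              Map<String, Object> caps = new HashMap<>();
              caps.put("browserName", "phantomjs");
              caps.put("javascriptEnabled", true);
              caps.put(PHANTOMJS_BINARY_CAPABILITY, path.trim());
              return caps;
          }

          public static void main(String[] args) {
              Map<String, String> conf = new HashMap<>();
              conf.put(BINARY_PATH_PROPERTY, "/usr/local/bin/phantomjs");
              System.out.println(fromConfig(conf));
          }
      }
      ```

      Inside the plugin, a map like this could presumably be wrapped with new DesiredCapabilities(caps) and handed to the PhantomJSDriver constructor; failing fast on a missing path avoids a driver that starts but never fetches.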

          People

            Assignee: Unassigned
            Reporter: Joey Hong (jxihong)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated: 48h
                Remaining: 48h
                Logged: Not Specified