Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2141

Change the InteractiveSelenium plugin handler Interface to return page content

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.11
    • plugin

    Description

      The handler interface in the protocol-interactiveselenium plugin currently provide methods to manipulate the page content and the HTTPResponse class read's the page content from the driver. This limits the amount of HTML content that could be returned to nutch.

      The processDriver method could return a String object instead. This is particularly helpful in cases such as handling pagination when multiple pages' content can be appended and returned from the handler.

      Attachments

        Activity

          People

            chrismattmann Chris A. Mattmann
            balajig17 Balaji Gurumurthy
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: