  Nutch / NUTCH-2429

Fix Plugin System to allow protocol plugins to bundle their URLStreamHandlers

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.14
    • Fix Version/s: 1.17
    • Component/s: commoncrawl
    • Labels:
      None
    • Environment:

      Tested with both Nutch 1.13 and 1.14 on Ubuntu Linux with OpenJDK 1.8.

      Description

      While trying to use the protocol-smb plugin (which is not part of the Nutch distribution), I realized that four steps are needed to use a protocol plugin successfully:

      1 - put the artifact into the plugins directory
      2 - modify the Nutch configuration files to allow smb:// URLs and add the plugin to the list of loaded plugins
      3 - extract jcifs.jar and place it on the system classpath
      4 - run Nutch with the correct system property

      While steps 1 and 2 seem obvious, steps 3 and 4 require knowledge of the plugin's internals, which does not feel right for Nutch users or plugin users. Moreover, jcifs.jar would then exist twice on the classpath, which could cause further problems at runtime.
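
The sketch below illustrates why steps 3 and 4 are presumably needed, and what bundling a handler inside a plugin could look like. It relies on standard java.net.URL behaviour: a plain `new URL(String)` only finds handlers through JVM-wide mechanisms (for example the `java.protocol.handler.pkgs` system property, which is an assumption about the property meant in step 4), whereas the three-argument constructor accepts a handler supplied by the caller. `SmbUrlDemo` and `DummySmbHandler` are hypothetical names for illustration only and are not part of Nutch, protocol-smb, or jcifs.

```java
import java.io.IOException;
import java.net.MalformedURLException;
import java.net.URL;
import java.net.URLConnection;
import java.net.URLStreamHandler;

public class SmbUrlDemo {

    /** Hypothetical stand-in for a handler that a protocol plugin might bundle. */
    static class DummySmbHandler extends URLStreamHandler {
        @Override
        protected URLConnection openConnection(URL u) throws IOException {
            // A real handler would open an SMB connection here.
            throw new IOException("demo handler: no real SMB support");
        }
    }

    /** True if java.net.URL can parse the spec using only JVM-wide handler lookup. */
    static boolean parsesWithDefaultLookup(String spec) {
        try {
            new URL(spec);
            return true;
        } catch (MalformedURLException e) {
            // Typically "unknown protocol" when no handler is registered.
            return false;
        }
    }

    /** Parses the spec with an explicitly supplied handler, bypassing the JVM-wide lookup. */
    static URL parseWithHandler(String spec, URLStreamHandler handler)
            throws MalformedURLException {
        return new URL(null, spec, handler);
    }

    public static void main(String[] args) throws Exception {
        String spec = "smb://fileserver/share/doc.txt";
        System.out.println("default lookup parses smb: " + parsesWithDefaultLookup(spec));
        URL url = parseWithHandler(spec, new DummySmbHandler());
        System.out.println("explicit handler: " + url.getProtocol() + " host=" + url.getHost());
    }
}
```

If the plugin system handed such a handler to URL construction itself, the handler classes could stay inside the plugin's own class loader, and neither the extra copy of jcifs.jar on the system classpath nor the system property would be required. Whether this fits Nutch's plugin architecture is left open here.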

              People

              • Assignee:
                Unassigned
                Reporter:
                hiranchaudhuri Hiran Chaudhuri
              • Votes:
                0
                Watchers:
                3
