Description
Introduces new configuration file for mapping protocol plugins to hostnames.
# This file defines a hostname to protocol plugin mapping. Each line takes a # host name followed by a tab, followed by the ID of the protocol plugin. You # can find the ID in the protocol plugin's plugin.xml file. # # <hostname>\t<plugin_id>\n # nutch.apache.org org.apache.nutch.protocol.httpclient.Http # tika.apache.org org.apache.nutch.protocol.http.Http #
Attachments
Attachments
Issue Links
- contains
-
NUTCH-2653 ProtocolFactory.getProtocol(url) creates separate plugin instances for http/https
- Closed
- duplicates
-
NUTCH-2126 Use selenium protocol for specific sites
- Closed
- links to