Description
Based on discussion on the dev list, there are valid security research use cases for Nutch (DDoS, DNS, and other testing) that require ignoring robot rules for specific hosts. I am going to create a patch that allows a whitelist:
<property>
  <name>robot.rules.whitelist</name>
  <value>132.54.99.22,hostname.apache.org,foo.jpl.nasa.gov</value>
  <description>Comma-separated list of hostnames or IP addresses for which robot rules parsing is ignored.</description>
</property>
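To make the intended behavior concrete, here is a minimal sketch of how a comma-separated value like the one above could be parsed and checked. The class name RobotRulesWhitelist and the method isWhitelisted are hypothetical and chosen only for illustration; this is not the code from the patch, and it assumes exact, case-insensitive matching of hostnames and IP strings with no wildcard or DNS resolution.

```java
import java.util.Arrays;
import java.util.Locale;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical helper for illustration only; not the class from the actual patch.
final class RobotRulesWhitelist {
    private final Set<String> entries;

    // commaSeparated is the raw value of the robot.rules.whitelist property.
    RobotRulesWhitelist(String commaSeparated) {
        String raw = commaSeparated == null ? "" : commaSeparated;
        // Split on commas, trim whitespace, drop empty entries, normalize case.
        this.entries = Arrays.stream(raw.split(","))
                .map(String::trim)
                .filter(s -> !s.isEmpty())
                .map(s -> s.toLowerCase(Locale.ROOT))
                .collect(Collectors.toSet());
    }

    // Returns true if robot rules parsing should be skipped for this host or IP.
    // Assumption: exact string match only (no wildcards, no DNS lookups).
    boolean isWhitelisted(String hostOrIp) {
        return hostOrIp != null
                && entries.contains(hostOrIp.trim().toLowerCase(Locale.ROOT));
    }
}
```

A fetcher could consult such a helper before fetching robots.txt and fall back to normal robot rules handling for every host that is not listed.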
Issue Links
- is required by NUTCH-1992 Port whitelist from NUTCH-1927 to 2.x (Closed)