Description
This patch contains a new utility which allows to check the configuration of the indexing filters. The IndexingFiltersChecker reads and parses a URL and run the indexers on it. Displays the fields obtained and the first
100 characters of their value.
Can be used e.g. ./nutch org.apache.nutch.indexer.IndexingFiltersChecker http://www.lemonde.fr/
Attachments
Attachments
Issue Links
- relates to
-
NUTCH-1038 Port IndexingFiltersChecker to 2.0
- Closed