Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2482

index-geoip not to add null values to document fields

    XMLWordPrintableJSON

    Details

      Description

      The plugin index-geoip may add null values to document fields which then cause further errors, here a NPE in IndexingFiltersChecker when toString() is called on null:

      $ bin/nutch indexchecker -Dstore.ip.address=true -Dindex.geoip.usage=cityDatabase \
           -Dplugin.includes="protocol-http|parse-html|index-(basic|geoip)" http://www.example.com/
      ...
      Exception in thread "main" java.lang.NullPointerException
              at org.apache.nutch.indexer.IndexingFiltersChecker.fetch(IndexingFiltersChecker.java:340)
              at org.apache.nutch.indexer.IndexingFiltersChecker.run(IndexingFiltersChecker.java:127)
              at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
              at org.apache.nutch.indexer.IndexingFiltersChecker.main(IndexingFiltersChecker.java:370)
      

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                snagel Sebastian Nagel
                Reporter:
                snagel Sebastian Nagel
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: