Nutch / NUTCH-2482

index-geoip not to add null values to document fields

Details

    Description

      The index-geoip plugin may add null values to document fields, which then cause further errors. For example, IndexingFiltersChecker throws a NullPointerException when toString() is called on a null field value:

      $ bin/nutch indexchecker -Dstore.ip.address=true -Dindex.geoip.usage=cityDatabase \
           -Dplugin.includes="protocol-http|parse-html|index-(basic|geoip)" http://www.example.com/
      ...
      Exception in thread "main" java.lang.NullPointerException
              at org.apache.nutch.indexer.IndexingFiltersChecker.fetch(IndexingFiltersChecker.java:340)
              at org.apache.nutch.indexer.IndexingFiltersChecker.run(IndexingFiltersChecker.java:127)
              at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
              at org.apache.nutch.indexer.IndexingFiltersChecker.main(IndexingFiltersChecker.java:370)
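
      The failure mode and one possible guard can be sketched in plain Java. This is only an illustration and not the actual fix for this issue: the Doc class, the addIfNotNull helper, the dump method, and the field names are all hypothetical stand-ins, and the real plugin works on a NutchDocument rather than a map.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class NullSafeFieldSketch {

  // Hypothetical stand-in for an index document: field name -> list of values.
  static final class Doc {
    final Map<String, List<Object>> fields = new LinkedHashMap<>();

    void add(String name, Object value) {
      fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }
  }

  // Hypothetical guard: add the value only if it is non-null.
  // Returns true if the field was added, false if it was skipped.
  static boolean addIfNotNull(Doc doc, String name, Object value) {
    if (value == null) {
      return false;
    }
    doc.add(name, value);
    return true;
  }

  // Mimics a checker that calls toString() on every field value.
  // A null value stored in the document would throw a NullPointerException here.
  static String dump(Doc doc) {
    StringBuilder sb = new StringBuilder();
    for (Map.Entry<String, List<Object>> e : doc.fields.entrySet()) {
      for (Object v : e.getValue()) {
        sb.append(e.getKey()).append(" :\t").append(v.toString()).append('\n');
      }
    }
    return sb.toString();
  }

  public static void main(String[] args) {
    Doc doc = new Doc();
    // Illustrative field names and values (assumptions, not taken from the plugin).
    addIfNotNull(doc, "ip", "192.0.2.1");
    addIfNotNull(doc, "cityName", null); // lookup returned nothing: field is skipped
    System.out.print(dump(doc));         // safe: no null values were stored
  }
}
```

      With a guard of this kind at the point where the plugin adds fields, a lookup that returns no value leaves the field out of the document, so downstream code that calls toString() on field values never sees a null.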
      

            People

              snagel Sebastian Nagel
              snagel Sebastian Nagel
