Nutch / NUTCH-1390

readdb -url $url throws NPE with gora-cassandra

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: nutchgora
    • Fix Version/s: 2.2
    • Component/s: crawldb
    • Labels: None

    Description

      After successfully injecting, generating, fetching (without parsing enabled), parsing, and updating the db, executing readdb with a particular -url argument gives me a lovely NPE:

      lewis@lewis:~/ASF/nutchgora/runtime/local$ ./bin/nutch readdb -url http://www.trancearoundtheworld.com
      WebTableReader: java.lang.NullPointerException
      	at org.apache.gora.cassandra.store.CassandraClient.getFamilyMap(CassandraClient.java:220)
      	at org.apache.gora.cassandra.store.CassandraStore.execute(CassandraStore.java:108)
      	at org.apache.nutch.crawl.WebTableReader.read(WebTableReader.java:234)
      	at org.apache.nutch.crawl.WebTableReader.run(WebTableReader.java:476)
      	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
      	at org.apache.nutch.crawl.WebTableReader.main(WebTableReader.java:412)
      

          People

            Assignee: Unassigned
            Reporter: Lewis John McGibbney (lewismc)
            Votes: 0
            Watchers: 1