Apache Any23 (Retired) / ANY23-351

NullPointerException in HCardExtractor

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.3
    • Fix Version/s: 2.3
    • Component/s: microformats
    • Labels: None

    Description

      When extracting from the URL https://cambridgewi.com/make-cambridge-home/char/V/
      I get the following NullPointerException, which aborts the entire extraction process:

      java.lang.NullPointerException
      	at org.apache.any23.extractor.html.HTMLDocument.readUrlField(HTMLDocument.java:119)
      	at org.apache.any23.extractor.html.HTMLDocument.getPluralUrlField(HTMLDocument.java:288)
      	at org.apache.any23.extractor.html.HCardExtractor.addLogo(HCardExtractor.java:267)
      	at org.apache.any23.extractor.html.HCardExtractor.extractEntity(HCardExtractor.java:130)
      	at org.apache.any23.extractor.html.EntityBasedMicroformatExtractor.extract(EntityBasedMicroformatExtractor.java:66)
      	at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:102)
      	at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:44)
      	at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:480)
      	at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:259)
      	at org.apache.any23.Any23.extract(Any23.java:302)
      	at org.apache.any23.Any23.extract(Any23.java:437)
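
The report does not state what is null at HTMLDocument.java:119. One plausible cause, assumed here purely for illustration, is an hCard logo element that lacks the URL-bearing attribute (for example an img without src), so that the DOM attribute lookup returns null and is then dereferenced. The sketch below shows a null-safe lookup of that kind using only the JDK's org.w3c.dom API. The names UrlFieldSketch, readUrl, and parseRoot are hypothetical and are not part of Any23.

```java
import java.io.StringReader;
import java.util.Optional;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NamedNodeMap;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical illustration only; this is NOT Any23's HTMLDocument code.
final class UrlFieldSketch {

    // Returns the URL-bearing attribute of a node ("src" for img, "href"
    // otherwise), or Optional.empty() when the node or attribute is absent.
    static Optional<String> readUrl(Node node) {
        if (node == null) {
            return Optional.empty();
        }
        // getAttributes() returns null for non-element nodes such as text.
        NamedNodeMap attrs = node.getAttributes();
        if (attrs == null) {
            return Optional.empty();
        }
        String attrName = "img".equalsIgnoreCase(node.getNodeName()) ? "src" : "href";
        // getNamedItem() returns null when the attribute is missing;
        // dereferencing it without this check would throw an NPE.
        Node attr = attrs.getNamedItem(attrName);
        if (attr == null) {
            return Optional.empty();
        }
        return Optional.ofNullable(attr.getNodeValue());
    }

    // Parses a small well-formed XML snippet and returns its root element.
    static Element parseRoot(String xml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)))
                    .getDocumentElement();
        } catch (Exception e) {
            throw new IllegalStateException("could not parse snippet", e);
        }
    }

    public static void main(String[] args) {
        // A logo element with a src attribute, and one without.
        System.out.println(readUrl(parseRoot("<img class=\"logo\" src=\"logo.png\"/>")));
        System.out.println(readUrl(parseRoot("<img class=\"logo\"/>")));
    }
}
```

With a guard like this, a caller can skip a logo that has no URL instead of letting the exception propagate up through the extractor. Whether the actual fix in 2.3 took this form is not stated in the report.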
      

            People

              Assignee: Hans Brende (hansbrende)
              Reporter: Hans Brende (hansbrende)