Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-2903

MalformedURLException logging in TikaEntityProcessor

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Incomplete
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:

      Description

      When using TikaEntityProcessor to fetch only certain documents, the logging is filled with SEVERE exceptions.

      There should be a way to handle this exception with a lot less logging.

      17-nov-2011 15:23:34 org.apache.solr.handler.dataimport.BinURLDataSource getData
      SEVERE: Exception thrown while getting data
      java.net.MalformedURLException: no protocol: null
      at java.net.URL.<init>(URL.java:567)
      at java.net.URL.<init>(URL.java:464)
      at java.net.URL.<init>(URL.java:413)
      at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:80)
      at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:37)
      at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:102)
      at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:642)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:311)
      at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:222)
      at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
      at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:440)
      at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:421)
      17-nov-2011 15:23:34 org.apache.solr.common.SolrException log
      SEVERE: Exception in entity : tika:org.apache.solr.handler.dataimport.DataImportHandlerException: Exception in invoking url null Processing Document # 1445
      at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
      at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:88)
      at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:37)
      at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:102)
      at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:642)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
      at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:311)
      at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:222)
      at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
      at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:440)
      at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:421)
      Caused by: java.net.MalformedURLException: no protocol: null
      at java.net.URL.<init>(URL.java:567)
      at java.net.URL.<init>(URL.java:464)
      at java.net.URL.<init>(URL.java:413)
      at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:80)
      ... 13 more

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              okkeklein Okke Klein
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: