Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-11231

NullPointerException in TikaLanguageIdentifierUpdateProcessor.getExpectedSize when performing language identification using unset fields

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 6.6, 7.0
    • 7.2, 8.0
    • contrib - LangId
    • None

    Description

      If I configure the TikaLanguageIdentifierUpdateProcessorFactory that includes a `langid.fl` that isn't set on the document, I get a NPE:

      {{java.lang.NullPointerException
      at org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessor.getExpectedSize(TikaLanguageIdentifierUpdateProcessor.java:116)
      at org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessor.concatFields(TikaLanguageIdentifierUpdateProcessor.java:76)
      at org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessor.detectLanguage(TikaLanguageIdentifierUpdateProcessor.java:53)
      at org.apache.solr.update.processor.LanguageIdentifierUpdateProcessor.process(LanguageIdentifierUpdateProcessor.java:219)}}

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            sarowe Steven Rowe
            cbeer Chris Beer
            Votes:
            1 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment