Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-3107

Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 3.6, 4.0-ALPHA
    • 3.6, 4.0-ALPHA
    • contrib - LangId
    • None

    Description

      The language-detection library used by LangDetectLanguageIdentifierUpdateProcessor uses a random sampling feature enabled by default as a means of avoiding local noise in input. The feature has its merits, but it can also be confusing to users who aren't aware of it since it may give different on the same input. I recommend turning it off to prevent confusion.

      Attachments

        1. SOLR-3107.patch
          0.9 kB
          Christian Moen

        Activity

          People

            rcmuir Robert Muir
            cm Christian Moen
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: