  1. Solr
  2. SOLR-3107

Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.6, 4.0-ALPHA
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: contrib - LangId
    • Labels:
      None

      Description

      The language-detection library used by LangDetectLanguageIdentifierUpdateProcessor enables a random sampling feature by default as a means of avoiding local noise in the input. The feature has its merits, but it can confuse users who aren't aware of it, since it may give different results for the same input. I recommend turning it off to prevent confusion.
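
      To illustrate why sampling can give different answers for the same input, here is a minimal sketch in plain Java. It is not the language-detection library's code: SamplingDemo and sampleScore are hypothetical names, and the "score" is a toy stand-in for a real detector. It only shows that a result derived from random sampling is repeatable when the random source is fixed (here by a constant seed) and may vary when it is not. Whether the attached patch uses a fixed seed or some other mechanism is not stated in this issue.

```java
import java.util.Random;

// Hypothetical toy example; not part of Solr or the language-detection library.
class SamplingDemo {

    // Toy stand-in for a sampling-based detector: averages the character
    // codes found at randomly chosen positions of the text.
    static double sampleScore(String text, int trials, Random rng) {
        double sum = 0;
        for (int i = 0; i < trials; i++) {
            sum += text.charAt(rng.nextInt(text.length()));
        }
        return sum / trials;
    }

    public static void main(String[] args) {
        String input = "the same input every time";

        // Fixed seed: both runs draw the same positions, so the scores match.
        double a = sampleScore(input, 5, new Random(0L));
        double b = sampleScore(input, 5, new Random(0L));
        System.out.println("seeded runs equal: " + (a == b));

        // No fixed seed: each run draws different positions, so the scores
        // may differ between runs on identical input.
        double c = sampleScore(input, 5, new Random());
        double d = sampleScore(input, 5, new Random());
        System.out.println("unseeded runs: " + c + " vs " + d);
    }
}
```

      With the seed fixed, the first line printed is always "seeded runs equal: true". The unseeded values usually change from one run to the next, which is the behavior this issue describes as confusing.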

        Attachments

        1. SOLR-3107.patch
          0.9 kB
          Christian Moen

            People

            • Assignee:
              rcmuir Robert Muir
              Reporter:
              cm Christian Moen