Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-3107

Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.6, 4.0-ALPHA
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: contrib - LangId
    • Labels:
      None

      Description

      The language-detection library used by LangDetectLanguageIdentifierUpdateProcessor uses a random sampling feature enabled by default as a means of avoiding local noise in input. The feature has its merits, but it can also be confusing to users who aren't aware of it since it may give different on the same input. I recommend turning it off to prevent confusion.

      1. SOLR-3107.patch
        0.9 kB
        Christian Moen

        Activity

        Hide
        cm Christian Moen added a comment -

        Attached a trivial patch tested on trunk.

        Show
        cm Christian Moen added a comment - Attached a trivial patch tested on trunk .
        Hide
        rcmuir Robert Muir added a comment -

        +1, i neglected to do this when initially adding this... lets fix this.

        Show
        rcmuir Robert Muir added a comment - +1, i neglected to do this when initially adding this... lets fix this.
        Hide
        rcmuir Robert Muir added a comment -

        Thanks Christian!

        Show
        rcmuir Robert Muir added a comment - Thanks Christian!

          People

          • Assignee:
            rcmuir Robert Muir
            Reporter:
            cm Christian Moen
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development