Solr
  1. Solr
  2. SOLR-3107

Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: 3.6, 4.0-ALPHA
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: contrib - LangId
    • Labels:
      None

      Description

      The language-detection library used by LangDetectLanguageIdentifierUpdateProcessor uses a random sampling feature enabled by default as a means of avoiding local noise in input. The feature has its merits, but it can also be confusing to users who aren't aware of it since it may give different on the same input. I recommend turning it off to prevent confusion.

      1. SOLR-3107.patch
        0.9 kB
        Christian Moen

        Activity

        Hide
        Christian Moen added a comment -

        Attached a trivial patch tested on trunk.

        Show
        Christian Moen added a comment - Attached a trivial patch tested on trunk .
        Hide
        Robert Muir added a comment -

        +1, i neglected to do this when initially adding this... lets fix this.

        Show
        Robert Muir added a comment - +1, i neglected to do this when initially adding this... lets fix this.
        Hide
        Robert Muir added a comment -

        Thanks Christian!

        Show
        Robert Muir added a comment - Thanks Christian!

          People

          • Assignee:
            Robert Muir
            Reporter:
            Christian Moen
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development