Solr
  1. Solr
  2. SOLR-1984

add HyphenationCompoundWordTokenFilterFactory class

    Details

    • Type: New Feature New Feature
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.1, 4.0-ALPHA
    • Component/s: Schema and Analysis
    • Labels:
      None

      Description

      Please can you include my contribution into Solr night builds.

      I can not compile on Linux server, I have tested only on Windows.

        Activity

        Hide
        P B added a comment -

        source code

        Show
        P B added a comment - source code
        Hide
        Robert Muir added a comment -

        Thank you very much for contributing this, its true there is no factory for this feature.

        I updated your code with a few tweaks:

        • allow null dictionary. This allows the use of just the hyphenation grammar (LUCENE-1287)
        • allow encoding to be specified (but default to UTF-8). Some of the grammar distributions from offo dont use UTF-8 encoding.
        • set onlyLongestMatch default to 'false'. this is just to be consistent with the TokenFilter itself, which defaults to false.
        • added the Apache-licensed danish grammar to test-files, along with a small dictionary and some test cases.

        if no one objects, i'll commit in a bit.

        Show
        Robert Muir added a comment - Thank you very much for contributing this, its true there is no factory for this feature. I updated your code with a few tweaks: allow null dictionary. This allows the use of just the hyphenation grammar ( LUCENE-1287 ) allow encoding to be specified (but default to UTF-8). Some of the grammar distributions from offo dont use UTF-8 encoding. set onlyLongestMatch default to 'false'. this is just to be consistent with the TokenFilter itself, which defaults to false. added the Apache-licensed danish grammar to test-files, along with a small dictionary and some test cases. if no one objects, i'll commit in a bit.
        Hide
        Robert Muir added a comment -

        Committed revision 962555, 962559 (3x)

        Show
        Robert Muir added a comment - Committed revision 962555, 962559 (3x)
        Hide
        Grant Ingersoll added a comment -

        Bulk close for 3.1.0 release

        Show
        Grant Ingersoll added a comment - Bulk close for 3.1.0 release

          People

          • Assignee:
            Robert Muir
            Reporter:
            P B
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development