Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-7940

Bengali Analyzer for Lucene

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 7.1, 8.0
    • modules/analysis
    • New

    Description

      Dear All,

      I have noticed that, an issue(https://issues.apache.org/jira/browse/LUCENE-2725) was created to add Bengali Analyzer into LUCENE but it was nearly 7(seven) years ago. I didn't see any update in that issue on JIRA.

      In few days ago, I am in need of analyzing my Bangla documents(I have used Elasticsearch). I have contacted with a member of elastic.co. He suggested me to do a contribution with my research codes to LUCENE.

      I have started reviewing the codes of "modules/analysis". I have noticed that, Hindi analyzer is added already. By following HindiAnalyzer and HindiStemmer codes, I have developed BengaliAnalyzer for LUCENE.

      I have followed two research papers and implemented features which are needed.

      Please give me instructions, what should I do next.

      Thanks

      Attachments

        Activity

          People

            Unassigned Unassigned
            sunkuet02 Md. Abdulla-Al-Sun
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 168h
                168h
                Remaining:
                Remaining Estimate - 168h
                168h
                Logged:
                Time Spent - Not Specified
                Not Specified