Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-2959

[GSoC] Implementing State of the Art Ranking for Lucene

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New

    Description

      Lucene employs the Vector Space Model (VSM) to rank documents, which compares
      unfavorably to state of the art algorithms, such as BM25. Moreover, the architecture is
      tailored specically to VSM, which makes the addition of new ranking functions a non-
      trivial task.

      This project aims to bring state of the art ranking methods to Lucene and to implement a
      query architecture with pluggable ranking functions.

      The wiki page for the project can be found at http://wiki.apache.org/lucene-java/SummerOfCode2011ProjectRanking.

      Attachments

        1. implementation_plan.pdf
          49 kB
          David Mark Nemeskey
        2. LUCENE-2959_mockdfr.patch
          8 kB
          Robert Muir
        3. LUCENE-2959_nocommits.patch
          22 kB
          Robert Muir
        4. LUCENE-2959.patch
          435 kB
          Robert Muir
        5. LUCENE-2959.patch
          539 kB
          Robert Muir
        6. proposal.pdf
          85 kB
          David Mark Nemeskey

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            rcmuir Robert Muir
            david_nemeskey David Mark Nemeskey
            Votes:
            1 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 343h
                343h
                Remaining:
                Remaining Estimate - 343h
                343h
                Logged:
                Time Spent - Not Specified
                Not Specified

                Slack

                  Issue deployment