Lucene - Core: LUCENE-2959

[GSoC] Implementing State of the Art Ranking for Lucene

    Details

    • Lucene Fields:
      New

      Description

      Lucene employs the Vector Space Model (VSM) to rank documents, which compares
      unfavorably to state-of-the-art algorithms such as BM25. Moreover, the architecture is
      tailored specifically to VSM, which makes the addition of new ranking functions a
      non-trivial task.

      This project aims to bring state-of-the-art ranking methods to Lucene and to implement a
      query architecture with pluggable ranking functions.

      The wiki page for the project can be found at http://wiki.apache.org/lucene-java/SummerOfCode2011ProjectRanking.

      1. proposal.pdf
        85 kB
        David Mark Nemeskey
      2. implementation_plan.pdf
        49 kB
        David Mark Nemeskey
      3. LUCENE-2959_mockdfr.patch
        8 kB
        Robert Muir
      4. LUCENE-2959_nocommits.patch
        22 kB
        Robert Muir
      5. LUCENE-2959.patch
        539 kB
        Robert Muir
      6. LUCENE-2959.patch
        435 kB
        Robert Muir

          Activity

          David Mark Nemeskey created issue -
          David Mark Nemeskey made changes -
          Field Original Value New Value
          Attachment proposal.pdf [ 12473255 ]
          David Mark Nemeskey made changes -
          Attachment implementation_plan.pdf [ 12473256 ]
          David Mark Nemeskey made changes -
          Comment [ The proposal originally sent to the mailing list. ]
          David Mark Nemeskey made changes -
          Comment [ The implementation plan. ]
          David Mark Nemeskey made changes -
          Link This issue relates to LUCENE-2091 [ LUCENE-2091 ]
          David Mark Nemeskey made changes -
          Link This issue relates to LUCENE-2392 [ LUCENE-2392 ]
          Simon Willnauer made changes -
          Labels gsoc2011 lucene-gsoc-11 mentor
          Simon Willnauer made changes -
          Labels mentor gsoc2011, lucene-gsoc-11 mentor,
          Simon Willnauer made changes -
          Labels gsoc2011, lucene-gsoc-11 mentor, gsoc2011 lucene-gsoc-11 mentor
          Robert Muir made changes -
          Attachment LUCENE-2959_mockdfr.patch [ 12474975 ]
          Robert Muir made changes -
          Assignee Robert Muir [ rcmuir ]
          Robert Muir made changes -
          Fix Version/s flexscoring branch [ 12316437 ]
          David Mark Nemeskey made changes -
          Description [ Added the wiki page link: http://wiki.apache.org/lucene-java/SummerOfCode2011ProjectRanking. The rest of the description is unchanged. ]
          Robert Muir made changes -
          Attachment LUCENE-2959_nocommits.patch [ 12493275 ]
          Robert Muir made changes -
          Attachment LUCENE-2959.patch [ 12493806 ]
          Robert Muir made changes -
          Attachment LUCENE-2959.patch [ 12493814 ]
          Robert Muir made changes -
          Fix Version/s 4.0 [ 12314025 ]
          Robert Muir made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Uwe Schindler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]

            People

            • Assignee:
              Robert Muir
              Reporter:
              David Mark Nemeskey
            • Votes:
              1
              Watchers:
              5

                Time Tracking

                Estimated:
                343h
                Remaining:
                343h
                Logged:
                Not Specified
