Details

    • Lucene Fields:
      New

      Description

      With LUCENE-3174 done, we can finally work on implementing the standard ranking models. Currently DFR, BM25 and LM are on the menu.

      Done:

      • EasyStats: contains all statistics that might be relevant for a ranking algorithm
      • EasySimilarity: the ancestor of all the other similarities. Hides the DocScorers and as much implementation detail as possible
      • BM25: the current "mock" implementation might be OK
      • LM
      • DFR
      • The so-called Information-Based Models
      1. LUCENE-3220.patch
        8 kB
        David Mark Nemeskey
      2. LUCENE-3220.patch
        13 kB
        David Mark Nemeskey
      3. LUCENE-3220.patch
        12 kB
        David Mark Nemeskey
      4. LUCENE-3220.patch
        12 kB
        David Mark Nemeskey
      5. LUCENE-3220.patch
        9 kB
        David Mark Nemeskey
      6. LUCENE-3220.patch
        8 kB
        David Mark Nemeskey
      7. LUCENE-3220.patch
        6 kB
        David Mark Nemeskey
      8. LUCENE-3220.patch
        6 kB
        David Mark Nemeskey
      9. LUCENE-3220.patch
        36 kB
        David Mark Nemeskey
      10. LUCENE-3220.patch
        52 kB
        David Mark Nemeskey
      11. LUCENE-3220.patch
        52 kB
        David Mark Nemeskey
      12. LUCENE-3220.patch
        50 kB
        David Mark Nemeskey
      13. LUCENE-3220.patch
        46 kB
        David Mark Nemeskey
      14. LUCENE-3220.patch
        42 kB
        David Mark Nemeskey
      15. LUCENE-3220.patch
        42 kB
        David Mark Nemeskey
      16. LUCENE-3220.patch
        39 kB
        David Mark Nemeskey
      17. LUCENE-3220.patch
        31 kB
        David Mark Nemeskey
      18. LUCENE-3220.patch
        27 kB
        David Mark Nemeskey
      19. LUCENE-3220.patch
        27 kB
        David Mark Nemeskey
      20. LUCENE-3220.patch
        39 kB
        David Mark Nemeskey
      21. LUCENE-3220.patch
        4 kB
        David Mark Nemeskey
      22. LUCENE-3220.patch
        4 kB
        David Mark Nemeskey
      23. LUCENE-3220.patch
        4 kB
        David Mark Nemeskey
      24. LUCENE-3220.patch
        4 kB
        David Mark Nemeskey

        Issue Links

          Activity

          Gavin made changes -
          Link This issue is depended upon by LUCENE-3357 [ LUCENE-3357 ]
          Gavin made changes -
          Link This issue blocks LUCENE-3357 [ LUCENE-3357 ]
          Robert Muir made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Fix Version/s flexscoring branch [ 12316437 ]
          Resolution Fixed [ 1 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489883 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489739 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489567 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489566 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489391 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489390 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12489109 ]
          David Mark Nemeskey made changes -
          Labels gsoc gsoc gsoc2011
          Component/s core/query/scoring [ 12311984 ]
          David Mark Nemeskey made changes -
          Link This issue blocks LUCENE-3357 [ LUCENE-3357 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12488865 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12487143 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12486487 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12486420 ]
          David Mark Nemeskey made changes -
          Description With [LUCENE-3174|https://issues.apache.org/jira/browse/LUCENE-3174] done, we can finally work on implementing the standard ranking models. Currently DFR, BM25 and LM are on the menu.

          TODO:
           * {{EasyStats}}: contains all statistics that might be relevant for a ranking algorithm
           * {{EasySimilarity}}: the ancestor of all the other similarities. Hides the DocScorers and as much implementation detail as possible
           * _BM25_: the current "mock" implementation might be OK
           * _LM_
           * _DFR_

          Done:
          With [LUCENE-3174|https://issues.apache.org/jira/browse/LUCENE-3174] done, we can finally work on implementing the standard ranking models. Currently DFR, BM25 and LM are on the menu.

          Done:
           * {{EasyStats}}: contains all statistics that might be relevant for a ranking algorithm
           * {{EasySimilarity}}: the ancestor of all the other similarities. Hides the DocScorers and as much implementation detail as possible
           * _BM25_: the current "mock" implementation might be OK
           * _LM_
           * _DFR_
           * The so-called _Information-Based Models_

          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12486316 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12485997 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12485296 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12485161 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483961 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483917 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483844 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483821 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483452 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483273 ]
          David Mark Nemeskey made changes -
          Comment [ Done. ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483271 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483271 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483168 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483164 ]
          David Mark Nemeskey made changes -
          Attachment LUCENE-3220.patch [ 12483155 ]
          David Mark Nemeskey made changes -
          Parent LUCENE-2959 [ 12501006 ]
          Issue Type New Feature [ 2 ] Sub-task [ 7 ]
          David Mark Nemeskey made changes -
          Field Original Value New Value
          Description With [LUCENE-3174|https://issues.apache.org/jira/browse/LUCENE-3174] done, we can finally work on implementing the standard ranking models. Currently DFR, BM25 and LM are on the menu.

          TODO:
           * `EasyStats`: contains all statistics that might be relevant for a ranking algorithm
           * `EasySimilarity`: the ancestor of all the other similarities. Hides the DocScorers and as much implementation detail as possible
           * _BM25_: the current "mock" implementation might be OK
           * _LM_
           * _DFR_

          Done:
          With [LUCENE-3174|https://issues.apache.org/jira/browse/LUCENE-3174] done, we can finally work on implementing the standard ranking models. Currently DFR, BM25 and LM are on the menu.

          TODO:
           * {{EasyStats}}: contains all statistics that might be relevant for a ranking algorithm
           * {{EasySimilarity}}: the ancestor of all the other similarities. Hides the DocScorers and as much implementation detail as possible
           * _BM25_: the current "mock" implementation might be OK
           * _LM_
           * _DFR_

          Done:
          David Mark Nemeskey created issue -

            People

            • Assignee:
              David Mark Nemeskey
              Reporter:
              David Mark Nemeskey
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Due:
                Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 336h
                336h
                Remaining:
                Remaining Estimate - 336h
                336h
                Logged:
                Time Spent - Not Specified
                Not Specified

                  Development