Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-16675

Introduce the possibility to rerank topK results with vector similarity functions using DenseVectorField

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Blocker
    • Resolution: Done
    • None
    • 9.3
    • None

    Description

      When using knnQParser in reranking pay attention to the top-K parameter.

      The second pass score(deriving from KNN search) is calculated only if the document d from the first pass is within the K nearest neighbors(in the whole index) of the target vector to search.

      This is a current limitation.

      The final ranked list of results will have the first pass score(main query q) combined with the second pass score(the approximated similarity function distance to the target vector to search).

      Ideally, it should be possible to:

      • Rerank top K results with vector similarity. We should compute the vector similarity function using the DenseVectorField value of all the documents in top K results without the need of running a KNN query.
      • Use only the second pass score as the final score

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            abenedetti Alessandro Benedetti
            eliaporciani Elia Porciani
            Votes:
            1 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 0.5h
                0.5h

                Slack

                  Issue deployment