Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10057

Evaluate/Design a MatchMost/MatchAll doc set

Agile BoardAttach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Per discussion in SOLR-9764 Design a memory efficient DocSet if a query returns all docs, if a DocSet matches most of documents, current implementation using BitDocSet can be optimized. This JIRA is to evaluate and design an optimized DocSet for these use cases.

      The basic idea is to use an inverted integer list to enumerate all doc ids that doesn't match. If the DocSet matches most of docs, it can be pretty efficient. Meanwhile other ideas are welcome.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            michael.sun Michael Sun

            Dates

              Created:
              Updated:

              Slack

                Issue deployment