
Details

    • Sub-task
    • Status: Patch Available
    • Minor
    • Resolution: Unresolved

    Description

      We are currently testing Time Routed Aliases (TRA) on Solr 7.7 with more than 300 shards in the alias, and we expect significant growth in the coming months.
      The "hot" data (in our case, the more recent data) will be stored on stronger nodes (SSD, more RAM, etc.).
      This led to a proposal for optimizing queries sorted by router.field (the field that TRA uses to route data to the correct collection).
      Perhaps, for queries sorted by router.field, Solr could be smart enough to wait for the more recent collections first and, once the limit has been reached, cancel the other queries (or simply not block waiting for their results)?

      For example:

      Consider a query against a TRA that filters on a field other than router.field but sorts by router.field desc, with limit=100.
      Since this is a TRA, Solr will issue queries to all the collections in the alias.
      To optimize this particular type of query, Solr could wait for the most recent collection in the TRA and check whether its result set meets or exceeds the limit. If so, the results could be returned to the user without waiting for the rest of the shards. If not, the issuing node would block until the query to the second most recent collection returns, and so forth, until the limit of the request is reached.
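
The idea above can be sketched roughly as follows. This is a minimal illustration in Python, not Solr code: `search_newest_first` and `query_collection` are hypothetical names, and the sketch assumes that each collection in the alias covers a disjoint, ordered range of router.field, so results from a newer collection always sort ahead of results from an older one.

```python
from concurrent.futures import ThreadPoolExecutor


def search_newest_first(collections_newest_first, query_collection, limit):
    """Fan out to every collection, but consume responses newest-first.

    query_collection(name, rows) is a hypothetical callable that returns up to
    `rows` documents from one collection, already filtered and sorted by the
    router field in descending order.
    """
    executor = ThreadPoolExecutor(max_workers=max(1, len(collections_newest_first)))
    # Issue all per-collection queries up front, as the alias does today.
    futures = [
        executor.submit(query_collection, name, limit)
        for name in collections_newest_first
    ]
    results = []
    consumed = 0
    try:
        for future in futures:
            if len(results) >= limit:
                break  # limit reached: do not wait for older collections
            results.extend(future.result())
            consumed += 1
    finally:
        # Best effort only: cancel() has no effect on queries already running.
        for future in futures[consumed:]:
            future.cancel()
        executor.shutdown(wait=False)
    return results[:limit], consumed
```

The second return value is the number of collections whose responses were actually awaited, which is the quantity this optimization tries to keep small.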

      This might also be useful for deep paging: query each collection in turn and move on to the next only once the current collection has no more results.
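
A rough sketch of that paging scheme, again in Python and purely illustrative: `fetch_page` and `query_collection` are hypothetical names, the cursor is a simple (collection index, offset) pair rather than anything Solr provides, and the same assumption holds that newer collections always sort ahead of older ones.

```python
def fetch_page(collections_newest_first, query_collection, cursor, page_size):
    """Return one page of results and the cursor for the next page.

    cursor is (index, offset): which collection we are in and how many of its
    documents have already been returned.
    query_collection(name, start, rows) is a hypothetical callable returning up
    to `rows` documents from one collection, skipping the first `start`.
    """
    index, offset = cursor
    page = []
    while index < len(collections_newest_first) and len(page) < page_size:
        wanted = page_size - len(page)
        docs = query_collection(collections_newest_first[index], offset, wanted)
        page.extend(docs)
        if len(docs) < wanted:
            # Current collection is exhausted: move on to the next older one.
            index, offset = index + 1, 0
        else:
            offset += len(docs)
    return page, (index, offset)
```

Starting from the cursor `(0, 0)`, a caller would pass each returned cursor back in until an empty page comes back. Only the collection the cursor points at is queried, plus any older ones needed to fill the page.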

      Thoughts or inputs are always welcome.
      This is just my two cents, and I'm always happy to brainstorm.

      Thanks in advance.

      Attachments

        1. SOLR-13125.patch
          79 kB
          mosh
        2. SOLR-13125.patch
          79 kB
          mosh
        3. SOLR-13125.patch
          79 kB
          mosh
        4. SOLR-13125-no-commit.patch
          80 kB
          mosh

            People

              gus Gus Heck
              moshebla mosh
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 10m