Solr / SOLR-14987

SolrStream creates a new HttpSolrClient for every replica being queried instead of reusing one client per node

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 8.8, 9.0
    • Component/s: streaming expressions
    • Labels: None

    Description

      While looking into streaming expression performance issues when many collections with many shards are queried, I found that SolrStream's open method creates a new HttpSolrClient for every replica being queried. For instance, in my test case I have 10 collections with 100 shards each (rf=1), and I end up with 1000 HttpSolrClient instances in my SolrClientCache. If I instead reuse one HttpSolrClient per node hosting a replica (so 10 in my case), the query time for my expression drops by half (not to mention the reduced allocation load on the JVM).
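
      To illustrate the idea of reusing a client per node rather than per replica, here is a minimal sketch. It is not the actual patch and does not use the SolrJ API: NodeClientCache, nodeBaseUrl and clientFor are hypothetical names, the client type is left generic, and the sketch assumes a replica URL has the form <node base URL>/<core name>.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

/** Hypothetical per-node client cache; an illustration only, not SolrJ's SolrClientCache. */
final class NodeClientCache<C> {
    // One client per node base URL, shared by every replica hosted on that node.
    private final Map<String, C> clientsByNode = new ConcurrentHashMap<>();
    private final Function<String, C> clientFactory;

    NodeClientCache(Function<String, C> clientFactory) {
        this.clientFactory = clientFactory;
    }

    /**
     * Assumes the replica URL ends with the core name, e.g.
     * http://host:8983/solr/c1_shard1_replica_n1 maps to http://host:8983/solr.
     */
    static String nodeBaseUrl(String replicaUrl) {
        String trimmed = replicaUrl.endsWith("/")
                ? replicaUrl.substring(0, replicaUrl.length() - 1)
                : replicaUrl;
        int lastSlash = trimmed.lastIndexOf('/');
        return lastSlash < 0 ? trimmed : trimmed.substring(0, lastSlash);
    }

    /** Returns the cached client for the node hosting this replica, creating it on first use. */
    C clientFor(String replicaUrl) {
        return clientsByNode.computeIfAbsent(nodeBaseUrl(replicaUrl), clientFactory);
    }

    int size() {
        return clientsByNode.size();
    }
}
```

      With a cache keyed this way, the 1000 replicas in the test case above would map to 10 cache entries, one per node. A real client built for the node base URL would still need the core or collection supplied per request; how the actual fix handles that is not stated in this description.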

          People

            Assignee: Timothy Potter (thelabdude)
            Reporter: Timothy Potter (thelabdude)
            Votes: 0
            Watchers: 4

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Original Estimate: Not Specified
                Remaining Estimate: 0h
                Time Spent: 2h 10m