Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-6264

Distributed commit and optimize are executed serially across all replicas.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 4.10, 6.0
    • SolrCloud
    • None

    Description

      Regardless of whether one agrees with optimizing, when you execute an optimize request using waitSearcher=true, the requests from the controller node are sent to each replica in the collection serially.

      You can send the optimize command to the update handler for a collection to any node in the cluster. For instance, if I had a collection named "foo":

      curl -i -v http://localhost:8984/solr/foo/update --data-binary '<optimize maxSegments="1" waitSearcher="true"/>' -H 'Content-type:application/xml'

      The node that receives this request will collect the URL for all "live" replicas in the collection (not just leaders) (see DistributedUpdateProcessor#getCollectionUrls) and then forward the commit request to each of them. On the surface, the code looks like it forwards the request asynchronously to all replicas. However, this is not actually what happens; the commit requests to each replica in the collection will be processed serially when using waitSearcher=true (because ConcurrentUpdateSolrServer's background queue processing is by-passed for commits).

      Bottom-line, if you request the collection to be optimized, the request gets forwarded around as you'd expect but is done synchronously so can take a long time.

      Attachments

        1. SOLR-6264.patch
          6 kB
          Mark Miller
        2. SOLR-6264.patch
          7 kB
          Mark Miller

        Activity

          People

            markrmiller@gmail.com Mark Miller
            thelabdude Timothy Potter
            Votes:
            1 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: