  1. Solr
  2. SOLR-13445

Preferred replicas on nodes with same system properties as the query master


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 8.2, 9.0
    • Component/s: None
    • Labels: None

    Description

      Currently, Solr chooses a random replica for each shard to fan out the query request. However, this presents a problem when running Solr in multiple availability zones.

      If one availability zone fails, every Solr node is affected, because each node keeps trying to connect to Solr nodes in the failed zone until the request times out. Threads can then build up on each Solr node until the node runs out of memory, which results in a cascading failure.

      This issue tries to solve the problem by adding:

      • another shards.preference option named node.sysprop, so that the query is routed to nodes that have the same value for a given system property as the current node.
      • default shard preferences for the whole cluster, stored in /clusterprops.json.
      • a cache that fetches the other nodes' system properties whenever /live_nodes changes.
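
      The routing idea above can be sketched roughly as follows. This is an illustration only, not the code in the attached patch: the helper names (order_replicas, build_query_string), the data shapes, and the "zone" system property are all hypothetical, and the parameter value is assumed to take the form node.sysprop:sysprop.<name>.

```python
import random
from urllib.parse import urlencode


def order_replicas(replicas, node_sysprops, local_node, prop):
    """Put replicas on nodes sharing the local node's value of `prop` first.

    replicas:      list of dicts with a "node" key (hypothetical shape)
    node_sysprops: cached mapping of node name -> {property name: value}
    """
    local_value = node_sysprops.get(local_node, {}).get(prop)

    # Shuffle first so equally preferred replicas still share the load.
    shuffled = list(replicas)
    random.shuffle(shuffled)

    def is_not_preferred(replica):
        value = node_sysprops.get(replica["node"], {}).get(prop)
        # If the local node has no value for the property, nothing is preferred.
        return local_value is None or value != local_value

    # sorted() is stable and False sorts before True, so preferred replicas
    # move to the front while the shuffled order is kept within each group.
    return sorted(shuffled, key=is_not_preferred)


def build_query_string(query, prop):
    """Build request parameters asking for same-property replicas.

    Assumes the value syntax node.sysprop:sysprop.<name>.
    """
    return urlencode({
        "q": query,
        "shards.preference": f"node.sysprop:sysprop.{prop}",
    })
```

      With nodes started with a hypothetical -Dzone=... system property, a node in zone "a" would try replicas in zone "a" first and fall back to the other zones only afterwards, so a failed zone is not contacted while a local replica is available.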

      Attachments

        1. SOLR-13445.patch
          44 kB
          Cao Manh Dat
        2. SOLR-13445.patch
          44 kB
          Cao Manh Dat
        3. SOLR-13445.patch
          42 kB
          Cao Manh Dat

            People

              Assignee: Cao Manh Dat (caomanhdat)
              Reporter: Cao Manh Dat (caomanhdat)