Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-5821

Search inconsistency on SolrCloud replicas

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Reopened
    • Critical
    • Resolution: Unresolved
    • 4.6.1, 4.7.1
    • None
    • SolrCloud

    Description

      We use the following infrastructure:

      SolrCloud with 1 shard and 2 replicas. The index is built using DataImportHandler (importing data from the database). The number of items in the index can vary from 100 to 100,000,000.

      After indexing part of the data (not necessarily all the data, it is enough to have a small number of items in the search index), we can observe that Solr instances (replicas) return different results for the same search queries. I believe it happens because some of the results have the same scores, and Solr instances return those in a random order.

      PS This is a critical issue for us as we use a load balancer to scale Solr through replicas, and as a result of this issue, we retrieve various results for the same queries all the time. They are not necessarily completely different, but even a couple of items that differ is a deal breaker.

      The expected behaviour would be to always get identical results for the same search queries from all replicas. Otherwise, this "cloud" thing works just unreliably.

      Attachments

        1. Screen Shot 2014-04-05 at 2.26.26 AM.png
          42 kB
          Maxim Novikov
        2. Screen Shot 2014-04-05 at 2.26.41 AM.png
          36 kB
          Maxim Novikov

        Activity

          People

            Unassigned Unassigned
            csbubbles Maxim Novikov
            Votes:
            1 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated: