Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-4260

Inconsistent numDocs between leader and replica

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 4.6.1, 6.0
    • SolrCloud
    • None
    • 5.0.0.2013.01.04.15.31.51

    Description

      After wiping all cores and reindexing some 3.3 million docs from Nutch using CloudSolrServer we see inconsistencies between the leader and replica for some shards.

      Each core hold about 3.3k documents. For some reason 5 out of 10 shards have a small deviation in then number of documents. The leader and slave deviate for roughly 10-20 documents, not more.

      Results hopping ranks in the result set for identical queries got my attention, there were small IDF differences for exactly the same record causing a record to shift positions in the result set. During those tests no records were indexed. Consecutive catch all queries also return different number of numDocs.

      We're running a 10 node test cluster with 10 shards and a replication factor of two and frequently reindex using a fresh build from trunk. I've not seen this issue for quite some time until a few days ago.

      Attachments

        1. SOLR-4260.patch
          4 kB
          Mark Miller
        2. demo_shard1_replicas_out_of_sync.tgz
          4.03 MB
          Timothy Potter
        3. clusterstate.png
          74 kB
          Yago Riveiro
        4. 192.168.20.104-replica2.png
          28 kB
          Yago Riveiro
        5. 192.168.20.102-replica1.png
          28 kB
          Yago Riveiro

        Issue Links

          Activity

            People

              markrmiller@gmail.com Mark Miller
              markus17 Markus Jelsma
              Votes:
              6 Vote for this issue
              Watchers:
              22 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: