Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10006

Cannot do a full sync (fetchindex) if the replica can't open a searcher

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 5.3.1, 6.4
    • None
    • None
    • None

    Description

      Doing a full sync or fetchindex requires an open searcher and if you can't open the searcher those operations fail.

      For discussion. I've seen a situation in the field where a replica's index became corrupt. When the node was restarted, the replica tried to do a full sync but fails because the core can't open a searcher. The replica went into an endless sync/fail/sync cycle.

      I couldn't reproduce that exact scenario, but it's easy enough to get into a similar situation. Create a 2x2 collection and index some docs. Then stop one of the instances and go in and remove a couple of segments files and restart.

      The replica stays in the "down" state, fine so far.

      Manually issue a fetchindex. That fails because the replica can't open a searcher. Sure, issuing a fetchindex is abusive.... but I think it's the same underlying issue: why should we care about the state of a replica's current index when we're going to completely replace it anyway?

      Attachments

        1. solr.log
          49 kB
          Erick Erickson
        2. solr.log
          13 kB
          Erick Erickson
        3. SOLR-10006.patch
          5 kB
          Mike Drob
        4. SOLR-10006.patch
          4 kB
          Mike Drob

        Issue Links

          Activity

            People

              Unassigned Unassigned
              erickerickson Erick Erickson
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: