  Solr / SOLR-5213

collections?action=SPLITSHARD parent vs. sub-shards numDocs

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.4
    • Fix Version/s: 5.2
    • Component/s: update
    • Labels:
      None

      Description

      The problem we saw was that splitting a shard took a long time, and at the end the sub-shards contained fewer documents than the original shard.

      The root cause was eventually tracked down to the disappearing documents not falling into the hash ranges of the sub-shards.
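To illustrate the root cause, here is a minimal sketch (not Solr's code) that checks whether a set of sub-shard hash ranges covers the parent range without gaps. It assumes hash ranges are inclusive (min, max) integer pairs; `find_gaps` and the `Range` alias are hypothetical names introduced only for this example.

```python
from typing import List, Tuple

# Assumption for this sketch: a hash range is an inclusive (min, max) pair.
Range = Tuple[int, int]


def find_gaps(parent: Range, subs: List[Range]) -> List[Range]:
    """Return the parts of `parent` that no sub-range covers."""
    gaps = []
    cursor = parent[0]  # lowest hash value not yet known to be covered
    for lo, hi in sorted(subs):
        if lo > cursor:
            # Everything from cursor up to lo - 1 is uncovered.
            gaps.append((cursor, min(lo - 1, parent[1])))
        cursor = max(cursor, hi + 1)
        if cursor > parent[1]:
            break
    if cursor <= parent[1]:
        gaps.append((cursor, parent[1]))
    return gaps
```

Any document whose hash lands in one of the returned gaps would end up in no sub-shard, which matches the symptom described above.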

      Could SolrIndexSplitter's split report per-segment numDocs for the parent and the sub-shards, with at least a warning logged for any discrepancies (documents falling into none of the sub-shards, or documents falling into several sub-shards)?
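The kind of report requested here could look roughly like the following sketch. It is not SolrIndexSplitter's implementation: `tally_split` is a hypothetical helper, document hashes are assumed to be precomputed integers, and ranges are assumed to be inclusive (min, max) pairs. It could be run once per segment to get per-segment numbers.

```python
from typing import Dict, List, Tuple

# Assumption for this sketch: a hash range is an inclusive (min, max) pair.
Range = Tuple[int, int]


def tally_split(doc_hashes: List[int], subs: List[Range]) -> Dict[str, object]:
    """Count documents per sub-range, plus those in none or in several."""
    per_sub = [0] * len(subs)
    in_none = 0
    in_several = 0
    for h in doc_hashes:
        # Indices of all sub-ranges that contain this document's hash.
        hits = [i for i, (lo, hi) in enumerate(subs) if lo <= h <= hi]
        for i in hits:
            per_sub[i] += 1
        if not hits:
            in_none += 1
        elif len(hits) > 1:
            in_several += 1
    return {
        "parent": len(doc_hashes),
        "per_sub": per_sub,
        "in_none": in_none,
        "in_several": in_several,
    }
```

A non-zero `in_none` or `in_several` is the discrepancy that would warrant a logged warning.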

      Additionally, could a case be made for erroring out when discrepancies are detected, i.e. not proceeding with the shard split? This could either always be an error or be controlled by an optional verifyNumDocs=false/true parameter for the SPLITSHARD action.
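As a rough sketch of the proposed behaviour (an illustration only, not how Solr handles it), the check below warns on a mismatch by default and raises when a flag modelled on the suggested verifyNumDocs parameter is set. `check_split` and its arguments are hypothetical names.

```python
import logging
from typing import List

log = logging.getLogger("split-check")


def check_split(parent_num_docs: int, sub_num_docs: List[int],
                verify_num_docs: bool = False) -> bool:
    """Return True if sub-shard counts add up to the parent count.

    On a mismatch: raise if verify_num_docs is set, otherwise log a
    warning and return False.
    """
    total = sum(sub_num_docs)
    if total == parent_num_docs:
        return True
    message = (
        f"numDocs mismatch: parent={parent_num_docs}, "
        f"sub-shards={sub_num_docs} (sum={total})"
    )
    if verify_num_docs:
        raise ValueError(message)
    log.warning(message)
    return False
```

Comparing totals alone would miss the case where documents lost to no sub-shard are offset by documents counted in several, so a per-document check is the stricter option.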

        Attachments

        1. SOLR-5213.patch
          2 kB
          Ramkumar Aiyengar
        2. SOLR-5213.patch
          2 kB
          Christine Poerschke

            People

            • Assignee: Ramkumar Aiyengar (andyetitmoves)
            • Reporter: Christine Poerschke (cpoerschke)
            • Votes: 0
            • Watchers: 8
