GEODE-10410

Rebalance Guard Prevent Lost Bucket Recovery


Details

    Description

      The following steps reproduce the issue:

      Run start.gfsh from the attached example. It configures a Geode system with a partitioned region and a gateway sender, so there are two regions: the manually created region and the sender's queue region.

      Then run the example code, which loads about 400M of data and five times that amount of events into the system. All data is loaded successfully: no buckets are lost and no out-of-memory errors occur.

      Then stop one of the servers and revoke that server's disk files.

      Then restart the server, which triggers bucket recovery. Afterwards, some of the secondary buckets are lost, as the region metrics show:

      gfsh>show metrics --region=/example-region

                | numBucketsWithoutRedundancy  | 63
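
      To make the check above repeatable after each restart, the metric can be pulled out of captured gfsh output. The following is only a minimal sketch: it assumes the output of `show metrics` has already been saved to a string, and the helper name `buckets_without_redundancy` is hypothetical and not part of Geode or gfsh.

```python
import re
from typing import Optional


def buckets_without_redundancy(metrics_output: str) -> Optional[int]:
    """Return the numBucketsWithoutRedundancy value found in captured
    `show metrics` output, or None if the metric row is absent.

    Assumption: the row has the form shown in this report,
    i.e. `| numBucketsWithoutRedundancy  | <integer>`.
    """
    match = re.search(r"numBucketsWithoutRedundancy\s*\|\s*(\d+)", metrics_output)
    return int(match.group(1)) if match else None


# Sample row copied from the output in this report.
sample = "          | numBucketsWithoutRedundancy  | 63"

count = buckets_without_redundancy(sample)
if count is None:
    print("metric not found in output")
elif count > 0:
    print(f"{count} buckets are without redundancy")
else:
    print("redundancy is satisfied for all buckets")
```

      A non-zero value after recovery has finished corresponds to the symptom described here; zero would indicate that all secondary buckets were restored.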

       

      Attachments

        1. server2.log (480 kB, Weijie Xu)
        2. test.tar.gz (6 kB, Weijie Xu)

            People

              Weijie Xu
