Kudu / KUDU-3325

When the WAL is deleted, fault recovery and load balancing behave abnormally

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: consensus
    • Labels: None

    Description

      1. Run kudu tablet leader_step_down repeatedly to generate multiple WAL entries:

      ./kudu tablet leader_step_down $MASTER_IP 1299f5a939d2453c83104a6db0cae3e7

      WAL (screenshot)

      cmeta (screenshot)
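A minimal sketch of how step 1 could be repeated from a script. The function name `repeat_step_down`, the `KUDU_BIN` variable, and the pause between calls are assumptions for illustration only; the report itself shows just the single `leader_step_down` invocation.

```shell
#!/bin/sh
# Path to the kudu CLI; "./kudu" matches the invocation used in the report.
KUDU_BIN="${KUDU_BIN:-./kudu}"

# repeat_step_down MASTER TABLET_ID COUNT [PAUSE_SECONDS]
# Hypothetical helper: asks the tablet's leader to step down COUNT times,
# pausing between calls so that a new leader can be elected.
repeat_step_down() (
    master=$1
    tablet_id=$2
    count=$3
    pause=${4:-5}   # assumed default; tune for the cluster
    i=1
    while [ "$i" -le "$count" ]; do
        "$KUDU_BIN" tablet leader_step_down "$master" "$tablet_id" ||
            echo "step-down attempt $i failed; continuing" >&2
        sleep "$pause"
        i=$((i + 1))
    done
)
```

It could be invoked as `repeat_step_down "$MASTER_IP" 1299f5a939d2453c83104a6db0cae3e7 10`.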

      2. Stop one of the tservers to trigger tablet recovery, so that opid_index is flushed to cmeta.

      WAL (screenshot)

      cmeta (screenshot)

      3. Stop all tservers and delete the tablet's WAL.

      4. Start all tservers.

      The index in the WAL now starts counting from 1 again, but the opid_index recorded in cmeta is still 20, the value from before the WAL was deleted.

      WAL (screenshot)

      cmeta (screenshot)
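The WAL and cmeta contents shown in the screenshots can also be dumped as text on the tserver host. The sketch below uses the `kudu pbc dump` and `kudu wal dump` tools. The helper names, the `KUDU_BIN` variable, and the on-disk layout (`consensus-meta/<tablet_id>` and `wals/<tablet_id>/wal-*` under the WAL root directory) are assumptions and may differ per deployment.

```shell
#!/bin/sh
# Path to the kudu CLI; "./kudu" matches the invocation used in the report.
KUDU_BIN="${KUDU_BIN:-./kudu}"

# dump_cmeta WAL_ROOT TABLET_ID
# Prints the tablet's consensus metadata, which includes the opid_index
# of the committed Raft config. The file location is an assumption.
dump_cmeta() (
    "$KUDU_BIN" pbc dump "$1/consensus-meta/$2"
)

# dump_wal_segments WAL_ROOT TABLET_ID
# Dumps every WAL segment found for the tablet so the op indexes can be
# compared with the cmeta value. The directory layout is an assumption.
dump_wal_segments() (
    for seg in "$1/wals/$2"/wal-*; do
        [ -e "$seg" ] || continue   # skip the literal pattern if nothing matched
        "$KUDU_BIN" wal dump "$seg"
    done
)
```

Comparing the op indexes in the WAL dump with the `opid_index` in the cmeta dump should show the mismatch described above (1 versus 20).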

       

      5. Stop a tserver to trigger fault recovery.

      When the leader recovers a replica and the master requests a Raft config change to add the new replica, the leader replica ignores the change because its op index is smaller than the opid_index recorded in cmeta.
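The check described above can be illustrated with the values from this report. This is a sketch of the reported behavior, not Kudu's actual code; the function name `config_change_ignored` is hypothetical, and only the "smaller than" case stated in the report is modeled.

```shell
#!/bin/sh
# config_change_ignored OP_INDEX CMETA_OPID_INDEX
# Succeeds (exit status 0) when the config change would be ignored, that is,
# when its op index is smaller than the opid_index stored in cmeta.
# How an equal index is handled is not stated in the report and is an
# assumption here (treated as not ignored).
config_change_ignored() {
    [ "$1" -lt "$2" ]
}

# After the WAL is deleted, op indexes restart at 1 while cmeta still holds 20.
if config_change_ignored 1 20; then
    echo "config change at op index 1 is ignored (cmeta opid_index is 20)"
fi
```

Under this model, config changes keep being ignored until the regenerated WAL's op index is no longer below the stale value in cmeta.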
       

      6. Delete all WALs.

      7. Run kudu cluster rebalance:

      ./kudu cluster rebalance $MASTER_IP

      Rebalancing also fails at the Raft config change.

      Attachments

        1. image-2021-10-06-15-36-40-996.png
          428 kB
          yejiabao_h
        2. image-2021-10-06-15-36-53-813.png
          185 kB
          yejiabao_h
        3. image-2021-10-06-15-37-09-520.png
          526 kB
          yejiabao_h
        4. image-2021-10-06-15-37-24-776.png
          936 kB
          yejiabao_h
        5. image-2021-10-06-15-37-42-533.png
          194 kB
          yejiabao_h
        6. image-2021-10-06-15-37-54-782.png
          284 kB
          yejiabao_h
        7. image-2021-10-06-15-38-06-575.png
          208 kB
          yejiabao_h
        8. image-2021-10-06-15-38-17-388.png
          204 kB
          yejiabao_h
        9. image-2021-10-06-15-38-29-176.png
          600 kB
          yejiabao_h
        10. image-2021-10-06-15-38-39-852.png
          390 kB
          yejiabao_h
        11. image-2021-10-06-15-38-53-343.png
          555 kB
          yejiabao_h
        12. image-2021-10-06-15-39-03-296.png
          726 kB
          yejiabao_h
        13. image-2021-10-06-19-23-51-769.png
          184 kB
          yejiabao_h

          People

            Assignee: Unassigned
            Reporter: yejiabao_h