Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-3011

Support for smooth maintenance window

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.12.0
    • None
    • None

    Description

      A scan corresponding to a tablet failure causes the entire SQL to fail on the common query engines, such as Impala. Though we have the fault-tolerant feature by "SetFaultTolerant()", Impala doesn't use it right now since that will make lower throughput. Thus, lots of SQL that are running will fail when we shutdown/reboot/upgrade the tserver. That can be scary.

      Maybe we can do some improvement in this area, for example, the tablets are not allowed to be scanned after the tserver is in maintenance mode (KUDU-2069). And for the LEADER_ONLY mode scanning, the leader role needs to be shifted from the maintenance tserver. Then we can shutdown the tserver smoothly after all the existing SQL are completed.

       

      Attachments

        Activity

          People

            awong Andrew Wong
            helifu LiFu He
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: