  1. Kudu
  2. KUDU-2903

Durability testing framework and tests

    Details

    • Type: Bug
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: 1.11.0
    • Fix Version/s: None
    • Component/s: test

      Description

      From time to time we get user reports of durability issues in Kudu. We try to be good citizens and obey the POSIX spec w.r.t. durably storing data on disk, but we lack any sort of tests that prove we're doing this correctly.
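
      As a point of reference, one common POSIX pattern for durably replacing a file is: write to a temporary file, fsync it, rename it over the target, then fsync the parent directory. The sketch below illustrates that pattern only; it is not taken from Kudu's code, and the name durable_write is hypothetical.

```python
import os
import tempfile


def durable_write(path: str, data: bytes) -> None:
    """Replace `path` with `data`, flushing both the file and its directory entry.

    Hypothetical helper for illustration; not part of Kudu.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # Create the temp file in the same directory so the rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)
            f.flush()             # flush Python's userspace buffer
            os.fsync(f.fileno())  # ask the kernel to flush file contents to the device
        os.replace(tmp_path, path)  # atomic rename over the target
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
    # fsync the parent directory so the rename itself is persisted.
    dir_fd = os.open(directory, os.O_RDONLY)
    try:
        os.fsync(dir_fd)
    finally:
        os.close(dir_fd)
```

      Even when every call succeeds, whether the data actually survives a crash depends on the filesystem, the kernel, and the hardware, which is the gap the tests requested here would cover.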

      Ideally, we'd have a framework that allows us to run a standard Kudu workload while doing pathological things to a subset of nodes like:

      • Panicking the Linux kernel.
      • Abruptly cutting power.
      • Abruptly unmounting a filesystem or yanking a disk.

      Then we'd restart Kudu on the affected nodes and prove that all on-disk data remains consistent.
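
      The run, inject, restart, verify loop described above can be sketched with a toy in-memory model. Everything here is an assumption made for illustration: SimulatedNode and run_crash_test are hypothetical names, and crash() merely discards unsynced writes, whereas a real framework would panic the kernel, cut power, or pull a disk on actual machines.

```python
from dataclasses import dataclass, field


@dataclass
class SimulatedNode:
    """Toy stand-in for a storage node: writes sit in a volatile buffer until synced."""
    durable: dict = field(default_factory=dict)
    volatile: dict = field(default_factory=dict)

    def write(self, key, value, sync=True):
        self.volatile[key] = value
        if sync:
            self.sync()

    def sync(self):
        # Models a successful flush to stable storage.
        self.durable.update(self.volatile)
        self.volatile.clear()

    def crash(self):
        # Models abrupt power loss: anything not yet synced is gone.
        self.volatile.clear()

    def read(self, key):
        return self.volatile.get(key, self.durable.get(key))


def run_crash_test(node, num_writes, sync_on_write=True):
    """Run a workload, crash the node, and return the acknowledged writes that were lost."""
    acknowledged = {}
    for i in range(num_writes):
        key, value = f"row-{i}", i
        node.write(key, value, sync=sync_on_write)
        acknowledged[key] = value  # the client was told this write succeeded
    node.crash()  # fault injection; "restart" is implicit since only durable state remains
    # Verification: every acknowledged write must still be readable after the crash.
    return {k: v for k, v in acknowledged.items() if node.read(k) != v}


if __name__ == "__main__":
    print("lost with sync:", run_crash_test(SimulatedNode(), 5, sync_on_write=True))
    print("lost without sync:", run_crash_test(SimulatedNode(), 5, sync_on_write=False))
```

      The invariant checked is that no acknowledged write may be missing after restart; a real test would need additional checks, for example that on-disk metadata and data blocks agree with each other.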

      Without such a framework, we can only theorize about issues and their possible fixes. Examples include KUDU-2195 and KUDU-2260.

            People

            • Assignee: Unassigned
            • Reporter: Adar Dembo (adar)