Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-1678

Race during abort of pending operations during raft shutdown

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.0.0, 1.6.0, 1.8.0
    • 1.9.0
    • consensus
    • None

    Description

      I'm seeing the following race occasionally in alter_table-randomized-test:

      • a follower tablet is shutting down while some operations are pending. The first operation is an ALTER_TABLE, and the second is a WRITE which depends on the ALTER (i.e includes the new column)
      • we cancel the ALTER successfully, and then the thread gets de-scheduled
      • the PrepareTask for the WRITE runs before we're able to cancel it. It then fails to prepare because the alter it depends on has not completed

      It seems like we should probably cancel the pending operations in reverse order.

      Attachments

        1. alter_table-randomized-test-debug.log.xz
          141 kB
          Alexey Serbin
        2. alter_table-randomized-test-release.log.xz
          63 kB
          Alexey Serbin

        Activity

          People

            wdberkeley William Berkeley
            tlipcon Todd Lipcon
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: