Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1623 High Availability Framework for HDFS NN
  3. HDFS-2766

HA: test for case where standby partially reads log and then performs checkpoint

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • HA branch (HDFS-1623)
    • None
    • ha, namenode
    • None

    Description

      Here's a potential bug case that we don't currently test for:

      • SBN is reading a finalized edits file when NFS disappears halfway through (or some intermittent error happens)
      • SBN performs a checkpoint and uploads it to the NN
      • NN receives a checkpoint that doesn't correspond to the end of any log segment
      • Both NN and SBN should be able to restart at this point.

      Attachments

        1. HDFS-2766-HDFS-1623.patch
          18 kB
          Aaron Myers
        2. HDFS-2766-HDFS-1623.patch
          18 kB
          Aaron Myers

        Activity

          People

            atm Aaron Myers
            tlipcon Todd Lipcon
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: