  1. Hadoop HDFS
  2. HDFS-1623 High Availability Framework for HDFS NN
  3. HDFS-2766

HA: test for case where standby partially reads log and then performs checkpoint

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: HA branch (HDFS-1623)
    • Fix Version/s: None
    • Component/s: ha, namenode
    • Labels: None

      Description

      Here's a potential bug case that we don't currently test for:

      • The SBN is reading a finalized edits file when NFS disappears halfway through (or some other intermittent error occurs)
      • The SBN performs a checkpoint and uploads it to the NN
      • The NN receives a checkpoint that doesn't correspond to the end of any log segment
      • Both the NN and the SBN should be able to restart at this point.
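
      The scenario above can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Hadoop code and not the test from the attached patch: the class PartialCheckpointSketch, its Segment type, and the methods tailEdits, endsOnSegmentBoundary, and restartFrom are all hypothetical names invented for illustration. It models segments as transaction ID ranges, stops reading partway through a segment, and checks that a restart from the resulting mid-segment checkpoint can still replay the remaining edits.

```java
import java.util.List;

/** Toy model of the scenario; every name here is hypothetical, not a Hadoop API. */
class PartialCheckpointSketch {

    /** A finalized edit log segment covering transaction IDs [firstTxId, lastTxId]. */
    static final class Segment {
        final long firstTxId;
        final long lastTxId;

        Segment(long firstTxId, long lastTxId) {
            this.firstTxId = firstTxId;
            this.lastTxId = lastTxId;
        }
    }

    /**
     * Simulates the standby applying edits in order. Reading stops once a
     * transaction ID greater than failAfterTxId is reached, standing in for
     * shared storage disappearing mid-segment. Returns the last applied txid,
     * which is the txid a subsequent checkpoint would cover.
     */
    static long tailEdits(List<Segment> segments, long lastAppliedTxId, long failAfterTxId) {
        long applied = lastAppliedTxId;
        for (Segment s : segments) {
            for (long txId = Math.max(s.firstTxId, applied + 1); txId <= s.lastTxId; txId++) {
                if (txId > failAfterTxId) {
                    return applied; // storage "disappeared" before this transaction
                }
                applied = txId;
            }
        }
        return applied;
    }

    /** True if the checkpoint txid coincides with the end of some segment. */
    static boolean endsOnSegmentBoundary(List<Segment> segments, long checkpointTxId) {
        for (Segment s : segments) {
            if (s.lastTxId == checkpointTxId) {
                return true;
            }
        }
        return false;
    }

    /**
     * Simulates a restart: start from the checkpoint and replay every edit
     * after it. A segment that straddles the checkpoint is entered partway
     * through instead of being rejected. Returns the final txid reached.
     */
    static long restartFrom(long checkpointTxId, List<Segment> segments) {
        long applied = checkpointTxId;
        for (Segment s : segments) {
            if (s.lastTxId <= applied) {
                continue; // fully covered by the checkpoint
            }
            if (s.firstTxId > applied + 1) {
                throw new IllegalStateException("gap in edit log after txid " + applied);
            }
            applied = s.lastTxId; // replay the part of the segment after the checkpoint
        }
        return applied;
    }

    public static void main(String[] args) {
        List<Segment> segments = List.of(new Segment(1, 10), new Segment(11, 20));
        long checkpointTxId = tailEdits(segments, 0, 15); // read fails inside the second segment
        System.out.println("checkpoint txid: " + checkpointTxId);
        System.out.println("on segment boundary: " + endsOnSegmentBoundary(segments, checkpointTxId));
        System.out.println("txid after restart: " + restartFrom(checkpointTxId, segments));
    }
}
```

      The point of the model is the restartFrom step: a loader that assumes every checkpoint ends exactly where a segment ends would fail here, which is the case the proposed test is meant to cover.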

        Attachments

        1. HDFS-2766-HDFS-1623.patch
          18 kB
          Aaron Myers
        2. HDFS-2766-HDFS-1623.patch
          18 kB
          Aaron Myers

            People

            • Assignee: Aaron Myers (atm)
            • Reporter: Todd Lipcon (tlipcon)
