  1. Hadoop HDFS
  2. HDFS-1073 Simpler model for Namenode's fs Image and edit Logs
  3. HDFS-2093

1073: Handle case where an entirely empty log is left during NN crash

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Version: Edit log branch (HDFS-1073)
    • Component/s: namenode
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      In fault-testing the HDFS-1073 branch, I saw the following situation:

      • The NN has two storage directories, but one is in a failed state.
      • The NN starts to roll its edit logs to edits_inprogress_5160285.
      • The NN then crashes.
      • On restart, it detects the truncated log, but since the log has 0 txns, it finalizes it to the nonsense log name edits_5160285-5160284 (the end txid is one less than the start txid).
      • It then starts logging again at edits_inprogress_5160285.
      • After this point, no checkpoints or future NN startups succeed, since there are two logs starting with the same txid.
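
To make the failure mode above concrete, here is a minimal Java sketch of one way recovery could special-case an empty in-progress log. The class and method names (EmptyLogRecoverySketch, finalizedNameFor) are hypothetical and are not taken from the HDFS-1073 branch or the attached patches; the sketch only illustrates the naming arithmetic, under the assumption that a finalized log is named edits_<firstTxId>-<lastTxId>.

```java
import java.util.Optional;

/** Hypothetical sketch; names are illustrative, not the HDFS-1073 branch's API. */
public class EmptyLogRecoverySketch {

    /**
     * Decide what to call an in-progress log found at startup.
     * Returns the finalized name, or an empty Optional when the log holds
     * no transactions and should be discarded rather than finalized.
     */
    public static Optional<String> finalizedNameFor(long firstTxId, long numTxns) {
        if (numTxns < 0) {
            throw new IllegalArgumentException("numTxns must be >= 0");
        }
        if (numTxns == 0) {
            // Finalizing here would compute lastTxId = firstTxId - 1, giving
            // a name like edits_5160285-5160284. Signal "discard" instead.
            return Optional.empty();
        }
        long lastTxId = firstTxId + numTxns - 1;
        return Optional.of("edits_" + firstTxId + "-" + lastTxId);
    }

    public static void main(String[] args) {
        // Empty log from the scenario in this report: nothing to finalize.
        System.out.println(finalizedNameFor(5160285L, 0));
        // A log with three transactions finalizes to a well-formed range.
        System.out.println(finalizedNameFor(5160285L, 3));
    }
}
```

With the empty case handled separately, only one log starting at txid 5160285 is left once the NN begins writing again. Whether the empty file should be deleted or moved aside is a separate design choice that this sketch does not address.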

      Attachments

        1. hdfs-2093.txt
          12 kB
          Todd Lipcon
        2. hdfs-2093.txt
          12 kB
          Todd Lipcon
        3. hdfs-2093.txt
          7 kB
          Todd Lipcon
        4. hdfs-2093.txt
          5 kB
          Todd Lipcon

          People

            Assignee: Todd Lipcon (tlipcon)
            Reporter: Todd Lipcon (tlipcon)