Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-5283

NN not coming out of startup safemode due to under construction blocks only inside snapshots also counted in safemode threshhold

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 2.1.1-beta, 3.0.0-alpha1
    • 2.3.0
    • snapshots
    • None
    • Reviewed

    Description

      This is observed in one of our env:
      1. A MR Job was running which has created some temporary files and was writing to them.
      2. Snapshot was taken
      3. And Job was killed and temporary files were deleted.
      4. Namenode restarted.
      5. After restart Namenode was in safemode waiting for blocks

      Analysis
      ---------
      1. Since the snapshot taken also includes the temporary files which were open, and later original files are deleted.
      2. UnderConstruction blocks count was taken from leases. not considered the UC blocks only inside snapshots
      3. So safemode threshold count was more and NN did not come out of safemode

      Attachments

        1. HDFS-5283.000.patch
          12 kB
          Jing Zhao
        2. HDFS-5283.patch
          10 kB
          Vinayakumar B
        3. HDFS-5283.patch
          10 kB
          Vinayakumar B
        4. HDFS-5283.patch
          9 kB
          Vinayakumar B
        5. HDFS-5283.patch
          12 kB
          Vinayakumar B
        6. HDFS-5283.patch
          13 kB
          Vinayakumar B
        7. HDFS-5283.patch
          8 kB
          Vinayakumar B

        Issue Links

          Activity

            People

              vinayakumarb Vinayakumar B
              vinayakumarb Vinayakumar B
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: