Hadoop HDFS / HDFS-4015

Safemode should count and report orphaned blocks

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0.0-alpha1
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: namenode
    • Labels: None
    • Hadoop Flags: Reviewed

      Description

      The safemode status currently reports the number of unique reported blocks compared to the total number of blocks referenced by the namespace. However, it does not report the inverse: blocks which are reported by datanodes but not referenced by the namespace.

      If an admin accidentally starts up from an old image, this can be confusing: safemode and fsck will show "corrupt files", which are actually files that had been deleted but were resurrected by restarting from the old image. This may convince the admin that it is safe to force-leave safemode and remove these files – after all, they know that those files should really have been deleted. However, they are not aware that leaving safemode will also unrecoverably delete a bunch of other block files which have been orphaned by the namespace rollback.

      I'd like to consider reporting something like: "900000 of expected 1000000 blocks have been reported. Additionally, 10000 blocks have been reported which do not correspond to any file in the namespace. Forcing exit of safemode will unrecoverably remove those data blocks."

      Whether this statistic should also drive some kind of "inverse safe mode" is the logical next question, but just reporting it as a warning seems easy enough to accomplish and worth doing.
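
      To make the proposed statistic concrete, here is a minimal sketch of the counting and the warning text. It is illustrative only and is not the NameNode's actual code or the contents of any attached patch: the class name OrphanedBlockReport, both method names, and the use of plain sets of block IDs are assumptions made for this example.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical illustration; not part of the HDFS code base. */
public class OrphanedBlockReport {

    /** Counts reported block IDs that the namespace does not reference. */
    public static long countOrphaned(Set<Long> namespaceBlocks, Set<Long> reportedBlocks) {
        long orphaned = 0;
        for (Long id : reportedBlocks) {
            if (!namespaceBlocks.contains(id)) {
                orphaned++;
            }
        }
        return orphaned;
    }

    /** Builds a status line modeled on the wording proposed in the description. */
    public static String safemodeStatus(Set<Long> namespaceBlocks, Set<Long> reportedBlocks) {
        long orphaned = countOrphaned(namespaceBlocks, reportedBlocks);
        // Reported blocks that the namespace does reference.
        long matched = reportedBlocks.size() - orphaned;

        StringBuilder sb = new StringBuilder();
        sb.append(String.format("%d of expected %d blocks have been reported.",
                matched, namespaceBlocks.size()));
        if (orphaned > 0) {
            // Only warn when there is something to lose.
            sb.append(String.format(" Additionally, %d blocks have been reported which do not"
                    + " correspond to any file in the namespace. Forcing exit of safemode"
                    + " will unrecoverably remove those data blocks.", orphaned));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Namespace loaded from an old image: it references blocks 1 to 4 only.
        Set<Long> namespace = new HashSet<>(Arrays.asList(1L, 2L, 3L, 4L));
        // Datanodes report three of those, plus two blocks written after that image was taken.
        Set<Long> reported = new HashSet<>(Arrays.asList(1L, 2L, 3L, 100L, 101L));
        System.out.println(safemodeStatus(namespace, reported));
    }
}
```

      In this sketch the orphan count is derived from the same block reports that already feed the "reported of expected" figure, so surfacing it as a warning would not need any extra information from the datanodes.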

        Attachments

        1. HDFS-4015.001.patch
          34 kB
          Anu Engineer
        2. HDFS-4015.002.patch
          35 kB
          Anu Engineer
        3. HDFS-4015.003.patch
          31 kB
          Anu Engineer
        4. HDFS-4015.004.patch
          34 kB
          Anu Engineer
        5. HDFS-4015.005.patch
          36 kB
          Anu Engineer
        6. HDFS-4015.006.patch
          36 kB
          Anu Engineer
        7. HDFS-4015.007.patch
          36 kB
          Arpit Agarwal

            People

            • Assignee: Anu Engineer (anu)
            • Reporter: Todd Lipcon (tlipcon)
            • Votes: 0
            • Watchers: 20