Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-718

configuration parameter to prevent accidental formatting of HDFS filesystem



    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 0.22.0
    • 0.22.0
    • namenode
    • None
    • Any

    • Reviewed


      Currently, any time the NameNode is not running, an HDFS filesystem will accept the 'format' command, and will duly format itself. There are those of us who have multi-PB HDFS filesystems who are really quite uncomfortable with this behavior. There is "Y/N" confirmation in the format command, but if the formatter genuinely believes themselves to be doing the right thing, the filesystem will be formatted.

      This patch adds a configuration parameter to the namenode, dfs.namenode.support.allowformat, which defaults to "true," the current behavior: always allow formatting if the NameNode is down or some other process is not holding the namenode lock. But if dfs.namenode.support.allowformat is set to "false," the NameNode will not allow itself to be formatted until this config parameter is changed to "true".

      The general idea is that for production HDFS filesystems, the user would format the HDFS once, then set dfs.namenode.support.allowformat to "false" for all time.

      The attached patch was generated against trunk and +1's on my test machine. We have a 0.20 version that we are using in our cluster as well.


        1. HDFS-718-5.patch
          9 kB
          Jakob Homan
        2. HDFS-718-4.patch
          9 kB
          Jakob Homan
        3. HDFS-718-3.patch
          11 kB
          Jakob Homan
        4. HDFS-718.patch-2.txt
          8 kB
          Andrew Ryan
        5. HDFS-718.patch.txt
          7 kB
          Andrew Ryan



            andrewr Andrew Ryan
            andrewr Andrew Ryan
            0 Vote for this issue
            7 Start watching this issue