Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • Edit log branch (HDFS-1073)
    • namenode
    • None
    • Reviewed

    Description

      Right now there is very little test coverage of situations where one or more of the edits directories fail. In trunk, when all of the edits directories are dead, the NN prints a fatal-level log message and calls Runtime.exit(-1).

      I don't think this is really the behavior we want. Needs a bit of thought, but I think something like the following would make more sense:

      • any calls currently waiting on logSync should end up throwing an exception
      • NN should probably enter safe mode
      • ops can restore edits directories and then ask the NN to restore storage, at which point it could exit safemode
      • alternatively, ops could ask the NN to do saveNamespace and then shut it down
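
      The proposed behavior above can be sketched roughly as follows. This is a toy model, not NameNode code: EditsDir, EditLog, logSync, restoreStorage and the safe-mode flag are all hypothetical stand-ins invented for illustration, and the sketch assumes that "all directories dead" is detected at sync time. It only shows the control flow: a sync that reaches no directory throws to the caller and flips the log into safe mode instead of exiting the process, and a later restore request leaves safe mode once at least one directory is usable again.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Toy model only: none of these classes are real HDFS classes. */
public class EditLogFailureSketch {

    /** Hypothetical stand-in for a single edits directory. */
    static class EditsDir {
        final String name;
        volatile boolean available = true;

        EditsDir(String name) { this.name = name; }

        /** Pretend to flush edits; fails if the directory is unavailable. */
        void sync() throws IOException {
            if (!available) {
                throw new IOException("edits dir " + name + " is unavailable");
            }
        }
    }

    /** Hypothetical stand-in for the edit log plus a safe-mode flag. */
    static class EditLog {
        private final List<EditsDir> dirs;
        private boolean safeMode = false;

        EditLog(List<EditsDir> dirs) { this.dirs = new ArrayList<>(dirs); }

        synchronized boolean isInSafeMode() { return safeMode; }

        /** Sync to every directory; throw instead of exiting if none succeed. */
        synchronized void logSync() throws IOException {
            if (safeMode) {
                throw new IOException("in safe mode; edits are not accepted");
            }
            int synced = 0;
            for (EditsDir dir : dirs) {
                try {
                    dir.sync();
                    synced++;
                } catch (IOException e) {
                    // A single failed directory is tolerated.
                }
            }
            if (synced == 0) {
                safeMode = true; // assumption: enter safe mode rather than exit
                throw new IOException("all edits directories failed; entering safe mode");
            }
        }

        /** Operator request: leave safe mode if any directory is usable again. */
        synchronized boolean restoreStorage() {
            for (EditsDir dir : dirs) {
                if (dir.available) {
                    safeMode = false;
                    return true;
                }
            }
            return false;
        }
    }

    public static void main(String[] args) {
        EditsDir a = new EditsDir("edits-a");
        EditsDir b = new EditsDir("edits-b");
        EditLog log = new EditLog(Arrays.asList(a, b));

        a.available = false;
        b.available = false;
        try {
            log.logSync();
        } catch (IOException e) {
            System.out.println("logSync failed: " + e.getMessage());
        }
        System.out.println("safe mode: " + log.isInSafeMode());

        a.available = true; // operator repairs one directory
        System.out.println("restored: " + log.restoreStorage());
        System.out.println("safe mode: " + log.isInSafeMode());
    }
}
```

      The sketch leaves out the saveNamespace-then-shutdown alternative and any real concurrency between callers waiting on a sync; it is only meant to make the exception-plus-safe-mode flow concrete.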

      Attachments

        1. hdfs-2010.0.patch
          67 kB
          Aaron Myers
        2. hdfs-2010.1.patch
          8 kB
          Aaron Myers
        3. hdfs-2010.2.patch
          10 kB
          Aaron Myers

          People

            atm Aaron Myers
            tlipcon Todd Lipcon