Hadoop Common / HADOOP-4935

Manual leaving of safe mode may lead to data loss


Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.18.3
    • 0.18.3
    • None
    • None
    • Reviewed

    Description

      Due to HADOOP-4610, the NameNode calculates mis-replicated blocks when it leaves safe mode manually, and it clears the pending deletion queue before doing the calculation. This works fine when the NameNode has just started, but it introduces a bug when the NameNode has been running for a while. Once the pending deletion queue is cleared, the NameNode can no longer distinguish valid replicas from invalid ones, i.e., replicas that have already been scheduled or dispatched for deletion. As a result, the NameNode may mistakenly decide that a block is over-replicated and choose all of its valid replicas for deletion.
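
The failure mode described above can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the NameNode code: the class `MisReplicationSketch`, the method `survivingReplicas`, and the datanode names are hypothetical. The model also assumes that every queued deletion is eventually carried out by its datanode, and that the excess-replica chooser happens to pick the valid replicas first (the worst case the description warns about).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical toy model; none of these names come from the Hadoop code base.
final class MisReplicationSketch {

    /**
     * Counts the replicas of a single block that survive the over-replication check.
     *
     * @param replicas          datanodes the NameNode still lists as holding the block
     * @param pendingDeletion   replicas already scheduled or dispatched for deletion
     * @param targetReplication desired replication factor
     * @param clearQueueFirst   true models the bug: the queue is cleared before the check
     */
    static int survivingReplicas(List<String> replicas, Set<String> pendingDeletion,
                                 int targetReplication, boolean clearQueueFirst) {
        // Assumption: these deletions happen on the datanodes no matter what
        // the NameNode remembers afterwards.
        Set<String> alreadyDoomed = new HashSet<>(pendingDeletion);

        // The NameNode's own view of the pending deletion queue.
        Set<String> queue = new HashSet<>(pendingDeletion);
        if (clearQueueFirst) {
            queue.clear(); // the NameNode forgets which replicas are invalid
        }

        // Replicas the NameNode believes are live.
        List<String> consideredLive = new ArrayList<>();
        for (String r : replicas) {
            if (!queue.contains(r)) {
                consideredLive.add(r);
            }
        }

        // Worst-case stand-in for the excess-replica chooser: take from the front.
        Set<String> newlyChosen = new HashSet<>();
        int excess = consideredLive.size() - targetReplication;
        for (int i = 0; i < excess; i++) {
            newlyChosen.add(consideredLive.get(i));
        }

        // A replica survives only if neither the old nor the new deletions hit it.
        int surviving = 0;
        for (String r : replicas) {
            if (!alreadyDoomed.contains(r) && !newlyChosen.contains(r)) {
                surviving++;
            }
        }
        return surviving;
    }

    public static void main(String[] args) {
        List<String> replicas = Arrays.asList("dn1", "dn2", "dn3", "dn4", "dn5", "dn6");
        Set<String> pending = new HashSet<>(Arrays.asList("dn4", "dn5", "dn6"));
        System.out.println(survivingReplicas(replicas, pending, 3, true));  // prints 0
        System.out.println(survivingReplicas(replicas, pending, 3, false)); // prints 3
    }
}
```

With the queue cleared first, the model sees six live replicas for a block that should have three, selects the three valid ones as excess, and the block ends up with no replicas. With the queue kept, the three invalid replicas are excluded from the count and no valid replica is touched.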

      Attachments

        1. misReplBlocks-0-18.patch
          0.6 kB
          Konstantin Shvachko
        2. misReplBlocks.patch
          0.7 kB
          Konstantin Shvachko


          People

            shv Konstantin Shvachko
            hairong Hairong Kuang
            Votes: 0
            Watchers: 2
