Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-16985

Fix data missing issue when delete local block file.

    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      We encounterd several missing-block problem in our production cluster which  hdfs  running on AWS EC2 + EBS.

      The root cause:

      1. the block remains only 1 replication left and hasn't been reconstruction
      2. DN checks block file existing when BlockSender construction
      3. the EBS checking failed and throw FileNotFoundException (EBS may be in fault condition)
      4. DN invalidateBlock and schedule block  async deletion
      5. EBS already back to normal when DN do delete block
      6. the block file be delete permanently and can't be recovered

      Attachments

        Issue Links

          Activity

            People

              smarthan Chengwei Wang
              smarthan Chengwei Wang
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: