Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-3764

Hadoop fsck reports files with blocks w/o live replicas as corrupt

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 2.0.0-alpha
    • None
    • None
    • None

    Description

      fsck calls FSN#listCorruptFileBlocks, this returns info for files where there's no live replicas, which doesn't necessarily indicate the file is "corrupt". Let's make fsck distinguish between:

      • Blocks w/o any lives replicas (mostly the case when repl=1 and a DN goes down)
      • Blocks where are live replicas are/were corrupt

      Perhaps this could leverage the stats from HDFS-2554.

      Attachments

        Issue Links

          Activity

            People

              adi2 Andy Isaacson
              eli Eli Collins
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: