Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1850

DN should transmit absolute failed volume count rather than increments to the NN

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.22.0
    • datanode, namenode
    • None
    • Reviewed

    Description

      The API added in HDFS-811 for the DN to report volume failures to the NN is "inc(DN)". However the given sequence of events will result in the NN forgetting about reported failed volumes:

      1. DN loses a volume and reports it
      2. NN restarts
      3. DN re-registers to the new NN

      A more robust interface would be to have the DN report the total number of volume failures to the NN each heart beat (the same way other volume state is transmitted).

      Attachments

        1. hdfs-1850-1.patch
          42 kB
          Eli Collins
        2. hdfs-1850-2.patch
          49 kB
          Eli Collins
        3. hdfs-1850-3.patch
          49 kB
          Eli Collins
        4. hdfs-1850-4.patch
          49 kB
          Eli Collins
        5. hdfs-1850-5.patch
          48 kB
          Eli Collins
        6. hdfs-1850-6.patch
          48 kB
          Eli Collins
        7. hdfs-1850-7.patch
          48 kB
          Eli Collins

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            eli Eli Collins
            eli Eli Collins
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment