Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-779

Automatic move to safe-mode when cluster size drops

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • namenode
    • None

    Description

      As part of looking at using Kerberos, we want to avoid the case where both the primary (and optional secondary) KDC go offline causing a replication storm as the DataNodes' service tickets time out and they lose the ability to connect to the NameNode. However, this is a specific case of a more general problem of loosing too many nodes too quickly. I think we should have an option to go into safe mode if the cluster size goes down more than N% in terms of DataNodes.

      Attachments

        Issue Links

          Activity

            People

              dhruba Dhruba Borthakur
              omalley Owen O'Malley
              Votes:
              0 Vote for this issue
              Watchers:
              19 Start watching this issue

              Dates

                Created:
                Updated: