  1. Hadoop HDFS
  2. HDFS-2126 Improve Namenode startup time [umbrella task]
  3. HDFS-1391

Exiting safemode takes a long time when there are lots of blocks in the HDFS

Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: namenode
    • Labels: None

    Description

      When the namenode decides to exit safemode, it acquires the FSNamesystem lock and then iterates over every block in the blocksmap to check whether any block has excess replicas. On a cluster with 100 million blocks, this scan takes upwards of 5 minutes, which significantly delays namenode restart.
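
      The pattern described above can be illustrated with a toy model. This is only a sketch under stated assumptions: `ExcessReplicaScan`, its block map, and both scan methods are hypothetical names invented for illustration and are not the real FSNamesystem or blocks map code. `scanHoldingLock` mimics the reported behaviour (one lock hold for the entire scan), while `scanInBatches` shows one possible mitigation, releasing the lock between fixed-size batches so other operations can interleave. It is not necessarily the approach taken in the attached patches.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.locks.ReentrantLock;

/** Toy model only; NOT the real FSNamesystem or blocks map API. */
public class ExcessReplicaScan {
    // Stands in for the single namesystem-wide lock (assumption).
    private final ReentrantLock lock = new ReentrantLock();
    // Hypothetical simplified block map: blockId -> current replica count.
    private final TreeMap<Long, Integer> blocks = new TreeMap<>();
    private final int targetReplication;

    public ExcessReplicaScan(int targetReplication) {
        this.targetReplication = targetReplication;
    }

    public void addBlock(long blockId, int replicas) {
        lock.lock();
        try {
            blocks.put(blockId, replicas);
        } finally {
            lock.unlock();
        }
    }

    /** Reported pattern: the lock is held for the whole scan. */
    public List<Long> scanHoldingLock() {
        List<Long> excess = new ArrayList<>();
        lock.lock();
        try {
            for (Map.Entry<Long, Integer> e : blocks.entrySet()) {
                if (e.getValue() > targetReplication) {
                    excess.add(e.getKey());
                }
            }
        } finally {
            lock.unlock();
        }
        return excess;
    }

    /** Possible mitigation: scan in batches, dropping the lock in between. */
    public List<Long> scanInBatches(int batchSize) {
        if (batchSize < 1) {
            throw new IllegalArgumentException("batchSize must be >= 1");
        }
        List<Long> excess = new ArrayList<>();
        Long lastKey = null;   // last block id already examined
        boolean more = true;
        while (more) {
            lock.lock();
            try {
                // Resume strictly after the last key seen, so changes made
                // while the lock was released cannot break the iteration.
                Map<Long, Integer> rest =
                    (lastKey == null) ? blocks : blocks.tailMap(lastKey, false);
                int seen = 0;
                more = false;
                for (Map.Entry<Long, Integer> e : rest.entrySet()) {
                    if (seen == batchSize) {
                        more = true;   // blocks remain; continue after unlock
                        break;
                    }
                    if (e.getValue() > targetReplication) {
                        excess.add(e.getKey());
                    }
                    lastKey = e.getKey();
                    seen++;
                }
            } finally {
                lock.unlock();
            }
        }
        return excess;
    }
}
```

      The batched variant trades a consistent snapshot for shorter lock holds: blocks added or changed between batches may or may not be observed, so a real implementation would have to decide whether that is acceptable when leaving safemode.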

      Attachments

        1. excessReplicas.1_trunk.txt
          16 kB
          Dhruba Borthakur
        2. excessReplicas2.txt
          16 kB
          Dhruba Borthakur
        3. excessReplicas3.txt
          18 kB
          Dhruba Borthakur
        4. excessReplicas5.txt
          19 kB
          Dhruba Borthakur
        5. 1391_excessReplicas5_comments.txt
          4 kB
          Matthew Foley

          People

            Assignee: Dhruba Borthakur
            Reporter: Dhruba Borthakur
            Votes: 0
            Watchers: 11

            Dates

              Created:
              Updated: