Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1443 Improve Datanode startup time
  3. HDFS-270

DFS Upgrade should process dfs.data.dirs in parallel

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • 0.20.2
    • None
    • datanode
    • None
    • datanode startup, volume parallel

    Description

      I just upgraded from 0.14.2 to 0.15.0, and things went very smoothly, if a little slowly.

      The main reason the upgrade took so long was the block upgrades on the datanodes. Each of our datanodes has 3 drives listed for the dfs.data.dir parameter. From looking at the logs, it is fairly clear that the upgrade procedure does not attempt to upgrade all listed dfs.data.dir's in parallel.

      I think even if all of your dfs.data.dir's are on the same physical device, there would still be an advantage to performing the upgrade process in parallel. The less downtime, the better: especially if it is potentially 20 minutes versus 60 minutes.

      Attachments

        Issue Links

          Activity

            People

              hairong Hairong Kuang
              stuhood Stu Hood
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: