Hadoop HDFS / HDFS-854

Datanode should scan devices in parallel to generate block report

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.22.0
    • 0.21.0
    • datanode
    • None
    • Reviewed

    Description

      A datanode should scan its disk devices in parallel to reduce the time needed to generate a block report. This will in turn reduce the startup time of a cluster.
      As an example, a datanode has 12 disks (1 TB each) for storing HDFS blocks, holding a total of 150K blocks. It takes this datanode up to 20 minutes to scan these devices and generate its first block report.
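
The idea described above can be illustrated with a minimal sketch. This is not the code from the attached patches. The class name `ParallelVolumeScan`, the methods `scanVolume` and `scanAll`, and the choice of one thread per volume are all assumptions made for illustration. Block files are recognized only by the `blk_` name prefix, which is a simplification.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration only; not the datanode's actual scanning code.
public class ParallelVolumeScan {

    // Walks one volume directory and returns the names of its block files.
    // Assumption: block files start with "blk_" and metadata files end with ".meta".
    static List<String> scanVolume(Path volumeDir) throws IOException {
        try (Stream<Path> files = Files.walk(volumeDir)) {
            return files
                .filter(Files::isRegularFile)
                .map(p -> p.getFileName().toString())
                .filter(name -> name.startsWith("blk_") && !name.endsWith(".meta"))
                .sorted()
                .collect(Collectors.toList());
        }
    }

    // Scans every volume concurrently, one task per volume, then merges the
    // per-volume results into a single report keyed by volume directory.
    static Map<Path, List<String>> scanAll(List<Path> volumes)
            throws InterruptedException, ExecutionException {
        // Assumption: one thread per disk, so each device is read independently.
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, volumes.size()));
        try {
            Map<Path, Future<List<String>>> pending = new LinkedHashMap<>();
            for (Path volume : volumes) {
                Callable<List<String>> task = () -> scanVolume(volume);
                pending.put(volume, pool.submit(task));
            }
            Map<Path, List<String>> report = new LinkedHashMap<>();
            for (Map.Entry<Path, Future<List<String>>> entry : pending.entrySet()) {
                // get() blocks until that volume's scan has finished.
                report.put(entry.getKey(), entry.getValue().get());
            }
            return report;
        } finally {
            pool.shutdown();
        }
    }
}
```

With a sequential scan, the total time is roughly the sum of the per-disk scan times. With one task per disk, it is closer to the time taken by the slowest disk, provided the disks are independent devices and the scan is I/O bound.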

      Attachments

        1. HDFS-854-2.patch
          10 kB
          Dmytro Molkov
        2. HDFS-854.patch.1
          9 kB
          Dmytro Molkov
        3. HDFS-854.patch
          8 kB
          Dmytro Molkov

            People

              dms Dmytro Molkov
              dhruba Dhruba Borthakur
              Votes: 0
              Watchers: 11
