Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-7928

Scanning blocks from disk during rolling upgrade startup takes a lot of time if disks are busy

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.6.0
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: datanode
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      We observed this issue in rolling upgrade to 2.6.x on one of our cluster.
      One of the disks was very busy and it took long time to scan that disk compared to other disks.
      Seeing the sar (System Activity Reporter) data we saw that the particular disk was very busy performing IO operations.
      Requesting for an improvement during datanode rolling upgrade.
      During shutdown, we can persist the whole volume map on the disk and let the datanode read that file and create the volume map during startup after rolling upgrade.
      This will not require the datanode process to scan all the disk and read the block.
      This will significantly improve the datanode startup time.

        Attachments

        1. HDFS-7928.patch
          21 kB
          Rushabh S Shah
        2. HDFS-7928-v1.patch
          27 kB
          Rushabh S Shah
        3. HDFS-7928-v2.patch
          35 kB
          Rushabh S Shah

          Activity

            People

            • Assignee:
              shahrs87 Rushabh S Shah
              Reporter:
              shahrs87 Rushabh S Shah
            • Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: