Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-15547

Dynamic disk-level tiering

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • datanode
    • None

    Description

      This is a proposal for a new use case based on archival storage, to allow configuring DISK and ARCHIVE storage types on the same device (filesystem) to balance disk IO for disks with different density.

      The proposal is to mainly solve two problems:

      1) The disk IO of ARCHIVE disks is underutilized. This is normal in many use cases where the data hotness is highly skewed.

      2) Over the years, as better/cheaper hard drives showing on the market, a large production environment can have mixed disk densities. For example, in our prod environment, we have 2TB, 4TB, 8TB, and 16TB disks. When putting all different HDDs into the cluster, we should be able to utilize disk capacity and disk IO efficiently for all of them.

      When moving blocks from DISK to ARCHIVE, we can prefer the same disk and simply rename the files instead of copying.

      Attachments

        Issue Links

          Activity

            People

              LeonG Leon Gao
              LeonG Leon Gao
              Votes:
              0 Vote for this issue
              Watchers:
              16 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 18h 40m
                  18h 40m