Hadoop HDFS / HDFS-15714

HDFS Provided Storage Read/Write Mount Support On-the-fly


Details

    Description

      HDFS Provided Storage (PS) is a feature for tiering HDFS over other file systems. HDFS-9806 introduced the PROVIDED storage type to HDFS. By configuring external storage with the PROVIDED tag on a DataNode, users can let applications access externally stored data through HDFS. However, two issues still need to be addressed. First, mounting external storage on the fly (dynamic mount) is not supported, although it is needed to combine HDFS with external storage flexibly at runtime. Second, current HDFS does not support PS writes, yet in real applications it is common to transfer data in both directions, for read and write, between HDFS and external storage.
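
      For readers unfamiliar with the existing static setup, the fragment below is a minimal sketch of how a PROVIDED volume is tagged in hdfs-site.xml today. The local path and the remoteFS URI are placeholders, not values from this JIRA, and a working deployment also needs alias map settings that are omitted here.

```xml
<!-- Sketch only: paths and the remote URI are placeholder assumptions. -->
<configuration>
  <!-- Enable PROVIDED storage on the NameNode. -->
  <property>
    <name>dfs.namenode.provided.enabled</name>
    <value>true</value>
  </property>
  <!-- On the DataNode, tag the external storage with [PROVIDED]
       alongside a regular local [DISK] volume. -->
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>[DISK]/local/path/to/blocks/,[PROVIDED]remoteFS://remoteFS-authority/path/to/data/</value>
  </property>
</configuration>
```

      Because this configuration is read at startup, changing the mounted storage means editing the file and restarting, which is the limitation that dynamic mount is meant to remove.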

      Through this JIRA, we present our work on PS write support and on dynamic mount support for both read and write. Please note that several JIRAs have already been filed in the community on these topics. Our work builds on that previous community work, with a new design and implementation that supports a so-called writeBack mount and enables admins to add any mount on the fly. We appreciate those folks in the community for their great contributions! See their pending JIRAs: HDFS-14805 and HDFS-12090.

      Attachments

        1. HDFS-15714-02.patch
          1.35 MB
          Feilong He
        2. HDFS-15714-01.patch
          1.35 MB
          Feilong He
        3. HDFS_Provided_Storage_Performance-V1.pdf
          317 kB
          Feilong He
        4. HDFS_Provided_Storage_Design-V1.pdf
          269 kB
          Feilong He

            People

              PhiloHe Feilong He
              Votes: 0
              Watchers: 15

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 2h 50m