
Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • None
    • None
    • erasure-coding
    • None

    Description

      According to the current design doc for hflush support in erasure coding (see HDFS-7661), the parity datanode (DN) needs an undo log for flush operations. After an hflush/hsync, the last cell will be overwritten when 1) the current stripe becomes full, 2) the file is closed, or 3) hflush/hsync is called again on the still non-full stripe. To serve new reader clients and to tolerate failures between a successful hflush/hsync and the subsequent overwrite, the parity DN should preserve the old cell in the undo log before overwriting it.

      Parity data is computed for a particular block group (BG) length, and parity data for different BG lengths may have the same block length, so the block length alone does not identify which BG length a parity cell corresponds to. The undo log should therefore also save the BG length for the flushed data.
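
      To illustrate why the block length alone is ambiguous, here is a small arithmetic sketch. It assumes a striped layout in which a parity cell of a partial stripe is as long as that stripe's first data cell, plus an RS(6,3)-style schema with 64 KB cells; the class and method names are hypothetical and not taken from the HDFS code base.

```java
public class ParityLengthExample {
    /**
     * Length of one parity internal block for a block group that holds
     * bgLength bytes of user data (assumed layout, see the note above).
     */
    public static long parityBlockLength(long bgLength, int cellSize, int dataBlocks) {
        long stripeSize = (long) cellSize * dataBlocks;
        long fullStripes = bgLength / stripeSize;
        long lastStripeLen = bgLength % stripeSize;
        // In a partial stripe, the parity cell is as long as the first
        // (longest) data cell of that stripe.
        return fullStripes * cellSize + Math.min(cellSize, lastStripeLen);
    }

    public static void main(String[] args) {
        int cellSize = 64 * 1024;
        int dataBlocks = 6;
        // Two different BG lengths, both past the first cell of the stripe.
        long a = parityBlockLength(100 * 1024, cellSize, dataBlocks);
        long b = parityBlockLength(120 * 1024, cellSize, dataBlocks);
        System.out.println(a + " " + b); // prints "65536 65536"
    }
}
```

      Both BG lengths give a 64 KB parity block even though the parity content differs, which is why a reader or a recovery process would need the BG length stored next to the preserved cell.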

      This JIRA tracks the design and implementation of an undo log on the parity DN to support hflush/hsync operations.
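
      One way to picture such an undo log is the in-memory sketch below. It is not the design from the attached patches: the class, the entry layout, and the method names are all hypothetical, the parity block is modelled as a plain byte array, and the old and new cells are assumed to have the same length for simplicity.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;

/** Hypothetical in-memory model of a parity block with an undo log. */
public class ParityUndoLogSketch {
    /** A preserved cell plus the BG length its parity data was computed for. */
    static final class Entry {
        final long bgLength;
        final int offset;
        final byte[] oldCell;

        Entry(long bgLength, int offset, byte[] oldCell) {
            this.bgLength = bgLength;
            this.offset = offset;
            this.oldCell = oldCell;
        }
    }

    private final byte[] parityBlock;
    private final Deque<Entry> undoLog = new ArrayDeque<>();

    public ParityUndoLogSketch(int capacity) {
        this.parityBlock = new byte[capacity];
    }

    /** Preserve the current cell (valid for oldBgLength), then overwrite it. */
    public void overwriteCell(int offset, byte[] newCell, long oldBgLength) {
        byte[] old = Arrays.copyOfRange(parityBlock, offset, offset + newCell.length);
        undoLog.push(new Entry(oldBgLength, offset, old));
        System.arraycopy(newCell, 0, parityBlock, offset, newCell.length);
    }

    /** Roll back the latest overwrite; returns the restored BG length, or -1 if the log is empty. */
    public long undo() {
        Entry e = undoLog.poll();
        if (e == null) {
            return -1;
        }
        System.arraycopy(e.oldCell, 0, parityBlock, e.offset, e.oldCell.length);
        return e.bgLength;
    }

    /** Copy of the bytes currently stored at the given range. */
    public byte[] readCell(int offset, int len) {
        return Arrays.copyOfRange(parityBlock, offset, offset + len);
    }

    public static void main(String[] args) {
        ParityUndoLogSketch dn = new ParityUndoLogSketch(4);
        dn.overwriteCell(0, new byte[] {1, 1, 1, 1}, 0L);   // first hflush: nothing flushed before
        dn.overwriteCell(0, new byte[] {2, 2, 2, 2}, 100L); // second hflush: old cell was for BG length 100
        long restored = dn.undo();                          // simulate a failure after the overwrite
        System.out.println(restored + " " + Arrays.toString(dn.readCell(0, 4))); // prints "100 [1, 1, 1, 1]"
    }
}
```

      A real implementation would have to persist the log on disk and decide when entries can be discarded; those questions are left open here.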

      Attachments

        1. HDFS-10201.wip.patch
          66 kB
          Mingliang Liu
        2. HDFS-10201-demo.patch
          10 kB
          Rui Gao

          People

            demongaorui Rui Gao
            liuml07 Mingliang Liu
            Votes: 0
            Watchers: 5
