Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-16910

Fix incorrectly initializing RandomAccessFile caused flush performance decreased for JN

    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      At present, after our cluster backport patch HDFS-15882,
      when set shouldSyncWritesAndSkipFsync to false, there will be flush performance degradation caused by JN.

      Root Cause
      when setting shouldSyncWritesAndSkipFsync to false, the mode of init RandomAccessFile will be `rws`.
      even if fc.force(false) is executed when flushAndSync is executed (hopefully, only requires updates to the file's content to be written to storage and the metadata is not update),
      but since the mode of RandomAccessFile is `rws`, It will requires updates to both the file's content and its metadata to be written,
      there will be flush performance degradation caused by JN.

      Fix:
      Need to update RandomAccessFile's mode from `rws` to `rwd`:

      rwd: Open for reading and writing, as with "rw", and also require that every update to the file's content be written synchronously to the underlying storage device.

      if (shouldSyncWritesAndSkipFsync) {
      rp = new RandomAccessFile(name, "rwd");
      } else {
      rp = new RandomAccessFile(name, "rw");
      }
      

      In this way, when flushAndSync is executed,
      if shouldSyncWritesAndSkipFsync is false and the mode of RandomAccessFile is 'rw', it will call fc.force(false) to execute,
      otherwise should use `rwd` to perform the operation.

      Attachments

        Issue Links

          Activity

            People

              haiyang Hu Haiyang Hu
              haiyang Hu Haiyang Hu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: