Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1068

Reduce NameNode GC by reusing HdfsFileStatus objects in RPC handlers

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • None
    • namenode
    • None

    Description

      In our production clusters, getFileInfo is the most frequent operation that hit NameNode, and its frequency is highly correlated to the GC behavior. HDFS-946 has already reduced the amount of heap/cpu and the number of temporary objects for each getFileInfo call. Yet another improvement is to avoid creation of a HdfsFileStatus object for each getFileInfo call. Instead each RPC handler can have a thread local HdfsFileStatus object. Each getFileInfo call simply sets values for all fields of the thread local HdfsFileStatus object.

      Attachments

        1. Screen Shot 2017-08-31 at 3.58.15 PM.png
          782 kB
          Zhe Zhang
        2. HDFS-1068.00.patch
          9 kB
          Zhe Zhang

        Issue Links

          Activity

            People

              zhz Zhe Zhang
              hairong Hairong Kuang
              Votes:
              0 Vote for this issue
              Watchers:
              19 Start watching this issue

              Dates

                Created:
                Updated: