Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-6732

Improve FsShell's heap consumption by switching to listStatus that returns an iterator

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Incomplete
    • None
    • None
    • None
    • None

    Description

      When listing a large directory from the command line using the default heap configuration, FsShell often runs out of memory. This is because all stats of the entries under the directory need to be in memory before printing them. The new API listStatus that returns an iterator of FileStatus, which implemented in HDFS-1091, no longer requires that all entries are fetched first. Thus switching to this new API will greatly improve the use of heap space.

      Attachments

        Issue Links

          Activity

            People

              daryn Daryn Sharp
              hairong Hairong Kuang
              Votes:
              1 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: