Hadoop HDFS
HDFS-1028

INode.getPathNames could split more efficiently

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.22.0
    • Component/s: namenode
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      INode.getPathNames uses String.split(String), which runs the full Java regex implementation. Since we always split on a single character, we could implement a faster splitter along the lines of StringUtils.split() (but without the escape character). The regex-based split takes a significant amount of CPU during FSImage loading, so this should be a worthwhile speedup.
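
As a rough illustration of the idea in the description, the sketch below splits on a single character with indexOf and substring instead of a regex. It is not the code from the attached patches: the class name PathSplitSketch and the method splitOnChar are hypothetical, and the sketch assumes the goal is to return the same array that String.split(String) returns for a one-character separator (trailing empty strings dropped, input returned unchanged when the separator is absent).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; names are illustrative, not taken from the patch.
final class PathSplitSketch {

    // Splits str on a single separator character without using regex.
    static String[] splitOnChar(String str, char separator) {
        int idx = str.indexOf(separator);
        if (idx == -1) {
            // No separator present: String.split returns the input as-is.
            return new String[] { str };
        }
        List<String> parts = new ArrayList<>();
        int start = 0;
        while (idx != -1) {
            parts.add(str.substring(start, idx));
            start = idx + 1;
            idx = str.indexOf(separator, start);
        }
        // Remainder after the last separator (may be empty).
        parts.add(str.substring(start));

        // String.split(String) drops trailing empty strings; mirror that.
        int end = parts.size();
        while (end > 0 && parts.get(end - 1).isEmpty()) {
            end--;
        }
        return parts.subList(0, end).toArray(new String[0]);
    }
}
```

For example, PathSplitSketch.splitOnChar("/user/todd/file", '/') yields the four components "", "user", "todd", "file", the same result as "/user/todd/file".split("/"). Whether the leading empty component should be kept depends on what the caller expects; the sketch simply preserves the existing behavior.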

      1. HDFS-split.2.patch
        0.8 kB
        Dmytro Molkov
      2. HDFS-split.patch
        2 kB
        Dmytro Molkov

          Activity

          Field: Original Value -> New Value

          dhruba borthakur made changes -
          Status: Patch Available [ 10002 ] -> Resolved [ 5 ]
          Hadoop Flags: (none) -> [Reviewed]
          Fix Version/s: (none) -> 0.22.0 [ 12314241 ]
          Resolution: (none) -> Fixed [ 1 ]
          Dmytro Molkov made changes -
          Status: Open [ 1 ] -> Patch Available [ 10002 ]
          Dmytro Molkov made changes -
          Attachment: (none) -> HDFS-split.2.patch [ 12443789 ]
          Dmytro Molkov made changes -
          Assignee: (none) -> Dmytro Molkov [ dms ]
          Dmytro Molkov made changes -
          Attachment: (none) -> HDFS-split.patch [ 12443787 ]
          Todd Lipcon made changes -
          Link: (none) -> This issue is blocked by HADOOP-6623 [ HADOOP-6623 ]
          Todd Lipcon created issue -

            People

            • Assignee:
              Dmytro Molkov
            • Reporter:
              Todd Lipcon
            • Votes:
              0
            • Watchers:
              7
