Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-26685

Improve Path name escaping / unescaping performance

    XMLWordPrintableJSON

Details

    Description

      When escaping / unescaping partition path part names, the existing logic incurs significant avoidable overhead by copying each character sequentially into a new StringBuilder even when no escaping/unescaping is necessary as well as using String.format to escape characters inside of the inner loop.

       

      The included patch to improve the performance of these operations refactors two static method implementations, but requires no external API surface or user-visible behavior changes. This change is applicable and portable to a wide range of Hive versions from branch-0.6 onward when the initial method implementations were added.

      Attachments

        1. HIVE-26685.1.patch
          3 kB
          James Petty

        Issue Links

          Activity

            People

              pettja James Petty
              pettja James Petty
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h 10m
                  1h 10m