Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-7219

Improve performance of serialization utils in ORC

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.14.0
    • 0.14.0
    • File Formats
    • None

    Description

      ORC uses serialization utils heavily for reading and writing data. The bitpacking and unpacking code in writeInts() and readInts() can be unrolled for better performance. Also double reader/writer performance can be improved by bulk reading/writing from/to byte array.

      Attachments

        1. HIVE-7219.1.patch
          79 kB
          Prasanth Jayachandran
        2. HIVE-7219.2.patch
          100 kB
          Prasanth Jayachandran
        3. HIVE-7219.3.patch
          100 kB
          Prasanth Jayachandran
        4. HIVE-7219.4.patch
          161 kB
          Prasanth Jayachandran
        5. orc-read-perf-jmh-benchmark.png
          403 kB
          Prasanth Jayachandran

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            prasanth_j Prasanth Jayachandran Assign to me
            prasanth_j Prasanth Jayachandran
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment