HIVE-7219: Improve performance of serialization utils in ORC

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.14.0
    • Fix Version/s: 0.14.0
    • Component/s: File Formats
    • Labels: None

    Description

      ORC relies heavily on its serialization utils for reading and writing data. The bit-packing and unpacking loops in writeInts() and readInts() can be unrolled for better performance. Double reader/writer performance can also be improved by reading from and writing to a byte array in bulk instead of one byte at a time.
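
The two ideas in the description can be illustrated with a minimal sketch. This is not the code from the attached patches: the class name SerializationSketch and the methods unpackGeneric, unpack4, readDoubles, and writeDoubles are hypothetical, and the big-endian bit order within a byte and the little-endian byte order for doubles are assumptions made for the example. It contrasts a generic bit-at-a-time unpack loop with a version unrolled for one fixed bit width, and shows doubles being moved to and from a byte array in a single bulk operation.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

final class SerializationSketch {

    // Generic path: unpack `count` values of `bitWidth` bits each,
    // assembling every value one bit at a time (most significant bit first).
    static void unpackGeneric(byte[] in, int bitWidth, long[] out, int count) {
        int bitPos = 0;
        for (int i = 0; i < count; i++) {
            long v = 0;
            for (int b = 0; b < bitWidth; b++, bitPos++) {
                int bit = (in[bitPos >>> 3] >>> (7 - (bitPos & 7))) & 1;
                v = (v << 1) | bit;
            }
            out[i] = v;
        }
    }

    // Unrolled path for the 4-bit case: each input byte yields exactly two
    // values, so the inner per-bit loop disappears entirely.
    static void unpack4(byte[] in, long[] out, int count) {
        int i = 0;
        int p = 0;
        for (; i + 1 < count; i += 2, p++) {
            int b = in[p] & 0xFF;
            out[i] = b >>> 4;       // high nibble
            out[i + 1] = b & 0x0F;  // low nibble
        }
        if (i < count) {            // odd count: one trailing high nibble
            out[i] = (in[p] & 0xFF) >>> 4;
        }
    }

    // Bulk read: decode `count` doubles from a byte array in one call
    // instead of assembling each double from eight single-byte reads.
    // Little-endian byte order is an assumption of this sketch.
    static double[] readDoubles(byte[] in, int count) {
        double[] out = new double[count];
        ByteBuffer.wrap(in, 0, count * 8)
                .order(ByteOrder.LITTLE_ENDIAN)
                .asDoubleBuffer()
                .get(out);
        return out;
    }

    // Bulk write: encode all doubles into one byte array in a single call.
    static byte[] writeDoubles(double[] values) {
        ByteBuffer buf = ByteBuffer.allocate(values.length * 8)
                .order(ByteOrder.LITTLE_ENDIAN);
        buf.asDoubleBuffer().put(values);
        return buf.array();
    }
}
```

In a sketch like this, a dispatcher would typically switch on the bit width and call the matching unrolled routine, falling back to the generic loop for widths without a specialized version. Whether the attached patches take that exact shape is not stated in this issue.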

      Attachments

        1. HIVE-7219.1.patch (79 kB, Prasanth Jayachandran)
        2. orc-read-perf-jmh-benchmark.png (403 kB, Prasanth Jayachandran)
        3. HIVE-7219.2.patch (100 kB, Prasanth Jayachandran)
        4. HIVE-7219.3.patch (100 kB, Prasanth Jayachandran)
        5. HIVE-7219.4.patch (161 kB, Prasanth Jayachandran)

            People

              Assignee: Prasanth Jayachandran (prasanth_j)
              Reporter: Prasanth Jayachandran (prasanth_j)
              Votes: 0
              Watchers: 7