Hive
  1. Hive
  2. HIVE-2097

Explore mechanisms for better compression with RC Files

    Details

      Description

      Optimization of the compression mechanisms used by RC File to be explored.

      Some initial ideas

      1. More efficient serialization/deserialization based on type-specific and storage-specific knowledge.

      For instance, storing sorted numeric values efficiently using some delta coding techniques

      2. More efficient compression based on type-specific and storage-specific knowledge

      Enable compression codecs to be specified based on types or individual columns

      3. Reordering the on-disk storage for better compression efficiency.

      1. datacomp.tar.gz
        23 kB
        Krishna Kumar

        Issue Links

          Activity

            People

            • Assignee:
              Krishna Kumar
              Reporter:
              Krishna Kumar
            • Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

              • Created:
                Updated:

                Development