Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-2097

Explore mechanisms for better compression with RC Files

    XMLWordPrintableJSON

Details

    Description

      Optimization of the compression mechanisms used by RC File to be explored.

      Some initial ideas

      1. More efficient serialization/deserialization based on type-specific and storage-specific knowledge.

      For instance, storing sorted numeric values efficiently using some delta coding techniques

      2. More efficient compression based on type-specific and storage-specific knowledge

      Enable compression codecs to be specified based on types or individual columns

      3. Reordering the on-disk storage for better compression efficiency.

      Attachments

        1. datacomp.tar.gz
          23 kB
          Krishna Kumar

        Issue Links

          Activity

            People

              n_krishna_kumar Krishna Kumar
              n_krishna_kumar Krishna Kumar
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated: