  1. ORC
  2. ORC-1017

Create a new tool that summarizes the size of a file by column

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.7.2
    • Fix Version/s: 1.7.2
    • Component/s: None
    • Labels: None

    Description

      I want a tool that summarizes how the space inside an ORC file is used. In particular, it should report the space taken by each column, the indexes, the file footer, and the stripe footers.

      The output for orc_split_elim_new.orc is:

      Percent  Bytes/Row  Name
        46.79       0.04  subtype
        17.49       0.02  _file_footer
        16.57       0.02  _index
         7.01       0.01  decimal1
         5.05       0.00  _stripe_footer
         2.84       0.00  string1
         2.59       0.00  ts
         1.67       0.00  userid
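
      The arithmetic behind a report like the one above can be sketched in a few lines of Python. This is only an illustration, not the tool's actual implementation: the function names `summarize_sizes` and `format_report` are hypothetical, the byte counts are made up, and it assumes that Bytes/Row means a section's byte count divided by the file's row count and that rows are listed from largest to smallest share.

```python
from typing import Dict, List, Tuple


def summarize_sizes(bytes_by_name: Dict[str, int],
                    row_count: int) -> List[Tuple[float, float, str]]:
    """Return (percent, bytes_per_row, name) tuples, largest share first.

    Assumption: bytes_by_name maps each column or metadata section
    (for example "_index" or "_file_footer") to its size in bytes.
    """
    total = sum(bytes_by_name.values())
    if total <= 0 or row_count <= 0:
        return []
    rows = [
        (100.0 * size / total, size / row_count, name)
        for name, size in bytes_by_name.items()
    ]
    # Sort by descending percentage; break ties by name for stable output.
    rows.sort(key=lambda r: (-r[0], r[2]))
    return rows


def format_report(rows: List[Tuple[float, float, str]]) -> str:
    """Render the rows as a plain-text table with a header line."""
    lines = ["Percent  Bytes/Row  Name"]
    for percent, per_row, name in rows:
        lines.append(f"{percent:7.2f}  {per_row:9.2f}  {name}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical byte counts, not measured from any real ORC file.
    sample = {"col_a": 600, "_index": 250, "_file_footer": 100, "_stripe_footer": 50}
    print(format_report(summarize_sizes(sample, row_count=1000)))
```

      Collecting the per-section byte counts from a real file is the part the tool itself has to do; the sketch only covers turning those counts into the Percent and Bytes/Row columns.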

            People

              Assignee: Owen O'Malley
              Reporter: Owen O'Malley