Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-3173 Reduce catalog's memory footprint
  3. IMPALA-3198

Store partition location info with respect to partition keys

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • impala 2.5.1
    • None
    • Catalog
    • None

    Description

      Partitions in the Catalog already store the literals of their clustering columns. The tables also already store the clustering column names.

      THis information is stored redundantly in partition location strings like "hdfs://db/table/part_col_1=part_val_1/part_col_2=part_val_2/". We could save catalog memory by reducing that redundancy.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jbapple Jim Apple
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated: