  1. IMPALA
  2. IMPALA-3173 Reduce catalog's memory footprint
  3. IMPALA-3198

Store partition location info with respect to partition keys

    Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: Impala 2.5.1
    • Fix Version/s: None
    • Component/s: Catalog
    • Labels: None

      Description

      Partitions in the Catalog already store the literals of their clustering columns. The tables also already store the clustering column names.

      This information is stored redundantly in partition location strings like "hdfs://db/table/part_col_1=part_val_1/part_col_2=part_val_2/". We could save catalog memory by storing the location relative to the partition keys and removing that redundancy.
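
      One way to picture the idea is the sketch below. It is not Impala's catalog code (the catalog is written in Java), and every name in it (Table, Partition, default_location, make_partition, partition_location) is hypothetical. It assumes the default layout is the table's base location followed by one "column=value" directory per clustering column, and it ignores the escaping of special characters in partition values. A partition keeps an explicit location string only when its actual location differs from that default.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Table:
    # Hypothetical stand-in for a catalog table: base path plus clustering column names.
    base_location: str
    clustering_columns: Sequence[str]


@dataclass(frozen=True)
class Partition:
    # The clustering column literals the partition already stores.
    values: Sequence[str]
    # Set only when the partition lives somewhere other than the default path.
    location_override: Optional[str] = None


def default_location(table: Table, values: Sequence[str]) -> str:
    """Build the conventional path base/col_1=val_1/col_2=val_2/ (no escaping, by assumption)."""
    suffix = "/".join(f"{col}={val}" for col, val in zip(table.clustering_columns, values))
    return f"{table.base_location.rstrip('/')}/{suffix}/"


def make_partition(table: Table, values: Sequence[str], location: str) -> Partition:
    """Drop the location string when it can be rebuilt from the keys and values."""
    if location == default_location(table, values):
        return Partition(tuple(values))
    return Partition(tuple(values), location_override=location)


def partition_location(table: Table, partition: Partition) -> str:
    """Return the stored override if present, otherwise recompute the default path."""
    if partition.location_override is not None:
        return partition.location_override
    return default_location(table, partition.values)
```

      With this layout, a partition created at "hdfs://db/table/part_col_1=part_val_1/part_col_2=part_val_2/" stores no location string at all, while a partition at a custom path keeps its full string. The trade-off is a small string-building cost on each lookup in exchange for lower memory per partition.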

            People

            • Assignee: Unassigned
            • Reporter: Jim Apple (jbapple)