IMPALA-9477

Only add required partitions to TDescriptorTable for hdfs table sinks

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: Impala 3.3.0
    • Fix Version/s: None
    • Component/s: Frontend
    • Labels: None

    Description

      In https://github.com/apache/impala/blob/b068c2c/fe/src/main/java/org/apache/impala/analysis/DescriptorTable.java#L196 we fall back to adding the metadata for all partitions of the target table for DML statements.

      I don't think this is necessary if the query specifies a target partition, or if the table has multiple partition keys and the query specifies constants for some of them.

      If a table has a large number of partitions, the memory overhead can be very high because of issues like IMPALA-8835.
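
The proposed narrowing could be sketched roughly as follows. This is a minimal, standalone illustration and not Impala code: the class `PartitionPruningSketch`, the `Partition` type, and the method `requiredPartitions` are all hypothetical names, and the sketch assumes partition-key values can be compared as plain strings. It keeps only the partitions whose key values agree with the constants given in the PARTITION clause, and treats any key without a constant as dynamic (matching every value).

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; none of these names exist in the Impala code base.
final class PartitionPruningSketch {

  /** A partition identified by its partition-key values, e.g. {year=2020, month=1}. */
  static final class Partition {
    final long id;
    final Map<String, String> keyValues;

    Partition(long id, Map<String, String> keyValues) {
      this.id = id;
      this.keyValues = keyValues;
    }
  }

  /**
   * Returns the partitions a table sink could write to, given the constant
   * partition-key values from the PARTITION clause. Keys that are absent from
   * staticKeys are treated as dynamic and match any value.
   */
  static List<Partition> requiredPartitions(
      List<Partition> allPartitions, Map<String, String> staticKeys) {
    List<Partition> result = new ArrayList<>();
    for (Partition p : allPartitions) {
      boolean matches = true;
      for (Map.Entry<String, String> e : staticKeys.entrySet()) {
        if (!e.getValue().equals(p.keyValues.get(e.getKey()))) {
          matches = false;
          break;
        }
      }
      if (matches) result.add(p);
    }
    return result;
  }

  /** Convenience factory for a table partitioned by (year, month). */
  static Partition part(long id, String year, String month) {
    Map<String, String> kv = new LinkedHashMap<>();
    kv.put("year", year);
    kv.put("month", month);
    return new Partition(id, kv);
  }

  public static void main(String[] args) {
    List<Partition> all = List.of(
        part(1, "2019", "12"), part(2, "2020", "1"), part(3, "2020", "2"));
    // Models: INSERT INTO t PARTITION (year=2020, month) ...
    // year is a constant, month is dynamic.
    List<Partition> needed = requiredPartitions(all, Map.of("year", "2020"));
    System.out.println(needed.size() + " of " + all.size() + " partitions needed");
    // prints "2 of 3 partitions needed"
  }
}
```

With a fully specified target partition, the result shrinks to at most one existing partition. Whether the sink can actually work with only this subset is the open question of this issue, so the filter above only shows the intended reduction.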

            People

              Assignee: Unassigned
              Reporter: Tim Armstrong (tarmstrong)