Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-4967

Metadata distribution should be incremental

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • Impala 2.9.0
    • None
    • Catalog
    • None

    Description

      When running REFRESH table partition (x=1) the metadata distribution contains all partitions, not just the single partition that was refreshed. This creates additional load and delays refreshes.

      To solve metadata updates might have to be serialized (assign unique #, and apply in order), but this behavior would help the timeliness of data and reduce load on the cluster.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              PeterEbert Peter Ebert
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: