Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-6994

Avoid reloading a table's HMS data for file-only operations

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • Impala 2.12.0
    • None
    • Catalog
    • None
    • ghx-label-4

    Description

      Reloading file metadata for HDFS tables (e.g. as a final step in an 'insert') is done via
      https://github.com/apache/impala/blob/branch-2.12.0/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java#L628
      , which calls
      https://github.com/apache/impala/blob/branch-2.12.0/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java#L1243

      HdfsTable.load has no option to only load file metadata. HMS metadata will also be reloaded every time, which is an unnecessary overhead (and potential point of failure) when adding files to existing locations.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jeszyb Balazs Jeszenszky
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: