IMPALA-9841

Table-level writeIdList should represent the file-metadata snapshot

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Catalog
    • Labels: None

    Description

      Currently, HdfsTable (https://github.com/apache/impala/blob/master/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java#L1017) reloads the ValidWriteIdList on every load. During an incremental refresh, however, the file metadata may not be reloaded at all, or may be reloaded only for certain partitions. As a result, the table's cached ValidWriteIdList can be out of sync with the file metadata that is actually cached in the catalog.

      This is currently not a problem, since we only rely on the ValidWriteIdList while loading the file metadata. However, for consistent reads of transactional tables, we would need to guarantee that this ValidWriteIdList represents the cached files.
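
      The mismatch described above can be illustrated with a small, self-contained Java sketch. It is not Impala code: the class WriteIdSnapshotSketch and all of its methods are hypothetical, and a real ValidWriteIdList (which also tracks open and aborted write IDs) is simplified here to a single high-water mark. The sketch contrasts a table-level value that is overwritten on every refresh with one conservative alternative, which takes the minimum write ID at which any cached partition's files were loaded.

```java
import java.util.Collection;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; this is NOT Impala's HdfsTable.
// Assumption: a ValidWriteIdList is reduced to one long high-water mark.
class WriteIdSnapshotSketch {
  // Write ID high-water mark at which each partition's files were last loaded.
  private final Map<String, Long> partitionLoadedAt = new HashMap<>();
  // Table-level value as described in the issue: overwritten on every refresh.
  private long eagerTableWriteId = 0;

  WriteIdSnapshotSketch(Collection<String> partitions) {
    for (String p : partitions) partitionLoadedAt.put(p, 0L);
  }

  // Simulates a refresh that reloads file metadata only for some partitions.
  void refresh(long latestWriteId, Collection<String> reloadedPartitions) {
    eagerTableWriteId = latestWriteId;
    for (String p : reloadedPartitions) {
      if (partitionLoadedAt.containsKey(p)) {
        partitionLoadedAt.put(p, latestWriteId);
      }
    }
  }

  long eagerTableWriteId() {
    return eagerTableWriteId;
  }

  // Conservative table-level value: the oldest write ID that any cached
  // partition's file metadata was loaded with.
  long consistentTableWriteId() {
    if (partitionLoadedAt.isEmpty()) return eagerTableWriteId;
    long min = Long.MAX_VALUE;
    for (long w : partitionLoadedAt.values()) min = Math.min(min, w);
    return min;
  }

  public static void main(String[] args) {
    WriteIdSnapshotSketch t = new WriteIdSnapshotSketch(List.of("p=1", "p=2"));
    t.refresh(5, List.of("p=1", "p=2")); // full load
    t.refresh(8, List.of("p=1"));        // incremental: only p=1 reloaded
    // prints "8 5": the eager value has moved past what p=2's files reflect
    System.out.println(t.eagerTableWriteId() + " " + t.consistentTableWriteId());
  }
}
```

      In this toy model, the eager value (8) claims more than the cached files for p=2 can back up, while the minimum (5) never runs ahead of any cached partition. Whether a minimum, per-partition tracking, or some other scheme would suit the catalog is left open by the issue.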

            People

              Assignee: Vihang Karajgaonkar (vihangk1)
              Reporter: Vihang Karajgaonkar (vihangk1)