Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
ghx-label-4
Description
In catalogd, we keep one CachedHmsPartitionDescriptor for each HdfsPartition. Many fields in it could be identical, e.g. sdBucketCols, sdSortCols. We can keep different CachedHmsPartitionDescriptor in HdfsTable instead and share them to the HdfsPartition. For fields that differs across partitions, e.g. msCreateTime, msLastAccessTime, we can move them to HdfsPartition.
// TODO: Cache this descriptor in HdfsTable so that identical descriptors are shared // between HdfsPartition instances. // TODO: sdInputFormat and sdOutputFormat can be mutated by Impala when the file format // of a partition changes; move these fields to HdfsPartition. private static class CachedHmsPartitionDescriptor {