Description
1. Create a table as below:
CREATE TABLE hive_partition.two_partition (
  id INT,
  name STRING
)
PARTITIONED BY (
  day INT,
  hour INT
)
WITH SERDEPROPERTIES ('serialization.format'='1')
STORED AS ORC
LOCATION 'hdfs://ly-pfs/hive/hive_partition/two_partition'
2. Then create the following directories:
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=0
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=00
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=01
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=02
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=03
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=04
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=05
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=06
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=07
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=08
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=09
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=1
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=10
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=11
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=12
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=13
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=14
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=15
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=16
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=17
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=18
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=19
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=2
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=20
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=21
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=22
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=23
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=3
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=4
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=5
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=6
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=7
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=8
hdfs://ly-pfs/hive/hive_partition/two_partition/day=20240613/hour=9
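The directory list mixes unpadded (hour=0) and zero-padded (hour=00) names for a column declared as INT. The sketch below is only an illustration, assuming the hour value of each directory is interpreted as an integer; it is not taken from the Impala code base, and the helper names are hypothetical. It counts how many of the directories would end up with the same typed value.

```python
from collections import defaultdict


def hour_dirs():
    """Hour suffixes from step 2: hour=0..23 plus zero-padded hour=00..09."""
    unpadded = [str(h) for h in range(24)]
    padded = [f"{h:02d}" for h in range(10)]
    return unpadded + padded


def group_by_int_value(names):
    """Group directory suffixes by the integer they parse to (assumed INT semantics)."""
    groups = defaultdict(list)
    for name in names:
        groups[int(name)].append(name)
    return dict(groups)


dirs = hour_dirs()
groups = group_by_int_value(dirs)
# Integer values that are backed by more than one directory name.
collisions = {value: names for value, names in groups.items() if len(names) > 1}

print(f"directories: {len(dirs)}")
print(f"distinct INT values: {len(groups)}")
print(f"values with more than one directory: {len(collisions)}")
```

Under this assumption, 34 directories map to 24 distinct values, and 10 values are backed by two directories each. Whether this is what drives the behaviour described in this report is not confirmed here.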
3. Execute REFRESH hive_partition.two_partition several times.
On Impala 3.4.0, the total number of partitions grows with each refresh: the partition count grew from 34 to 74 after three refreshes.
I0617 17:01:36.244355 18605 CatalogServiceCatalog.java:2225] Refreshing table metadata: hive_partition.two_partition
I0617 17:01:38.033699 18605 HdfsTable.java:995] Reloading metadata for table definition and all partition(s) of hive_partition.two_partition (REFRESH issued by root)
I0617 17:01:39.245016 18605 ParallelFileMetadataLoader.java:147] Loading file and block metadata for 10 paths for table hive_partition.two_partition using a thread pool of size 10
I0617 17:01:39.336242 18605 HdfsTable.java:690] Loaded file and block metadata for hive_partition.two_partition partitions: day=20240613/hour=0, day=20240613/hour=1, day=20240613/hour=2, and 7 others. Time taken: 91.234ms
I0617 17:01:39.336658 18605 ParallelFileMetadataLoader.java:147] Refreshing file and block metadata for 34 paths for table hive_partition.two_partition using a thread pool of size 20
I0617 17:01:39.435528 18605 HdfsTable.java:690] Loaded file and block metadata for hive_partition.two_partition partitions: day=20240613/hour=0, day=20240613/hour=0, day=20240613/hour=2, and 51 others. Time taken: 99.075ms
I0617 17:01:39.435572 18605 HdfsTable.java:1026] Incrementally loaded table metadata for: hive_partition.two_partition
I0617 17:01:39.450933 18605 CatalogServiceCatalog.java:2249] Refreshed table metadata: hive_partition.two_partition
I0617 17:01:40.930032 18605 CatalogServiceCatalog.java:2225] Refreshing table metadata: hive_partition.two_partition
I0617 17:01:40.942590 18605 HdfsTable.java:995] Reloading metadata for table definition and all partition(s) of hive_partition.two_partition (REFRESH issued by root)
I0617 17:01:41.086898 18605 ParallelFileMetadataLoader.java:147] Loading file and block metadata for 10 paths for table hive_partition.two_partition using a thread pool of size 10
I0617 17:01:41.360704 18605 HdfsTable.java:690] Loaded file and block metadata for hive_partition.two_partition partitions: day=20240613/hour=0, day=20240613/hour=1, day=20240613/hour=2, and 7 others. Time taken: 273.775ms
I0617 17:01:41.361122 18605 ParallelFileMetadataLoader.java:147] Refreshing file and block metadata for 34 paths for table hive_partition.two_partition using a thread pool of size 20
I0617 17:01:41.494148 18605 HdfsTable.java:690] Loaded file and block metadata for hive_partition.two_partition partitions: day=20240613/hour=0, day=20240613/hour=0, day=20240613/hour=2, and 61 others. Time taken: 133.243ms
I0617 17:01:41.494203 18605 HdfsTable.java:1026] Incrementally loaded table metadata for: hive_partition.two_partition
I0617 17:01:41.494494 18605 CatalogServiceCatalog.java:2249] Refreshed table metadata: hive_partition.two_partition
I0617 17:01:42.312086 18605 CatalogServiceCatalog.java:2225] Refreshing table metadata: hive_partition.two_partition
I0617 17:01:42.326767 18605 HdfsTable.java:995] Reloading metadata for table definition and all partition(s) of hive_partition.two_partition (REFRESH issued by root)
I0617 17:01:42.516573 18605 ParallelFileMetadataLoader.java:147] Loading file and block metadata for 10 paths for table hive_partition.two_partition using a thread pool of size 10
I0617 17:01:42.708683 18605 HdfsTable.java:690] Loaded file and block metadata for hive_partition.two_partition partitions: day=20240613/hour=0, day=20240613/hour=1, day=20240613/hour=2, and 7 others. Time taken: 192.084ms
I0617 17:01:42.709103 18605 ParallelFileMetadataLoader.java:147] Refreshing file and block metadata for 34 paths for table hive_partition.two_partition using a thread pool of size 20
I0617 17:01:42.828385 18605 HdfsTable.java:690] Loaded file and block metadata for hive_partition.two_partition partitions: day=20240613/hour=0, day=20240613/hour=0, day=20240613/hour=2, and 71 others. Time taken: 119.517ms
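The growth can be read directly from the "Refreshing ... 34 paths" phase of each REFRESH in the 3.4.0 log. The following is a small sketch, not an Impala tool, that adds the explicitly named partitions to the "and N others" figure for the three relevant log messages; the regular expression and function name are illustrative choices.

```python
import re

# Message text of the three "Loaded ..." lines that follow each
# "Refreshing file and block metadata for 34 paths" entry in the 3.4.0 log.
LOG_LINES = [
    "Loaded file and block metadata for hive_partition.two_partition partitions: "
    "day=20240613/hour=0, day=20240613/hour=0, day=20240613/hour=2, and 51 others. "
    "Time taken: 99.075ms",
    "Loaded file and block metadata for hive_partition.two_partition partitions: "
    "day=20240613/hour=0, day=20240613/hour=0, day=20240613/hour=2, and 61 others. "
    "Time taken: 133.243ms",
    "Loaded file and block metadata for hive_partition.two_partition partitions: "
    "day=20240613/hour=0, day=20240613/hour=0, day=20240613/hour=2, and 71 others. "
    "Time taken: 119.517ms",
]

PATTERN = re.compile(r"partitions: (?P<named>.+?), and (?P<others>\d+) others")


def partition_count(line):
    """Return named partitions plus the 'and N others' figure for one log line."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError(f"unrecognized log line: {line!r}")
    named = match.group("named").split(", ")
    return len(named) + int(match.group("others"))


counts = [partition_count(line) for line in LOG_LINES]
deltas = [later - earlier for earlier, later in zip(counts, counts[1:])]

print("partitions reported per refresh:", counts)
print("growth between refreshes:", deltas)
```

For the log excerpt in this report the reported totals are 54, 64, and 74, an increase of 10 per refresh, which equals the number of paths in the preceding "Loading file and block metadata for 10 paths" message.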
On Impala 4.3.0, a SELECT query gets stuck in an endless loop of prioritized metadata loading:
E0617 17:07:01.926093 4158610 ImpaladCatalog.java:264] Error adding catalog object: Error applying incremental updates on table hive_partition.two_partition: missing partition ids true, stale partition ids false
Java exception follows:
org.apache.impala.catalog.TableLoadingException: Error applying incremental updates on table hive_partition.two_partition: missing partition ids true, stale partition ids false
    at org.apache.impala.catalog.HdfsTable.validatePartitions(HdfsTable.java:2066)
    at org.apache.impala.catalog.ImpaladCatalog.addTable(ImpaladCatalog.java:545)
    at org.apache.impala.catalog.ImpaladCatalog.addCatalogObject(ImpaladCatalog.java:334)
    at org.apache.impala.catalog.ImpaladCatalog.updateCatalog(ImpaladCatalog.java:262)
    at org.apache.impala.service.FeCatalogManager$CatalogdImpl.updateCatalogCache(FeCatalogManager.java:114)
    at org.apache.impala.service.Frontend.updateCatalogCache(Frontend.java:560)
    at org.apache.impala.service.JniFrontend.updateCatalogCache(JniFrontend.java:186)
I0617 17:07:01.927062 4158610 impala-server.cc:2176] Catalog topic update applied with version: 14 new min catalog object version: 1
I0617 17:07:03.927403 1511568 StmtMetadataLoader.java:225] 9c4e7b92e71da380:855fd23400000000] Waiting for table metadata. Waited for 10 catalog updates and 12182ms. Tables remaining: [hive_partition.two_partition]
I0617 17:07:13.931190 1511568 StmtMetadataLoader.java:225] 9c4e7b92e71da380:855fd23400000000] Waiting for table metadata. Waited for 20 catalog updates and 22186ms. Tables remaining: [hive_partition.two_partition]
Issue Links
- relates to IMPALA-8145 Partition metadata key muddle (Open)