Details
-
Wish
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
While running hive queries like "SELECT max(date_partition) FROM VERY_LARGE_FACT_TBL", where VERY_LARGE_FACT_TBL is partitioned by column 'date_partition', a full table scan is required. I have observed that if the table contains large amount of data, thousands of mappers are initiated. I wish hive could just interact with metastore to pull simple things like max, min or distinct of partition column. I am using Hive version 2.3.