Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'
Reproduce step:
(1) set hive.stats.collect.rawdatasize=true;
(2) Generate an ORC table in hive, and the value of its 'rawDataSize' is NOT zero.
You can find the value of 'rawDataSize' (NOT zero) by executing 'describe extended TABLE_NAME;'
(4) Execute 'analyze table TABLE_NAME compute statistics;'
(5) Execute 'describe extended TABLE_NAME;' again, and you will find that the value of 'rawDataSize' will be changed to '0'.
Attachments
Attachments
Issue Links
- relates to
-
HIVE-9697 Hive on Spark is not as aggressive as MR on map join [Spark Branch]
- Resolved