Details
-
Sub-task
-
Status: Patch Available
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Timestamp in Hive was originally mapped to INT_96 in Parquet before. And inside Parquet it use Binary to save values.
When working on HIVE-10255, I found predicate with timestamp does not work, since comparing Binary representative of Timestamp value can not give correct relationship.
Meanwhile, in latest version Parquet, the timestamp is mapped to INT_64. Refer to https://github.com/Parquet/parquet-format/blob/master/LogicalTypes.md
Therefore, this Jira will update the mapping of Timestamp type in Hive, and keep it backward compatible.
Attachments
Attachments
Issue Links
- blocks
-
HIVE-10255 Parquet PPD support TIMESTAMP
- Patch Available
- duplicates
-
HIVE-10054 Clean up ETypeConverter since Parquet supports timestamp type already
- Resolved