Details
-
New Feature
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
This patch enables Hive to start reading timestamps from Parquet written with the new semantics:
With Parquet version 1.11, a new timestamp LogicalType with base INT64 and the following metadata is introduced:
- boolean isAdjustedToUtc: marks whether the timestamp is converted to UTC (aka Instant semantics) or not (LocalDateTime semantics).
- enum TimeUnit (NANOS, MICROS, MILLIS): granularity of timestamp
Upon reading, the semantics of these new timestamps will be determined by their metadata, while the semantics of INT96 timestamps will continue to be deduced from the writer metadata.
This feature will be behind a flag for now.
Attachments
Attachments
Issue Links
- blocks
-
HIVE-21216 Write Parquet INT64 timestamp
- Closed
- causes
-
HIVE-26612 INT64 Parquet timestamps cannot be read into BIGINT Hive type
- Closed
-
HIVE-26658 INT64 Parquet timestamps cannot be mapped to most Hive numeric types
- Closed
- duplicates
-
HIVE-13435 Hive fails to read timestamp stored as binary / int64 from externally generated parquet files
- Resolved
- is blocked by
-
HIVE-21050 Use Parquet LogicalTypes
- Closed