Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
Currently Hive writes the name of the table plus snapshot id as blob type:
Instead, it should write the name of the blog it writes. Table name and snapshot id are redundant information anyway, as they can be inferred from the location and filename of the puffin file.
Currently it writes a non-standard blob (Standard blob types are listed here). I think it would be better to write standard blobs for interoperability. But if Hive wants to write non-standard blobs anyway, it should still come up with a descriptive name for them, e.g. 'hive-column-statistics-v1'.
Attachments
Issue Links
- is a child of
-
HIVE-27158 Store hive columns stats in puffin files for iceberg tables
- Closed