Description
The current implementation of column stats uses the base64 encoding of the internal UnsafeRow format to persist statistics (in table properties in Hive metastore). This is an internal format that is not stable across different versions of Spark and should NOT be used for persistence.
In addition, it would be better if statistics stored in the catalog is human readable.
Attachments
Issue Links
- links to