Details
- Type: Brainstorming
- Status: Closed
- Priority: Major
- Resolution: Implemented
Description
saint.ack@gmail.com suggested using DataSketches (https://datasketches.github.io) to write additional statistics to the HFiles. These statistics could improve our split decisions, help with troubleshooting, or enable other interesting analysis without having to perform full table scans. The statistics could be stored as part of the HFile, but as a first step we could improve the visibility of the data by adding some statistics to HFilePrettyPrinter.
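To make the idea concrete, here is a minimal sketch of a one-pass, bounded-memory summary that could be fed cell sizes while a file is written or pretty-printed. It does not use the DataSketches API; it substitutes a plain reservoir sample (Algorithm R) for a real quantile sketch so that the example stays self-contained. The class name `CellSizeSummary` and all of its methods are hypothetical and are not part of HBase or DataSketches.

```java
import java.util.Arrays;
import java.util.Random;

/**
 * Hypothetical stand-in for a quantile sketch: tracks count/min/max exactly
 * and keeps a fixed-size reservoir sample for approximate quantiles.
 */
final class CellSizeSummary {
    private final long[] sample;   // reservoir of observed sizes
    private final Random random;   // seeded so runs are repeatable
    private long count = 0;
    private long min = Long.MAX_VALUE;
    private long max = Long.MIN_VALUE;

    CellSizeSummary(int capacity, long seed) {
        if (capacity <= 0) {
            throw new IllegalArgumentException("capacity must be positive");
        }
        this.sample = new long[capacity];
        this.random = new Random(seed);
    }

    /** Record one observed size (for example, the serialized size of a cell). */
    void update(long size) {
        min = Math.min(min, size);
        max = Math.max(max, size);
        if (count < sample.length) {
            // Reservoir not full yet: keep every value.
            sample[(int) count] = size;
        } else {
            // Algorithm R: pick a slot uniformly in [0, count]; keep the new
            // value only if the slot falls inside the reservoir.
            long slot = (long) (random.nextDouble() * (count + 1));
            if (slot < sample.length) {
                sample[(int) slot] = size;
            }
        }
        count++;
    }

    long getCount() { return count; }
    long getMin() { return min; }
    long getMax() { return max; }

    /** Quantile for rank in [0, 1]; exact while count <= capacity, approximate after. */
    long quantile(double rank) {
        if (count == 0) {
            throw new IllegalStateException("no values recorded");
        }
        if (rank < 0.0 || rank > 1.0) {
            throw new IllegalArgumentException("rank must be in [0, 1]");
        }
        int n = (int) Math.min(count, sample.length);
        long[] sorted = Arrays.copyOf(sample, n);
        Arrays.sort(sorted);
        int index = Math.min(n - 1, (int) (rank * n));
        return sorted[index];
    }

    public static void main(String[] args) {
        CellSizeSummary summary = new CellSizeSummary(1024, 42L);
        for (long size = 1; size <= 100_000; size++) {
            summary.update(size);
        }
        System.out.println("count=" + summary.getCount()
            + " min=" + summary.getMin()
            + " max=" + summary.getMax()
            + " approx median=" + summary.quantile(0.5));
    }
}
```

A real implementation would more likely use one of the DataSketches quantile sketches, which are mergeable across files and come with error bounds; the reservoir above only illustrates the shape of the API (update per cell, query at print time).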
Attachments
Issue Links
- relates to
  - HBASE-9113 Expose region statistics on table.jsp (Closed)
  - HBASE-25641 Improve storeFile.jsp performance on really large files (Open)
  - HBASE-12311 Version stats in HFiles? (Closed)
- links to