Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
None
Description
HBase bulk load (HFileOutputFormat2) generates HFiles using the compression configured on the table's column family (cf).
It would be useful if the compression could also be set on the client side.
Some cases from our production:
1. HFile bulk load replication between data centers with limited bandwidth: we can set the compression of the bulk-loaded HFiles without changing the table compression.
2. The bulk-load HFiles are written without compression while the table compression is gz/zstd/snappy...: this reduces HFile creation time, and compaction will eventually rewrite the HFiles with the table's compression.
3. Sometimes the YARN nodes (where the reducers create the HFiles) or the doBulkLoad client have no compression library, but the HBase cluster does; a client-side setting is useful in this case.
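
To make the requested behavior concrete, here is a minimal sketch of the resolution order described above: a client-side setting, when present, takes precedence over the column family's compression. It is plain Java and does not call HBase. The class name, the `resolveCompression` helper, and the configuration key are assumptions made for illustration, not the API of the actual patch.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class CompressionOverrideSketch {
    // Hypothetical client-side key; the real key name is an assumption here.
    static final String OVERRIDE_KEY = "hbase.mapreduce.hfileoutputformat.compression";

    /**
     * Picks the compression for HFiles written for one column family.
     * A non-empty client-side override wins; otherwise the family's own
     * setting is used; "none" is the fallback when neither is configured.
     */
    static String resolveCompression(Map<String, String> clientConf,
                                     Map<String, String> familyCompression,
                                     String family) {
        String override = clientConf.get(OVERRIDE_KEY);
        if (override != null && !override.trim().isEmpty()) {
            return override.trim().toLowerCase(Locale.ROOT);
        }
        return familyCompression.getOrDefault(family, "none");
    }

    public static void main(String[] args) {
        Map<String, String> familyCompression = new HashMap<>();
        familyCompression.put("cf", "gz"); // table-side setting

        Map<String, String> clientConf = new HashMap<>();
        // No override: the HFile follows the column family setting.
        System.out.println(resolveCompression(clientConf, familyCompression, "cf"));

        // Case 2 from the description: write uncompressed HFiles and let
        // compaction apply the table's compression later.
        clientConf.put(OVERRIDE_KEY, "NONE");
        System.out.println(resolveCompression(clientConf, familyCompression, "cf"));
    }
}
```

With a setting like this, all three cases above reduce to choosing a value on the job's client configuration; the table schema is left untouched.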