Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Fixed
-
0.98.13
-
None
-
Reviewed
-
Description
When bulk loading millions of HFile into one HTable, checking HFile format is the most time-consuming phase. Maybe we could use a parallel mechanism to increase the speed, but when it comes to millions of HFiles, it may still cost dozens of minutes. So I think it's necessary to add an option for advanced user to bulkload without checking HFile format at all.
Of course, the default value of this option should be true.