Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.98.8
-
None
-
hadoop-2.3.0, hbase-0.98.8, jdk1.7
-
Reviewed
Description
Normally, a bulkload takes two steps: 1. run a job that writes the HFiles to be loaded; 2. move these HFiles into the right HDFS directory. However, locality can be lost during the first step. Why not write the HFiles directly to the right place? This is easy to do because StoreFile.WriterBuilder has the "withFavoredNodes" method; we just need to call it in HFileOutputFormat's getNewWriter().
This feature is enabled by default; set 'hbase.bulkload.locality.sensitive.enabled=false' to disable it.
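To illustrate the idea, here is a minimal sketch in plain Java. It only models the decision described above and is not the actual patch: the class BulkloadLocalitySketch, the method favoredNodeFor, the map of region start keys to servers, the host names, and the port are all hypothetical stand-ins. Only the property name and its default come from this issue. In a real implementation, the returned address would presumably be what gets passed to withFavoredNodes inside getNewWriter().

```java
import java.net.InetSocketAddress;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;
import java.util.TreeMap;

/** Illustrative model only; none of these types or methods are HBase classes. */
public class BulkloadLocalitySketch {
    // Property name taken from this issue; the feature is enabled by default.
    static final String LOCALITY_KEY = "hbase.bulkload.locality.sensitive.enabled";

    /**
     * Returns the server hosting the region that contains rowKey,
     * or empty if the feature is disabled or no region matches.
     * serverByRegionStartKey is a hypothetical stand-in for a region lookup.
     */
    static Optional<InetSocketAddress> favoredNodeFor(
            Map<String, String> conf,
            TreeMap<String, InetSocketAddress> serverByRegionStartKey,
            String rowKey) {
        boolean enabled = Boolean.parseBoolean(conf.getOrDefault(LOCALITY_KEY, "true"));
        if (!enabled) {
            return Optional.empty();
        }
        // The containing region is the one with the greatest start key <= rowKey.
        Map.Entry<String, InetSocketAddress> region = serverByRegionStartKey.floorEntry(rowKey);
        return region == null ? Optional.empty() : Optional.of(region.getValue());
    }

    public static void main(String[] args) {
        TreeMap<String, InetSocketAddress> regions = new TreeMap<>();
        // Hypothetical hosts and port; createUnresolved avoids a DNS lookup.
        regions.put("", InetSocketAddress.createUnresolved("rs1.example.com", 60020));
        regions.put("k", InetSocketAddress.createUnresolved("rs2.example.com", 60020));

        Map<String, String> conf = new HashMap<>();
        System.out.println(favoredNodeFor(conf, regions, "m-row")
                .map(InetSocketAddress::getHostString).orElse("none"));

        conf.put(LOCALITY_KEY, "false");
        System.out.println(favoredNodeFor(conf, regions, "m-row")
                .map(InetSocketAddress::getHostString).orElse("none"));
    }
}
```

With the flag left at its default, the writer for a row gets the server of the region that contains it; with the flag set to false, no favored node is chosen and block placement falls back to whatever HDFS does by default.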