Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.94.1, 0.94.5
-
None
Description
In ColumnTracker.java, the update() method is not used by anyone now. And no one will call checkColumn for different HFiles with update() in between files to re-walk through the target columns. All columns will be feed to checkColumn() in order.
So, within ExplicitColumnTracker, the target columns can be optimized to not dynamic maintain a changing list of columns yet to match. Instead, just move index through it is enough.
with this optimization to save the time for avoid reconstruct a columns array upon each row, the checkColumn method's performance could be improved by 10-20%.