Description
I just found HADOOP-4565 and noticed that CombineFileInputFormat can be used to control the number of tasks when the number of blocks exceeds the cluster's capacity.
I suggest we also add a CombineSequenceFileInputFormat for sequence files. This feature would be very helpful.
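
To illustrate why combining helps, here is a minimal plain-Java sketch of the idea: pack several blocks into one split so that fewer tasks are launched. It is not Hadoop code and not the algorithm CombineFileInputFormat actually uses (which also takes node and rack locality into account). The class SplitPacker, the method pack, and the maxSplitSize parameter are hypothetical names made up for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, NOT part of Hadoop: shows how blocks could be
// packed into combined splits so that fewer tasks are needed.
class SplitPacker {
    // Greedily groups block indices, in order, so that each group's total
    // size stays within maxSplitSize. A single block larger than
    // maxSplitSize ends up in a group of its own.
    static List<List<Integer>> pack(long[] blockSizes, long maxSplitSize) {
        List<List<Integer>> splits = new ArrayList<>();
        List<Integer> current = new ArrayList<>();
        long currentSize = 0;
        for (int i = 0; i < blockSizes.length; i++) {
            // Close the current group if adding this block would overflow it.
            if (!current.isEmpty() && currentSize + blockSizes[i] > maxSplitSize) {
                splits.add(current);
                current = new ArrayList<>();
                currentSize = 0;
            }
            current.add(i);
            currentSize += blockSizes[i];
        }
        // Flush the last, possibly partial, group.
        if (!current.isEmpty()) {
            splits.add(current);
        }
        return splits;
    }
}
```

With five 64 MB blocks and a 128 MB limit, pack returns three groups, so three tasks would run instead of five. A CombineSequenceFileInputFormat would presumably apply the same kind of grouping to the blocks of sequence files.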