Description
The default InputFormat is CombineHiveInputFormat and thus HOS uses this. However, Tez uses HiveInputFormat. Since tasks are relatively cheap in Spark, it might make sense for us to use HiveInputFormat as well. We should evaluate this on a query which has many input splits such as select count(*) from store_sales where something is not null.
Attachments
Attachments
Issue Links
- is related to
-
SPARK-4921 TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks
- Resolved