Description
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator: VectorMapOperator path: hdfs://cn108-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/tpcds_bin_partitioned_orc_200.db/store_sales/ss_sold_date_sk=2451710, read type VECTORIZED_INPUT_FILE_FORMAT, vector deserialize type NONE, aliases store_sales
Lines like this repeat all over the log. This gets really big with a large number of partitions. 6MB of logs per node for a 30 task query running for 20 seconds on a 3 node cluster.
Instead of logging this line - can we have a consolidated log / logging only if something abnormal happens ... or a shorter log message.