Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-14332

Reduce logging from VectorMapOperator

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • None
    • 2.1.1, 2.2.0
    • Hive
    • None

    Description

      org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator: VectorMapOperator path: hdfs://cn108-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/tpcds_bin_partitioned_orc_200.db/store_sales/ss_sold_date_sk=2451710, read type VECTORIZED_INPUT_FILE_FORMAT, vector deserialize type NONE, aliases store_sales
      Lines like this repeat all over the log. This gets really big with a large number of partitions. 6MB of logs per node for a 30 task query running for 20 seconds on a 3 node cluster.
      Instead of logging this line - can we have a consolidated log / logging only if something abnormal happens ... or a shorter log message.

      Attachments

        1. HIVE-14332.01.patch
          1 kB
          Matt McCline

        Activity

          People

            mmccline Matt McCline
            mmccline Matt McCline
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: