Project: Hive
Parent: HIVE-12661 StatsSetupConst.COLUMN_STATS_ACCURATE is not used correctly
Issue: HIVE-12871

rawDataSize is always 0 for some queries with concurrency support


Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      set hive.stats.fetch.column.stats=true;
      set hive.stats.fetch.partition.stats=true;
      set hive.compute.query.using.stats=true;
      set hive.mapred.mode=nonstrict;
      set hive.exec.dynamic.partition.mode=nonstrict;
      set hive.support.concurrency=true;
      set hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
      
      drop table calendar;
      
      CREATE TABLE calendar (year int, month int) clustered by (month) into 2 buckets stored as orc TBLPROPERTIES ('transactional'='true');
      
      insert into calendar values (2010, 10), (2011, 11), (2012, 12);
      
      desc formatted calendar;
      
      

      With these settings, desc formatted returns:

      rawDataSize             0
      totalSize               1242
      

      If I comment out these two settings:

      set hive.support.concurrency=true;
      set hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
      

      desc formatted then returns:

      rawDataSize             24
      totalSize               547
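
      One way to narrow this down, offered only as a sketch: recompute the basic statistics explicitly and inspect the table again. ANALYZE TABLE ... COMPUTE STATISTICS is a standard Hive statement, but this follow-up check is an assumption and not part of the original report, and nothing here establishes whether it populates rawDataSize for a transactional table under DbTxnManager.

      ```sql
      -- Hypothetical follow-up check, not part of the original report.
      -- Run with the same session settings as the reproduction above.
      -- Recompute basic table statistics explicitly.
      analyze table calendar compute statistics;

      -- Inspect rawDataSize and totalSize again and compare with the earlier output.
      desc formatted calendar;
      ```

      If rawDataSize is still 0 after the explicit recompute, that would suggest the problem is not limited to statistics gathered during the insert.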
      

          People

            Assignee: Unassigned
            Reporter: Pengcheng Xiong (pxiong)