Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-9560

When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.2.0
    • None
    • None

    Description

      When hive.stats.collect.rawdatasize=true, 'rawDataSize' for an ORC table will result in value '0' after running 'analyze table TABLE_NAME compute statistics;'

      Reproduce step:
      (1) set hive.stats.collect.rawdatasize=true;
      (2) Generate an ORC table in hive, and the value of its 'rawDataSize' is NOT zero.
      You can find the value of 'rawDataSize' (NOT zero) by executing 'describe extended TABLE_NAME;'
      (4) Execute 'analyze table TABLE_NAME compute statistics;'
      (5) Execute 'describe extended TABLE_NAME;' again, and you will find that the value of 'rawDataSize' will be changed to '0'.

      Attachments

        1. HIVE-9560.1.patch
          115 kB
          Prasanth Jayachandran
        2. HIVE-9560.2.patch
          499 kB
          Prasanth Jayachandran

        Issue Links

          Activity

            People

              prasanth_j Prasanth Jayachandran
              xhao1 Xin Hao
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: