Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-27005

Iceberg: Col stats are not used in queries

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsAdd voteVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      1. Though, insert-queries compute colstats during runtime, they are not persisted in HMS during final call. 

      2. Due to #1, col stats are not available during runtime for hive queries. This includes col stats, NDV etc. So unless users explicitly run "analyse table" statements, queries can be have suboptimal plans.

      E.g col_stats.txt(note that there is no col stats being used)

      Attachments

        1. col_stats.txt
          25 kB
          Rajesh Balamohan

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            simhadri-g Simhadri Govindappa Assign to me
            rajesh.balamohan Rajesh Balamohan

            Dates

              Created:
              Updated:

              Slack

                Issue deployment