  1. Hive
  2. HIVE-5369 Annotate hive operator tree with statistics from metastore
  3. HIVE-7679

JOIN operator should update the column stats when number of rows changes


Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.14.0
    • Fix Version/s: 0.14.0
    • Component/s: Query Processor, Statistics
    • Labels: None

    Description

      The JOIN operator does not update the column stats when the number of rows changes. All other operators scale the column statistics up or down when the number of rows changes, and the same should be done for the JOIN operator. Without this scaling, dataSize can become negative, because numNulls can exceed numRows when the column stats are not scaled down.
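
The arithmetic behind the negative dataSize can be sketched as follows. This is a minimal illustration, not the code from the attached patches: the class `JoinStatsSketch`, the `ColStats` holder, the `scale` and `dataSize` helpers, and the formula `(numRows - numNulls) * avgColLen` are all assumptions made for the example. Nulls are scaled proportionally to the row count and the distinct-value count is capped at the new row count.

```java
public class JoinStatsSketch {

    /** Hypothetical per-column statistics holder; not Hive's actual class. */
    public static final class ColStats {
        public final long numNulls;
        public final long countDistinct;
        public final double avgColLen;

        public ColStats(long numNulls, long countDistinct, double avgColLen) {
            this.numNulls = numNulls;
            this.countDistinct = countDistinct;
            this.avgColLen = avgColLen;
        }
    }

    /**
     * Scales column stats when an operator changes the row count from
     * oldNumRows to newNumRows. numNulls is scaled by the row ratio and
     * clamped to the new row count; countDistinct is only capped.
     */
    public static ColStats scale(ColStats in, long oldNumRows, long newNumRows) {
        if (oldNumRows <= 0) {
            // No basis for a ratio: assume no nulls and cap distinct values.
            return new ColStats(0, Math.min(in.countDistinct, newNumRows), in.avgColLen);
        }
        double ratio = (double) newNumRows / oldNumRows;
        long nulls = Math.min(newNumRows, Math.round(in.numNulls * ratio));
        long distinct = Math.min(in.countDistinct, newNumRows);
        return new ColStats(nulls, distinct, in.avgColLen);
    }

    /** Assumed data size estimate: non-null rows times average column length. */
    public static long dataSize(ColStats cs, long numRows) {
        return Math.round((numRows - cs.numNulls) * cs.avgColLen);
    }
}
```

With a column that has 400 nulls over 1000 input rows and an average length of 8 bytes, a join estimated to emit 100 rows would yield (100 - 400) * 8 = -2400 under this formula if the stats were left unscaled. After scaling, numNulls becomes 40 and the estimate is 60 * 8 = 480.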

      Attachments

        1. HIVE-7679.3.patch
          7 kB
          Prasanth Jayachandran
        2. HIVE-7679.2.patch
          6 kB
          Prasanth Jayachandran
        3. HIVE-7679.1.patch
          6 kB
          Prasanth Jayachandran


          People

            Assignee: Prasanth Jayachandran (prasanth_j)
            Reporter: Prasanth Jayachandran (prasanth_j)
            Votes: 0
            Watchers: 3
