Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-28489

Partitioning the input data of Grouping Set GroupBy operator

    XMLWordPrintableJSON

Details

    Description

      GroupBy operator with grouping sets often emits too many rows, which becomes the bottleneck of query execution. To reduce the number output rows, this JIRA proposes partitioning the input data of such GroupBy operator.

      Please check out the attached slides for detailed explanation.

      Attachments

        1. 2.PartitionDataBeforeGroupingSet.pdf
          505 kB
          Seonggon Namgung

        Issue Links

          Activity

            People

              seonggon Seonggon Namgung
              seonggon Seonggon Namgung
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: