Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.3.0
    • PTF-Windowing
    • None
    • Hide
      This is to improve the performance of sum function over range based windowing.

      One issue related to sum(lag(x)) over (partition by c1 order by c2 range between ...) and sum(lead(x)) over (partition by c1 order by c2 range between ...) has been fixed which would produce different result. Without the patch, lag(x)/lead(x) would only consider the previous/next row in the windowing, not within the partition, which doesn't match other databases, also doesn't match rows based windowing.
      Show
      This is to improve the performance of sum function over range based windowing. One issue related to sum(lag(x)) over (partition by c1 order by c2 range between ...) and sum(lead(x)) over (partition by c1 order by c2 range between ...) has been fixed which would produce different result. Without the patch, lag(x)/lead(x) would only consider the previous/next row in the windowing, not within the partition, which doesn't match other databases, also doesn't match rows based windowing.

    Description

      Currently streaming process is not supported for range based windowing. Thus sum( x ) over (partition by y order by z) is O(n^2) running time.

      Investigate the possibility of streaming support.

      Attachments

        1. HIVE-15520.1.patch
          87 kB
          Aihua Xu
        2. HIVE-15520.2.patch
          89 kB
          Aihua Xu
        3. HIVE-15520.3.patch
          93 kB
          Aihua Xu
        4. HIVE-15520.4.patch
          666 kB
          Aihua Xu

        Issue Links

          Activity

            People

              aihuaxu Aihua Xu
              aihuaxu Aihua Xu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: