Hive
  1. Hive
  2. HIVE-7292 Hive on Spark
  3. HIVE-7328

Refactoring: make Hive reduce side data processing reusable [Spark Branch]

    Details

    • Type: Sub-task Sub-task
    • Status: Reopened
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: 0.13.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      ExecReducer is Hive's reducer implementation for MapReduce. Table rows are shuffled by MR framework to ExecReducer and further processed by ExecReducer.reduce() method, which invokes Hive's reduce-side operator tree starting. This task is to extract the reduce-side data processing offered by the operator tree so that it can be reused by other execution engine such as Spark. This is purely refactoring the existing code.

        Activity

        Xuefu Zhang created issue -
        Hide
        Xuefu Zhang added a comment -

        It seems it's easier to use ExecReducer directly than any refactoring. Postpone this item for now for later consideration.

        Show
        Xuefu Zhang added a comment - It seems it's easier to use ExecReducer directly than any refactoring. Postpone this item for now for later consideration.
        Hide
        Xuefu Zhang added a comment -

        Closed as "will not fix". Will reopen if need comes back.

        Show
        Xuefu Zhang added a comment - Closed as "will not fix". Will reopen if need comes back.
        Xuefu Zhang made changes -
        Field Original Value New Value
        Status Open [ 1 ] Resolved [ 5 ]
        Resolution Won't Fix [ 2 ]
        Hide
        Chengxiang Li added a comment -

        We have implement native HiveReduceFunction instead of using ExecRedecer directly in HIVE-7677, reopen this JIRA to track further refactoring.

        Show
        Chengxiang Li added a comment - We have implement native HiveReduceFunction instead of using ExecRedecer directly in HIVE-7677 , reopen this JIRA to track further refactoring.
        Chengxiang Li made changes -
        Resolution Won't Fix [ 2 ]
        Status Resolved [ 5 ] Reopened [ 4 ]
        Brock Noland made changes -
        Summary Refactoring: make Hive reduce side data processing reusable Refactoring: make Hive reduce side data processing reusable [Spark Branch]
        Transition Time In Source Status Execution Times Last Executer Last Execution Date
        Open Open Resolved Resolved
        27d 15h 9m 1 Xuefu Zhang 30/Jul/14 10:51
        Resolved Resolved Reopened Reopened
        15d 16h 4m 1 Chengxiang Li 15/Aug/14 02:55

          People

          • Assignee:
            Xuefu Zhang
            Reporter:
            Xuefu Zhang
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

            • Created:
              Updated:

              Development