Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-7292 Hive on Spark
  3. HIVE-7327

Refactoring: make Hive map side data processing reusable [Spark Branch]

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: 0.13.0
    • Fix Version/s: None
    • Component/s: None
    • Labels:

      Description

      ExecMapper is Hive's mapper implementation for MapReduce. Table rows are read by MR framework and processed by ExecMapper.map() method, which invokes Hive's map-side operator tree starting from MapOperator. This task is to extract the map-side data processing offered by the operator tree so that it can be used by other execution engine such as Spark. This is purely refactoring the existing code.

        Activity

        Hide
        xuefuz Xuefu Zhang added a comment -

        It seems it's easier to use ExecMapper directly than any refactoring. Postpone this item for now for later consideration.

        Show
        xuefuz Xuefu Zhang added a comment - It seems it's easier to use ExecMapper directly than any refactoring. Postpone this item for now for later consideration.
        Hide
        xuefuz Xuefu Zhang added a comment -

        Closed as not fix. Will reopen if need comes back.

        Show
        xuefuz Xuefu Zhang added a comment - Closed as not fix. Will reopen if need comes back.
        Hide
        chengxiang li Chengxiang Li added a comment -

        We have implement native HiveMapFunction instead of using ExecMapper directly in HIVE-7675, reopen this JIRA to track further refactoring.

        Show
        chengxiang li Chengxiang Li added a comment - We have implement native HiveMapFunction instead of using ExecMapper directly in HIVE-7675 , reopen this JIRA to track further refactoring.
        Hide
        xuefuz Xuefu Zhang added a comment -

        Since Hive on Spark doesn't use ExecMapper directly now, this is no longer an issue. Closed this for now.

        Show
        xuefuz Xuefu Zhang added a comment - Since Hive on Spark doesn't use ExecMapper directly now, this is no longer an issue. Closed this for now.

          People

          • Assignee:
            xuefuz Xuefu Zhang
            Reporter:
            xuefuz Xuefu Zhang
          • Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development