Hive
  1. Hive
  2. HIVE-1093

Add a "skew join map join size" variable to control the input size of skew join's following map join job.

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      In a test, many skew join key itself >250M size. And the following mapjoin will take several hours to do a mapjoin for those big skew keys.
      This can be better by using a small map input size for the following map join job.

      1. hive-1093.2.patch
        7 kB
        He Yongqiang
      2. hive-1093.patch
        6 kB
        He Yongqiang

        Activity

          People

          • Assignee:
            He Yongqiang
            Reporter:
            He Yongqiang
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development