Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-1093

Add a "skew join map join size" variable to control the input size of skew join's following map join job.

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      In a test, many skew join key itself >250M size. And the following mapjoin will take several hours to do a mapjoin for those big skew keys.
      This can be better by using a small map input size for the following map join job.

        Attachments

        1. hive-1093.2.patch
          7 kB
          He Yongqiang
        2. hive-1093.patch
          6 kB
          He Yongqiang

          Activity

            People

            • Assignee:
              he yongqiang He Yongqiang
              Reporter:
              he yongqiang He Yongqiang
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: