  1. Hive
  2. HIVE-7292 Hive on Spark
  3. HIVE-10989

HoS can't control number of map tasks for runtime skew join [Spark Branch]

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: spark-branch, 1.3.0, 2.0.0
    • Component/s: Spark
    • Labels: None

    Description

      The properties hive.skewjoin.mapjoin.map.tasks and hive.skewjoin.mapjoin.min.split control the number of map tasks used for the map join that handles skewed keys in a runtime skew join. They work as intended on MapReduce (MR) but have no effect on Spark.
      This makes runtime skew join less useful on Spark: the skewed work is only moved, so we end up with slow mappers instead of slow reducers.
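
To illustrate why these two properties matter, here is a rough sketch of how a requested task count and a minimum split size could translate into an actual number of map tasks. It assumes the split-size rule of Hadoop's classic FileInputFormat (split = max(min split, min(goal, block size))) and ignores its slop factor. The function name, the 128 MiB block size, and the input sizes are illustrative assumptions, not code from Hive or Spark.

```python
import math

def estimate_map_tasks(total_bytes, requested_map_tasks, min_split_bytes,
                       block_bytes=128 * 1024 * 1024):
    """Estimate how many map tasks a file-based input would be split into.

    Simplified model (assumption): one contiguous input, no split slop,
    and a hypothetical default block size of 128 MiB.
    """
    if total_bytes <= 0:
        return 0
    # Goal size: what each task would read if the request were honoured exactly.
    goal = max(1, total_bytes // max(1, requested_map_tasks))
    # The minimum split size puts a floor under the split size,
    # the block size puts a ceiling on the goal.
    split = max(min_split_bytes, min(goal, block_bytes))
    return math.ceil(total_bytes / split)


if __name__ == "__main__":
    one_gib = 1024 ** 3
    # Many tasks requested, but a 32 MiB minimum split caps the parallelism.
    print(estimate_map_tasks(one_gib, 10000, 32 * 1024 * 1024))
    # Few tasks requested; the block size then bounds the split size instead.
    print(estimate_map_tasks(one_gib, 4, 1))
```

Under this model, a large requested task count combined with a modest minimum split spreads the skewed rows across many mappers. When the engine ignores both settings, as described above for Spark, that parallelism is not available.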

          People

            Assignee: Rui Li (lirui)
            Reporter: Rui Li (lirui)
