HIVE-9294 (sub-task of HIVE-7292, Hive on Spark)

Improve replication factor of small table file given big table partitions [Spark branch]

    Details

    • Type: Sub-task
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: spark-branch
    • Fix Version/s: None
    • Component/s: Spark
    • Labels:
      None

      Description

      During a mapjoin, we might be able to improve the replication factor of the small-table files based on the number of partitions in the big table. This JIRA tracks the investigation of that.

      Note: this JIRA may be pending potential changes to the mapjoin algorithm.
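
To make the idea concrete, here is a minimal sketch of one possible heuristic, not the approach this JIRA settled on. It assumes each big-table partition corresponds to roughly one task that reads the small-table file, so it asks for one replica per partition, clamped between a default and a maximum. The class name `ReplicationSketch`, the method `chooseReplication`, and all three parameters are hypothetical and exist only for illustration.

```java
/** Illustrative only: a hypothetical heuristic, not code from Hive. */
public final class ReplicationSketch {
    private ReplicationSketch() {}

    /**
     * Picks a replication factor for a small-table file.
     *
     * Assumption: one reader task per big-table partition, so more
     * partitions means more concurrent readers of the same file.
     *
     * @param bigTablePartitions number of partitions in the big table
     * @param defaultReplication lower bound (for example the cluster default)
     * @param maxReplication     upper bound to avoid flooding the cluster
     */
    public static short chooseReplication(int bigTablePartitions,
                                          int defaultReplication,
                                          int maxReplication) {
        if (bigTablePartitions < 0
                || defaultReplication < 1
                || maxReplication < defaultReplication
                || maxReplication > Short.MAX_VALUE) {
            throw new IllegalArgumentException("invalid replication bounds");
        }
        // Clamp the partition count into [defaultReplication, maxReplication].
        int clamped = Math.max(defaultReplication,
                               Math.min(bigTablePartitions, maxReplication));
        return (short) clamped;
    }
}
```

With a default of 3 and a cap of 10, a big table with 4 partitions would get 4 replicas, and one with 500 partitions would be capped at 10. The resulting value could then be applied to the file with Hadoop's `FileSystem.setReplication(Path, short)`. Whether a linear rule like this is appropriate is exactly what the investigation would need to determine.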

              People

              • Assignee: Jimmy Xiang (jxiang)
              • Reporter: Szehon Ho (szehon)
              • Votes: 0
              • Watchers: 2