Details

    • Type: New Feature New Feature
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.5.0
    • Component/s: Query Processor
    • Labels:
    • Hadoop Flags:
      Reviewed
    • Release Note:
      Adds the feature of launching multiple map-reduce tasks that are not dependent on each other in parallel. Examples of queries affected would be those including union-alls, and trees of join operators.

      Description

      In a massively parallel database system, it would be awesome to also parallelize some of the mapreduce phases that our data needs to go through.

      One example that just occurred to me is UNION ALL: when you union two SELECT statements, effectively you could run those statements in parallel. There's no situation (that I can think of, but I don't have a formal proof) in which the left statement would rely on the right statement, or vice versa. So, they could be run at the same time...and perhaps they should be. Or, perhaps there should be a way to make this happen...PARALLEL UNION ALL? PUNION ALL?

      1. HIVE549-v7.patch
        199 kB
        Chaitanya Mishra

        Issue Links

          Activity

          No work has yet been logged on this issue.

            People

            • Assignee:
              Chaitanya Mishra
              Reporter:
              Adam Kramer
            • Votes:
              0 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development