Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2104 A CrossProductEdge which produces synthetic cross-product parallelism
  3. TEZ-3573

Allow user to cap number of task in cartesian product (unpartitioned case)

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Invalid
    • None
    • None
    • None
    • None

    Description

      Auto grouping can help reduce #tasks in cartesian product but may still result in too many tasks in case of huge input data. It will be useful for user to cap #task, so that cartesian product won't abuse available resource. The primary limiter will still be auto grouping, but this will be a hard limit which cannot be exceeded anyway.

      Attachments

        1. TEZ-3573.1.patch
          24 kB
          Zhiyuan Yang
        2. TEZ-3573.2.patch
          25 kB
          Zhiyuan Yang
        3. TEZ-3573.3.patch
          25 kB
          Zhiyuan Yang

        Issue Links

          Activity

            People

              zhiyuany Zhiyuan Yang
              zhiyuany Zhiyuan Yang
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: