Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-2758

UnionRDD's UnionPartition should not reference parent RDDs

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.0.0, 1.0.1, 1.0.2
    • 1.1.0
    • None
    • None

    Description

      UnionPartition has a non-transient field referencing the parent RDD, to be used in compute (iterator). That causes some trouble with task size because partition objects are supposed to be small.

      Attachments

        Issue Links

          Activity

            People

              rxin Reynold Xin
              rxin Reynold Xin
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: