Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-973

Mark fields in RDD class that are not used in workers as @transient to reduce task size

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.8.1, 0.9.0
    • None
    • None

    Description

      I can see a few candidates to mark as transient.

        /** Optionally overridden by subclasses to specify how they are partitioned. */
        val partitioner: Option[Partitioner] = None
      
        /** A friendly name for this RDD */
        var name: String = null
      
        /** User-defined generator of this RDD*/
        var generator = Utils.getCallSiteInfo.firstUserClass
      
      

      Attachments

        Activity

          People

            rxin Reynold Xin
            rxin Reynold Xin
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: