Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-8392

RDDOperationGraph: getting cached nodes is slow

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.4.0
    • 1.4.1, 1.5.0
    • Spark Core
    • None

    Description

      def getAllNodes: Seq[RDDOperationNode] =

      { _childNodes ++ _childClusters.flatMap(_.childNodes) }

      when the _childClusters has so many nodes, the process will hang on. I think we can improve the efficiency here.

      Attachments

        Activity

          People

            meiyoula meiyoula
            meiyoula meiyoula
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: