SPARK-13115

RandomForest is stuck computing the same stage over and over

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Invalid
    • Affects Version/s: 1.5.2
    • Fix Version/s: None
    • Component/s: ML, MLlib, Spark Core
    • Labels: None

    Description

      While running RandomForest regression, the algorithm keeps recomputing the same stage and does not proceed any further. I have observed the same stage being computed for more than 11 hours. Attached are some captures from the Spark web UI.

      The Spark event logs for this model run can be fetched from https://s3.amazonaws.com/com.tookitaki.public.logs/spark-event-logs. I am running spark-1.5.2 in standalone local mode. I would also like to know why some stages are marked as skipped.

      Let me know if you need more information.

      Attachments

        1. Stage details.png
          138 kB
          Rahul Tanwani
        2. Stages.png
          138 kB
          Rahul Tanwani
        3. Task details.png
          138 kB
          Rahul Tanwani

          People

            Assignee: Unassigned
            Reporter: Rahul Tanwani (tanwanirahul)
            Votes: 0
            Watchers: 1
