Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-10354

First cost RDD shouldn't be cached in k-means|| and the following cost RDD should use MEMORY_AND_DISK

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.5.0
    • 1.3.2, 1.4.2, 1.5.0
    • MLlib
    • None

    Description

      The first RDD doesn't need to be cached, other cost RDDs should use MEMORY_AND_DISK to avoid recomputing.

      Attachments

        Issue Links

          Activity

            People

              mengxr Xiangrui Meng
              mengxr Xiangrui Meng
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: