Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-3938

Set RDD name to table name during cache operations

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 1.2.0
    • SQL
    • None

    Description

      When we create a table via "CACHE TABLE tbl" or "CACHE TABLE tbl AS SELECT", we should name the created RDD with the table name. This will allow it to render nicely in the storage tab, which is necessary when people look at the storage tab to understand the caching behavior of Spark (e.g. percentage in cache, etc).

      Attachments

        Activity

          People

            lian cheng Cheng Lian
            pwendell Patrick Wendell
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: