Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-3938

Set RDD name to table name during cache operations

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.2.0
    • Component/s: SQL
    • Labels:
      None
    • Target Version/s:

      Description

      When we create a table via "CACHE TABLE tbl" or "CACHE TABLE tbl AS SELECT", we should name the created RDD with the table name. This will allow it to render nicely in the storage tab, which is necessary when people look at the storage tab to understand the caching behavior of Spark (e.g. percentage in cache, etc).

        Attachments

          Activity

            People

            • Assignee:
              lian cheng Cheng Lian
              Reporter:
              pwendell Patrick Wendell
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: