Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-26764

[SPIP] Spark Relational Cache

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.1.0
    • None
    • SQL
    • None

    Description

      In modern database systems, relational cache is a common technology to boost ad-hoc queries. While Spark provides cache natively, Spark SQL should be able to utilize the relationship between relations to boost all possible queries. In this SPIP, we will make Spark be able to utilize all defined cached relations if possible, without explicit substitution in user query, as well as keep some user defined cache available in different sessions. Materialized views in many database systems provide similar function.

      Attachments

        1. Relational+Cache+SPIP.pdf
          413 kB
          Adrian Wang

        Issue Links

          Activity

            People

              Unassigned Unassigned
              adrian-wang Adrian Wang
              Votes:
              5 Vote for this issue
              Watchers:
              24 Start watching this issue

              Dates

                Created:
                Updated: