Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-45784 Introduce clustering mechanism to Spark
  3. SPARK-44886

Introduce CLUSTER BY SQL clause to CREATE/REPLACE TABLE

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 4.0.0
    • 4.0.0
    • SQL

    Description

      This proposes to introduce CLUSTER BY clause to CREATE/REPLACE SQL syntax:

      CREATE TABLE tbl(a int, b string) CLUSTER BY (a, b)

      This doesn't introduce a default implementation for clustering, but it's up to the catalog/datasource implementation to utilize the clustering information (e.g., Delta, Iceberg, etc.).

      Attachments

        Issue Links

          Activity

            People

              imback82 Terry Kim
              imback82 Terry Kim
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: