Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-3761 CQL 3.0
  3. CASSANDRA-4179

Add more general support for composites (to row key, column value)

Agile BoardAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersConvert to IssueMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Low
    • Resolution: Fixed
    • 1.2.0 beta 1
    • Legacy/CQL

    Description

      Currently CQL3 have a nice syntax for using composites in the column name (it's more than that in fact, it creates a whole new abstraction but let's say I'm talking implementation here). There is however 2 other place where composites could be used (again implementation wise): the row key and the column value. This ticket proposes to explore which of those make sense for CQL3 and how.

      For the row key, I really think that CQL support makes sense. It's very common (and useful) to want to stuff composite information in a row key. Sharding a time serie (CASSANDRA-4176) is probably the best example but there is other.

      For the column value it is less clear. CQL3 makes it very transparent and convenient to store multiple related values into multiple columns so maybe composites in a column value is much less needed. I do still see two cases for which it could be handy:

      1. to save some disk/memory space, if you do know it makes no sense to insert/read two value separatly.
      2. if you want to enforce that two values should not be inserted separatly. I.e. to enforce a form of "constraint" to avoid programatic error.

      Those are not widely useful things, but my reasoning is that if whatever syntax we come up for "grouping" row key in a composite trivially extends to column values, why not support it.

      As for syntax I have 3 suggestions (that are just that, suggestions):

      1. If we only care about allowing grouping for row keys:
        CREATE TABLE timeline (
            name text,
            month int,
            ts timestamp,
            value text,
            PRIMARY KEY ((name, month), ts)
        )
        
      2. A syntax that could work for both grouping in row key and colum value:
        CREATE TABLE timeline (
            name text,
            month int,
            ts timestamp,
            value1 text,
            value2 text,
            GROUP (name, month) as key,
            GROUP (value1, value2),
            PRIMARY KEY (key, ts)
        )
        
      3. An alternative to the preceding one:
        CREATE TABLE timeline (
            name text,
            month int,
            ts timestamp,
            value1 text,
            value2 text,
            GROUP (name, month) as key,
            GROUP (value1, value2),
            PRIMARY KEY (key, ts)
        ) WITH GROUP (name, month) AS key
           AND GROUP (value1, value2)
        

      Attachments

        1. 4179.txt
          57 kB
          Sylvain Lebresne
        2. 4179-2.txt
          64 kB
          Sylvain Lebresne

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            slebresne Sylvain Lebresne Assign to me
            slebresne Sylvain Lebresne
            Sylvain Lebresne
            Pavel Yaskevich
            Votes:
            1 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment