Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-6116 DataFrame API improvement umbrella ticket (Spark 1.5)
  3. SPARK-6865

Decide on semantics for string identifiers in DataFrame API

    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.0
    • Component/s: SQL
    • Labels:
      None
    • Target Version/s:
    • Sprint:
      Spark 1.5 doc/QA sprint

      Description

      There are two options:

      • Quoted Identifiers: meaning that the strings are treated as though they were in backticks in SQL. Any weird characters (spaces, or, etc) are considered part of the identifier. Kind of weird given that `*` is already a special identifier explicitly allowed by the API
      • Unquoted parsed identifiers: would allow users to specify things like tableAlias.* However, would also require explicit use of `backticks` for identifiers with weird characters in them.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                rxin Reynold Xin
                Reporter:
                marmbrus Michael Armbrust
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: