  1. Apache Drill
  2. DRILL-6035

Specify Drill's JSON behavior

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • 1.13.0

    Description

      Drill supports JSON as its native data format. However, experience suggests that there are limits on the JSON that Drill supports. This ticket asks to clarify Drill's expected behavior on various kinds of JSON.

      Topics to be addressed:

      • Relational vs. non-relational structures
      • JSON structures used in practice and how they map to Drill
      • Support for varying data types
      • Support for missing values, especially across files
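
      To make the last two topics concrete, here is a minimal sketch of the kind of input in question. It is not Drill code and says nothing about what Drill actually does. The sample records, the `json_type` helper, and the `profile` function are all hypothetical and exist only to illustrate a field whose type varies between records and a field that is absent from some records.

```python
import json
from collections import defaultdict

# Hypothetical sample: newline-delimited JSON records, as might be spread
# across one or more files. "id" and "tags" change type, "price" is
# sometimes null, and "tags" is missing from one record.
SAMPLE = """\
{"id": 1, "price": 10, "tags": ["a", "b"]}
{"id": 2, "price": 10.5}
{"id": "3", "price": null, "tags": "a"}
"""


def json_type(value):
    """Return a JSON-style type name for a decoded Python value."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # check bool before int: bool is a subclass of int
        return "boolean"
    if isinstance(value, int):
        return "integer"
    if isinstance(value, float):
        return "float"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    return "object"


def profile(lines):
    """For each top-level field, collect the types seen and count the records lacking it."""
    types = defaultdict(set)
    present = defaultdict(int)
    total = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)  # assumes each line holds one JSON object
        total += 1
        for key, value in record.items():
            types[key].add(json_type(value))
            present[key] += 1
    return {
        key: {"types": sorted(types[key]), "missing": total - present[key]}
        for key in types
    }


report = profile(SAMPLE.splitlines())
for field, info in report.items():
    print(field, info)
```

      Any field that reports more than one type, or a non-zero missing count, is a case the requested specification would need to address.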

      These topics are complex, hence the request to provide a detailed specification that clarifies what Drill does and does not support (or what it should and should not support).

      As noted below, the "net-net" conclusion for users is to use an ETL tool to convert JSON to Parquet, then allow Drill to query the Parquet.

          People

            priteshm Pritesh Maker
            Paul.Rogers Paul Rogers
