Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-4659

Specify, as part of the query, table information: data format (CSV, parquet, JSON. etc.), field delimiter, etc.

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

      Description

      I have a file, that I would like to use in a query, and it can have one or more of the following properties:

      • Has not extension ==> Drill is unable to handle it.
      • I know it contains data in CSV format, but the field separator is a non standard character ==> Drill is unable to parse it (without modify the storage plugin configuration).
      • Is located in an Amazon S3 bucket ==> I can't rename it.
      • Has a big size ==> It would be expensive to make a copy of it.

      It would be nice if you can specify, as part of the "select" query, as metadata, relevant table information as:

      • Data format (CSV, parquet, JSON. etc.)
      • Field delimiter.

        Attachments

        Issue Links

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              rogerdielrton Roger Dielrton

              Dates

              • Created:
                Updated:

                Issue deployment