Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-8261

Make the XML format plugin's attributes MAP an implicit column

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Not A Problem
    • 1.20.1
    • 1.21.0
    • Storage - XML
    • None

    Description

      The attributes MAP column generated by the XML format plugin is currently explicit and present in wildcard selects. However, attributes are frequently not present at all in XML data, or are better queried using explicit projections of the individual attributes of interest to scalars. The motivating example here is an ETL-style query that transforms XML to Parquet using a CTAS with a wildcard column spec. This query will fail for XML that has no attributes because the Parquet writer cannot write a Parquet schema containing the empty struct produced by the attributes map. It is therefore proposed that the attributes MAP becomes an implicit column.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            dzamo James Turton
            dzamo James Turton
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment