Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-1997

Hive generated parquet files with maps containing strings return wrong value

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Not A Problem
    • None
    • None
    • Storage - Parquet
    • None

    Description

      Created a parquet file in hive having the following DDL
      hive> desc alltypesparquet;
      OK
      c1 int
      c2 boolean
      c3 double
      c4 string
      c5 array<int>
      c6 map<int,string>
      c7 map<string,string>
      c8 struct<r:string,s:int,t:double>
      c9 tinyint
      c10 smallint
      c11 float
      c12 bigint
      c13 array<array<string>>
      c15 struct<r:int,s:struct<a:int,b:string>>
      c16 array<struct<m:map<string,string>,n:int>>
      Time taken: 0.076 seconds, Fetched: 15 row(s)

      All the complex types with string in them are returning incorrect values in drill. For example:

      hive> select c6 from alltypesparquet;
      NULL
      NULL

      {1:"x",2:"y"}

      0: jdbc:drill:> select c6 from `/user/hive/warehouse/alltypesparquet`;
      ------------

      c6

      ------------

      {"map":[]}
      {"map":[]}
      {"map":[{"key":1,"value":"eA=="},{"key":2,"value":"eQ=="}]}

      ------------
      3 rows selected (0.077 seconds)

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            parthc Parth Chandra
            inramana Ramana Inukonda Nagaraj
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment