  1. Apache Drill
  2. DRILL-1997

Hive generated parquet files with maps containing strings return wrong value

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Storage - Parquet
    • Labels: None

      Description

      Created a Parquet file in Hive from a table with the following DDL:
      hive> desc alltypesparquet;
      OK
      c1 int
      c2 boolean
      c3 double
      c4 string
      c5 array<int>
      c6 map<int,string>
      c7 map<string,string>
      c8 struct<r:string,s:int,t:double>
      c9 tinyint
      c10 smallint
      c11 float
      c12 bigint
      c13 array<array<string>>
      c15 struct<r:int,s:struct<a:int,b:string>>
      c16 array<struct<m:map<string,string>,n:int>>
      Time taken: 0.076 seconds, Fetched: 15 row(s)

      All complex types that contain strings return incorrect values in Drill. For example, Hive returns the map values "x" and "y" for c6, but Drill returns "eA==" and "eQ==":

      hive> select c6 from alltypesparquet;
      NULL
      NULL
      {1:"x",2:"y"}

      0: jdbc:drill:> select c6 from `/user/hive/warehouse/alltypesparquet`;
      ------------
      c6
      ------------
      {"map":[]}
      {"map":[]}
      {"map":[{"key":1,"value":"eA=="},{"key":2,"value":"eQ=="}]}
      ------------
      3 rows selected (0.077 seconds)
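
      The string values in the Drill output appear to be the Base64 encodings of the values Hive returns. That would be consistent with Drill treating the Parquet string column as raw bytes rather than UTF-8 text, although the report itself does not state the cause. A minimal Python sketch that checks this observation (the variable names are illustrative and not part of the report):

```python
import base64

# Map values copied from the Drill output above, keyed by map key.
drill_values = {1: "eA==", 2: "eQ=="}

# Decode each Base64 string and interpret the resulting bytes as UTF-8 text.
decoded = {
    key: base64.b64decode(value).decode("utf-8")
    for key, value in drill_values.items()
}

print(decoded)  # {1: 'x', 2: 'y'}
```

      The decoded values match the Hive result {1:"x",2:"y"}, which suggests the underlying data is intact and only its interpretation differs.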

        Attachments

        1. hive_alltypes.parquet
          2 kB
          Ramana Inukonda Nagaraj

            People

            • Assignee: Parth Chandra (parthc)
            • Reporter: Ramana Inukonda Nagaraj (inramana)
            • Votes: 0
            • Watchers: 3