Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.17.0
    • None

    Description

      When Hive stores array data in parquet format, it creates schema for such columns, like:
      arr_n_0 ARRAY<INT>

       optional group arr_n_0 (LIST) {
         repeated group bag {
           optional int32 array_element;
         }
       }
      

      Sample result before the changes was:

      {"bag":[{"array_element":1},{"array_element":2}]}
      

      After the changes Drill reads only array elements data without additional keys like "bag" or "array_element":

      [1,2]

       

       

      Please read Design Doc linked to parent task for more details. 

      Attachments

        Issue Links

          Activity

            People

              ihuzenko Igor Guzenko
              ihuzenko Igor Guzenko
              Vova Vysotskyi Vova Vysotskyi
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: