Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.17.0
    • Component/s: None
    • Labels:

      Description

      When Hive stores array data in parquet format, it creates schema for such columns, like:
      arr_n_0 ARRAY<INT>

       optional group arr_n_0 (LIST) {
         repeated group bag {
           optional int32 array_element;
         }
       }
      

      Sample result before the changes was:

      {"bag":[{"array_element":1},{"array_element":2}]}
      

      After the changes Drill reads only array elements data without additional keys like "bag" or "array_element":

      [1,2]

       

       

      Please read Design Doc linked to parent task for more details. 

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ihuzenko Igor Guzenko
                Reporter:
                ihuzenko Igor Guzenko
                Reviewer:
                Vova Vysotskyi
              • Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: