Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-15055 Column pruning for nested fields in Parquet
  3. HIVE-15507

Nested column pruning: fix issue when selecting struct field from array/map element

    XMLWordPrintableJSON

Details

    Description

      When running the following query:

      SELECT count(col), arr[0].f
      FROM tbl
      GROUP BY arr[0].f
      

      where arr is an array of struct with field f. Nested column pruning will fail. This is because we currently process GenericUDFIndex in the same way as any other UDF. In this case, it will generate path arr.f, which will not match the struct type info when doing the pruning.
      Same thing for map.

      Attachments

        1. 15507.1.patch
          53 kB
          Chao Sun

        Issue Links

          Activity

            People

              csun Chao Sun
              csun Chao Sun
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: