Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-8205

Using strings in group type fails in ParquetSerDe

    XMLWordPrintableJSON

    Details

      Description

      In HIVE-7735, schema info was plumbed to ETypeConverter to disambiguate between hive Char, Varchar and String types, which are all represented as PrimitiveType "binary" and OriginalType "utf8" in parquet.

      However, this does not work for parquet nested types (that map to hive Array, Map, etc.) containing these values, because schema lookup for nested values was not implemented. It's also non-trivial to do that in the current parquet serde implementation. Instead of plumbing in the schema, we should convert
      these types to the same Text writeable and let the object inspectors handle the final conversion.

        Attachments

        1. HIVE-8205.1.patch
          30 kB
          Xuefu Zhang
        2. HIVE-8205.1.patch
          30 kB
          Mohit Sabharwal
        3. HIVE-8205.patch
          29 kB
          Mohit Sabharwal

          Issue Links

            Activity

              People

              • Assignee:
                mohitsabharwal Mohit Sabharwal
                Reporter:
                mohitsabharwal Mohit Sabharwal
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: