HIVE-12362

Hive's Parquet SerDe ignores 'serialization.null.format' property

Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.1.0
    • Fix Version/s: None
    • Component/s: Hive
    • Labels: None

    Description

      create table src (a string);
      insert into table src values (NULL), (''), ('');
      
      0: jdbc:hive2://localhost:10000/default> select * from src;
      +-----------+--+
      | src.a     |
      +-----------+--+
      | NULL      |
      |           |
      |           |
      +-----------+--+
      
      create table dest (a string)
        row format serde 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
        stored as
          INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
          OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat';
      
      alter table dest set SERDEPROPERTIES ('serialization.null.format' = '');
      alter table dest set TBLPROPERTIES ('serialization.null.format' = '');
      insert overwrite table dest select * from src;
      
      0: jdbc:hive2://localhost:10000/default> select * from dest;
      +-----------+--+
      | dest.a    |
      +-----------+--+
      | NULL      |
      |           |
      |           |
      +-----------+--+
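
      Because beeline renders an empty string as a blank cell, a counting query along these lines may make the NULL versus empty-string split explicit. This is only a sketch: the aliases null_rows and empty_rows are illustrative and not part of the original report, and the expectation in the last comment assumes the Parquet SerDe should treat the property the way Hive's text SerDe does.

      ```sql
      -- Count NULLs and empty strings separately in the Parquet-backed table.
      select
        sum(case when a is null then 1 else 0 end) as null_rows,
        sum(case when a = ''    then 1 else 0 end) as empty_rows
      from dest;
      -- For the three rows shown above (one NULL, two ''), this gives
      -- null_rows = 1 and empty_rows = 2.
      -- Assumption: if 'serialization.null.format' = '' were honored, the
      -- empty strings would come back as NULL (null_rows = 3, empty_rows = 0).
      ```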
      

      Attachments

        1. HIVE-12362.2.patch
          8 kB
          Naveen Gangam
        2. HIVE-12362.patch
          2 kB
          Naveen Gangam

          People

            Assignee: Naveen Gangam (ngangam)
            Reporter: Naveen Gangam (ngangam)
            Votes: 0
            Watchers: 6

            Dates

              Created:
              Updated: