Hive / HIVE-7502

Writes to parquet tables via HCatalog fail with "java.lang.RuntimeException: Should never be used".

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.13.1
    • Fix Version/s: None
    • Component/s: HCatalog
    • Labels: None

    Description

      Storing data into a Parquet table via HCatalog (using Pig) fails. Based on the code, org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat.getRecordWriter() appears to be unimplemented: it throws a RuntimeException instead of returning a record writer.
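
The failure mode described above can be illustrated with a minimal, dependency-free sketch. It is not the Hive or HCatalog source: every class and method name below (SketchRecordWriter, SketchParquetOutputFormat, SketchContainer, RecordWriterSketch, tryGenericWrite) is hypothetical, and the split into a working Hive-specific writer and an unimplemented generic writer is an assumption drawn from the report and its stack trace.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a generic record-writer contract.
interface SketchRecordWriter {
    void write(String record);
}

// Hypothetical output format with two entry points (an assumption,
// loosely modelled on what the report says about MapredParquetOutputFormat).
class SketchParquetOutputFormat {
    final List<String> written = new ArrayList<>();

    // Generic entry point: unimplemented, so every caller fails immediately.
    SketchRecordWriter getRecordWriter() {
        throw new RuntimeException("Should never be used");
    }

    // Hive-specific entry point: assumed to be the path that works.
    SketchRecordWriter getHiveRecordWriter() {
        return written::add;
    }
}

// Hypothetical container that only knows the generic entry point,
// as FileOutputFormatContainer appears to in the stack trace.
class SketchContainer {
    private final SketchParquetOutputFormat format;

    SketchContainer(SketchParquetOutputFormat format) {
        this.format = format;
    }

    SketchRecordWriter getRecordWriter() {
        return format.getRecordWriter();
    }
}

class RecordWriterSketch {
    // Returns the failure message, or null if the write succeeded.
    static String tryGenericWrite(SketchParquetOutputFormat format) {
        try {
            new SketchContainer(format).getRecordWriter().write("1");
            return null;
        } catch (RuntimeException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        SketchParquetOutputFormat format = new SketchParquetOutputFormat();
        System.out.println("generic path: " + tryGenericWrite(format));
        format.getHiveRecordWriter().write("1");
        System.out.println("hive path wrote: " + format.written);
    }
}
```

In this sketch the container fails before any record is written, which matches the stack trace in the report: the exception is raised while the task is still obtaining its writer, not during a write.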

      To reproduce:

      Hive:

      create table parquet_input_test stored as parquet as with x as (select test as value) select * from x;
      create table parquet_output_test (value int) stored as parquet;
      

      Pig:

      A = LOAD 'parquet_input_test' USING org.apache.hive.hcatalog.pig.HCatLoader();
      STORE A INTO 'parquet_output_test' USING org.apache.hive.hcatalog.pig.HCatStorer();
      

      Result:

      java.lang.RuntimeException: Should never be used
      	at org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat.getRecordWriter(MapredParquetOutputFormat.java:76)
      	at org.apache.hive.hcatalog.mapreduce.FileOutputFormatContainer.getRecordWriter(FileOutputFormatContainer.java:103)
      	at org.apache.hive.hcatalog.mapreduce.HCatOutputFormat.getRecordWriter(HCatOutputFormat.java:260)
      	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.getRecordWriter(PigOutputFormat.java:84)
      	at org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.<init>(MapTask.java:623)
      	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:743)
      	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:339)
      	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:162)
      	at java.security.AccessController.doPrivileged(Native Method)
      	at javax.security.auth.Subject.doAs(Subject.java:415)
      	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
      	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:157)
      


          People

            Assignee: Unassigned
            Reporter: Craig Condit (ccondit)
