Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
0.13.1
-
None
-
None
Description
Attempting to store data into a Parquet table via HCatalog (using Pig) fails. Based on the code, it appears that org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat.getRecordWriter() is unimplemented.
To duplicate:
Hive:
create table parquet_input_test stored as parquet as with x as (select test as value) select * from x; create table parquet_output_test (value int) stored as parquet;
Pig:
A = LOAD 'parquet_input_test' USING org.apache.hive.hcatalog.pig.HCatLoader(); STORE A INTO 'parquet_output_test' USING org.apache.hive.hcatalog.pig.HCatStorer();
Result:
java.lang.RuntimeException: Should never be used at org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat.getRecordWriter(MapredParquetOutputFormat.java:76) at org.apache.hive.hcatalog.mapreduce.FileOutputFormatContainer.getRecordWriter(FileOutputFormatContainer.java:103) at org.apache.hive.hcatalog.mapreduce.HCatOutputFormat.getRecordWriter(HCatOutputFormat.java:260) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.getRecordWriter(PigOutputFormat.java:84) at org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.<init>(MapTask.java:623) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:743) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:339) at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:162) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491) at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:157)
Attachments
Issue Links
- duplicates
-
HIVE-4329 HCatalog should use getHiveRecordWriter rather than getRecordWriter
- Open
- is related to
-
HIVE-8838 Support Parquet through HCatalog
- Closed
- relates to
-
HIVE-22249 Support Parquet through HCatalog
- Closed