Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
0.12.0
-
None
-
None
-
None
-
local environment
Description
I got the following (simplified) input :
0 1135429845 -7727023712262303168 2004267823 -7727023712262303168 2004267823
If I use the following script:
data = LOAD 'input' USING PigStorage('\t') AS (uid:chararray, hash:chararray); dataUidNotNull = FILTER data BY (uid != '0') AND (uid != '') AND (uid is not null); dataUidNotNullDistinct = DISTINCT dataUidNotNull; dataUidGroup = GROUP dataUidNotNullDistinct BY uid; ILLUSTRATE dataUidGroup;
I got this error :
Pig Stack Trace
---------------
ERROR 2997: Encountered IOException. Exceptionjava.io.IOException: Exception
at org.apache.pig.PigServer.getExamples(PigServer.java:1246)
at org.apache.pig.tools.grunt.GruntParser.processIllustrate(GruntParser.java:831)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.Illustrate(PigScriptParser.java:802)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:381)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:198)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:173)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:69)
at org.apache.pig.Main.run(Main.java:541)
at org.apache.pig.Main.main(Main.java:156)
Caused by: java.lang.ClassCastException: org.apache.pig.data.BinSedesTuple cannot be cast to org.apache.pig.pen.util.ExampleTuple
at org.apache.pig.pen.AugmentBaseDataVisitor.visit(AugmentBaseDataVisitor.java:444)
at org.apache.pig.newplan.logical.relational.LODistinct.accept(LODistinct.java:51)
at org.apache.pig.pen.util.PreOrderDepthFirstWalker.depthFirst(PreOrderDepthFirstWalker.java:82)
at org.apache.pig.pen.util.PreOrderDepthFirstWalker.depthFirst(PreOrderDepthFirstWalker.java:84)
at org.apache.pig.pen.util.PreOrderDepthFirstWalker.walk(PreOrderDepthFirstWalker.java:66)
at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:52)
at org.apache.pig.pen.ExampleGenerator.getExamples(ExampleGenerator.java:180)
at org.apache.pig.PigServer.getExamples(PigServer.java:1240)
... 8 more
I also get the following debug message :
DEBUG mapReduceLayer.PigMapReduce$Reduce: Reduce Plan empty!
However, if I save dataUidNotNullDistinct and load it again, I got no more error, i.e. that script below is working:
data = LOAD 'inputNoBag' USING PigStorage('\t') AS (uid:chararray, hash:chararray); dataUid = FOREACH data GENERATE uid as uid, hash as hash; dataUidNotNull = FILTER dataUid BY (uid != '0') AND (uid != '') AND (uid is not null); dataUidNotNullDistinct = DISTINCT dataUidNotNull; STORE dataUidNotNullDistinct INTO 'output'; dataUidNotNullDistinct2 = LOAD 'output' USING PigStorage('\t') AS (uid:chararray, hash:chararray); dataUidGroup = GROUP dataUidNotNullDistinct2 BY uid; ILLUSTRATE dataUidGroup;