Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4343

Tez auto parallelism fails at query compile time

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.14.0
    • None
    • tez
    • None

    Description

      I was running some legacy MR jobs in Tez mode to do perf benchmarks. But when pig.tez.auto.parallelism is enabled (by default), Pig fails with the following error-

      org.apache.pig.impl.plan.VisitorException: ERROR 0: java.io.IOException: Cannot estimate parallelism for scope-892, effective parallelism for predecessor scope-892 is -1
          at org.apache.pig.backend.hadoop.executionengine.tez.plan.optimizer.ParallelismSetter.visitTezOp(ParallelismSetter.java:189)
          at org.apache.pig.backend.hadoop.executionengine.tez.plan.TezOperator.visit(TezOperator.java:232)
          at org.apache.pig.backend.hadoop.executionengine.tez.plan.TezOperator.visit(TezOperator.java:49)
          at org.apache.pig.impl.plan.DependencyOrderWalker.walk(DependencyOrderWalker.java:70)
          at org.apache.pig.impl.plan.PlanVisitor.visit(PlanVisitor.java:46)
          at org.apache.pig.backend.hadoop.executionengine.tez.TezLauncher.processLoadAndParallelism(TezLauncher.java:429)
          at org.apache.pig.backend.hadoop.executionengine.tez.TezLauncher.launchPig(TezLauncher.java:143)
          at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.launchPig(HExecutionEngine.java:301)
          at org.apache.pig.PigServer.launchPlan(PigServer.java:1390)
          at org.apache.pig.LipstickPigServer.launchPlan(LipstickPigServer.java:151)
          at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1375)
          at org.apache.pig.PigServer.execute(PigServer.java:1364)
          at org.apache.pig.PigServer.executeBatch(PigServer.java:415)
          at org.apache.pig.PigServer.executeBatch(PigServer.java:398)
          at org.apache.pig.tools.grunt.GruntParser.executeBatch(GruntParser.java:171)
          at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:234)
          at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
          at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:81)
          at com.netflix.lipstick.Main.run(Main.java:496)
          at com.netflix.lipstick.Main.main(Main.java:171)
          at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
          at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
          at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
          at java.lang.reflect.Method.invoke(Method.java:606)
          at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
      Caused by: java.io.IOException: Cannot estimate parallelism for scope-892, effective parallelism for predecessor scope-892 is -1
          at org.apache.pig.backend.hadoop.executionengine.tez.plan.optimizer.TezOperDependencyParallelismEstimator.estimateParallelism(TezOperDependencyParallelismEstimator.java:116)
          at org.apache.pig.backend.hadoop.executionengine.tez.plan.optimizer.ParallelismSetter.visitTezOp(ParallelismSetter.java:134)
          ... 24 more
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            cheolsoo Cheolsoo Park
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: