Uploaded image for project: 'Tajo'
  1. Tajo
  2. TAJO-160

StorageManager throws InvalidInputException while running simple join query

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.2-incubating, 0.8.0
    • Component/s: Storage
    • Labels:
      None
    • Environment:

      Ubuntu 12.04.2 LTS, Java 1.6.0_26 (Sun Microsystems Inc.), hadoop 2.0.4-alpha

      Description

      While executing the query "select * from customer as c inner join nation as n on c.c_nationkey = n.n_nationkey;", with nation and customer being the tables generated from TPCH, I got the following:

      tajo> select * from customer as c inner join nation as n on c.c_nationkey = n.n_nationkey;
      2013-09-05 04:45:05,632 INFO  client.TajoClient (TajoClient.java:connectionToQueryMaster(190)) - Connected to Query Master (qid=q_1378350112573_0007, addr=146.XXX.XX.XX:8091)
      2013-09-05 04:45:05,700 INFO  rpc.NettyClientBase (NettyClientBase.java:close(87)) - Proxy is disconnected from 146.XXX.XX.XX:8091
      2013-09-05 04:45:05,700 INFO  client.TajoClient (TajoClient.java:closeQuery(113)) - Closed a QueryMaster connection (qid=q_1378350112573_0007, addr=computer3/146.XXX.XX.XX:8091)
      null
      tajo>
      

      "select * from customer;" and "select * from nation;" returned correctly the tables.

      Here follows the yarn stderr user log:

      13/09/05 04:15:46 INFO service.AbstractService: Service:org.apache.tajo.master.YarnTaskRunnerLauncherImpl is started.
      13/09/05 04:15:46 INFO service.AbstractService: Service:org.apache.hadoop.yarn.client.AMRMClientImpl is started.
      13/09/05 04:15:46 INFO service.AbstractService: Service:org.apache.tajo.worker.AbstractResourceAllocator is started.
      13/09/05 04:15:46 INFO service.AbstractService: Service:org.apache.tajo.master.TajoAsyncDispatcher is started.
      13/09/05 04:15:46 INFO master.TajoAsyncDispatcher: AsyncDispatcher started:q_1378350112573_0004
      13/09/05 04:15:46 INFO service.AbstractService: Service:org.apache.tajo.master.querymaster.QueryMasterTask is started.
      13/09/05 04:15:46 INFO querymaster.Query: Processing q_1378350112573_0004 of type INIT
      13/09/05 04:15:46 INFO querymaster.Query: q_1378350112573_0004 Query Transitioned from QUERY_NEW to QUERY_INIT
      13/09/05 04:15:46 INFO querymaster.Query: Processing q_1378350112573_0004 of type START
      13/09/05 04:15:46 INFO rm.YarnRMContainerAllocator: Available Resource: <memory:6144, vCores:0>
      13/09/05 04:15:46 WARN querymaster.SubQuery: SubQuery (eb_1378350112573_0004_000001) failed
      org.apache.tajo.storage.StorageManager$InvalidInputException
      	at org.apache.tajo.storage.StorageManager.listStatus(StorageManager.java:440)
      	at org.apache.tajo.storage.StorageManager.getSplits(StorageManager.java:622)
      	at org.apache.tajo.master.querymaster.Repartitioner.createJoinTasks(Repartitioner.java:85)
      	at org.apache.tajo.master.querymaster.SubQuery$InitAndRequestContainer.createTasks(SubQuery.java:566)
      	at org.apache.tajo.master.querymaster.SubQuery$InitAndRequestContainer.transition(SubQuery.java:449)
      	at org.apache.tajo.master.querymaster.SubQuery$InitAndRequestContainer.transition(SubQuery.java:433)
      	at org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:382)
      	at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:299)
      	at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:43)
      	at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:445)
      	at org.apache.tajo.master.querymaster.SubQuery.handle(SubQuery.java:410)
      	at org.apache.tajo.master.querymaster.Query$StartTransition.transition(Query.java:279)
      	at org.apache.tajo.master.querymaster.Query$StartTransition.transition(Query.java:268)
      	at org.apache.hadoop.yarn.state.StateMachineFactory$SingleInternalArc.doTransition(StateMachineFactory.java:359)
      	at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:299)
      	at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:43)
      	at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:445)
      	at org.apache.tajo.master.querymaster.Query.handle(Query.java:399)
      	at org.apache.tajo.master.querymaster.Query.handle(Query.java:53)
      	at org.apache.tajo.master.TajoAsyncDispatcher.dispatch(TajoAsyncDispatcher.java:139)
      	at org.apache.tajo.master.TajoAsyncDispatcher$1.run(TajoAsyncDispatcher.java:79)
      	at java.lang.Thread.run(Thread.java:679)
      13/09/05 04:15:46 INFO querymaster.Query: q_1378350112573_0004 Query Transitioned from QUERY_INIT to QUERY_RUNNING
      13/09/05 04:15:46 INFO querymaster.Query: Processing q_1378350112573_0004 of type SUBQUERY_COMPLETED
      13/09/05 04:15:46 FATAL master.TajoAsyncDispatcher: Error in dispatcher thread:SUBQUERY_COMPLETED
      java.lang.ClassCastException: org.apache.tajo.master.event.QueryDiagnosticsUpdateEvent cannot be cast to org.apache.tajo.master.event.SubQueryCompletedEvent
      	at org.apache.tajo.master.querymaster.Query$SubQueryCompletedTransition.transition(Query.java:291)
      	at org.apache.tajo.master.querymaster.Query$SubQueryCompletedTransition.transition(Query.java:284)
      	at org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:382)
      	at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:299)
      	at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:43)
      	at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:445)
      	at org.apache.tajo.master.querymaster.Query.handle(Query.java:399)
      	at org.apache.tajo.master.querymaster.Query.handle(Query.java:53)
      	at org.apache.tajo.master.TajoAsyncDispatcher.dispatch(TajoAsyncDispatcher.java:139)
      	at org.apache.tajo.master.TajoAsyncDispatcher$1.run(TajoAsyncDispatcher.java:79)
      	at java.lang.Thread.run(Thread.java:679)
      13/09/05 04:15:46 INFO querymaster.Query: Processing q_1378350112573_0004 of type SUBQUERY_COMPLETED
      13/09/05 04:15:46 INFO querymaster.Query: q_1378350112573_0004 Query Transitioned from QUERY_RUNNING to QUERY_FAILED
      

        Issue Links

          Activity

          Hide
          jihoonson Jihoon Son added a comment -

          This is due to the legacy table path.
          It will be fixed after TAJO-80 is resolved.

          Show
          jihoonson Jihoon Son added a comment - This is due to the legacy table path. It will be fixed after TAJO-80 is resolved.

            People

            • Assignee:
              Unassigned
              Reporter:
              sesteves Sergio Esteves
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development