  1. Apache Tez
  2. TEZ-2568

V_INPUT_DATA_INFORMATION may happen after vertex is initialized

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.7.0
    • Fix Version/s: 0.8.0-alpha, 0.7.1
    • Component/s: None
    • Labels: None

    Description

      2015-06-19 15:57:28,462 ERROR [Dispatcher thread: Central] impl.VertexImpl: Can't handle Invalid event V_INPUT_DATA_INFORMATION on vertex Map 2 with vertexId vertex_1434754502979_0002_2_00 at current state INITED
      org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: V_INPUT_DATA_INFORMATION at INITED
              at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
              at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
              at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
              at org.apache.tez.state.StateMachineTez.doTransition(StateMachineTez.java:57)
              at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:1799)
              at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:198)
              at org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1963)
              at org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1949)
              at org.apache.tez.common.AsyncDispatcher.dispatch(AsyncDispatcher.java:183)
              at org.apache.tez.common.AsyncDispatcher$1.run(AsyncDispatcher.java:114)
              at java.lang.Thread.run(Thread.java:722)
      

      A vertex moves to INITED as soon as its parallelism is determined, it has no null edges, and its root inputs are initialized. Handling RootInputDataInformation is not a precondition for the move to INITED. We can't wait in the INITIALIZING state for all V_INPUT_DATA_INFORMATION events to arrive, because it is not known how many of them we will receive; that number is determined by the VertexManager (VM). The fix is therefore to allow V_INPUT_DATA_INFORMATION to be handled after the vertex is initialized, that is, in the INITED state.
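
      The idea behind the fix can be sketched as a small transition table. This is only an illustration and not the actual patch: it does not use the Hadoop StateMachineFactory, the class VertexStateSketch and the events V_INIT_COMPLETE and V_START are hypothetical stand-ins, and only the names INITIALIZING, INITED and V_INPUT_DATA_INFORMATION come from the report above. The point is the extra self-transition on INITED, which lets a late V_INPUT_DATA_INFORMATION be handled instead of raising an invalid-transition error.

```java
import java.util.EnumMap;
import java.util.Map;

// Toy model of the vertex state machine; names other than INITIALIZING,
// INITED and V_INPUT_DATA_INFORMATION are assumptions made for illustration.
public class VertexStateSketch {
    enum State { INITIALIZING, INITED, RUNNING }
    enum Event { V_INPUT_DATA_INFORMATION, V_INIT_COMPLETE, V_START }

    // TRANSITIONS.get(state).get(event) is the next state; a missing entry
    // means the event is invalid in that state.
    private static final Map<State, Map<Event, State>> TRANSITIONS =
        new EnumMap<State, Map<Event, State>>(State.class);

    static {
        for (State s : State.values()) {
            TRANSITIONS.put(s, new EnumMap<Event, State>(Event.class));
        }
        TRANSITIONS.get(State.INITIALIZING).put(Event.V_INPUT_DATA_INFORMATION, State.INITIALIZING);
        TRANSITIONS.get(State.INITIALIZING).put(Event.V_INIT_COMPLETE, State.INITED);
        // The self-transition this issue asks for: without it, the event
        // below would be rejected once the vertex has reached INITED.
        TRANSITIONS.get(State.INITED).put(Event.V_INPUT_DATA_INFORMATION, State.INITED);
        TRANSITIONS.get(State.INITED).put(Event.V_START, State.RUNNING);
    }

    private State state = State.INITIALIZING;
    private int inputDataEvents = 0;

    State getState() { return state; }
    int getInputDataEvents() { return inputDataEvents; }

    void handle(Event event) {
        State next = TRANSITIONS.get(state).get(event);
        if (next == null) {
            throw new IllegalStateException("Invalid event: " + event + " at " + state);
        }
        if (event == Event.V_INPUT_DATA_INFORMATION) {
            inputDataEvents++; // count the events actually processed
        }
        state = next;
    }

    public static void main(String[] args) {
        VertexStateSketch vertex = new VertexStateSketch();
        vertex.handle(Event.V_INPUT_DATA_INFORMATION); // arrives while initializing
        vertex.handle(Event.V_INIT_COMPLETE);          // vertex is now INITED
        vertex.handle(Event.V_INPUT_DATA_INFORMATION); // arrives late, still accepted
        System.out.println(vertex.getState() + " " + vertex.getInputDataEvents());
    }
}
```

      In this sketch, removing the INITED self-transition makes the third handle call throw, which mirrors the "Invalid event: V_INPUT_DATA_INFORMATION at INITED" error in the log above.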

      Attachments

        1. a.log
          37 kB
          Jeff Zhang
        2. TEZ-2568-1.patch
          12 kB
          Jeff Zhang
        3. TEZ-2568-2.patch
          14 kB
          Jeff Zhang
        4. TEZ-2568-3.patch
          2 kB
          Jeff Zhang
        5. TEZ-2568-4.patch
          3 kB
          Jeff Zhang
        6. TEZ-2568-5.patch
          3 kB
          Jeff Zhang

          People

            Assignee: Jeff Zhang (zjffdu)
            Reporter: Jeff Zhang (zjffdu)
            Votes: 0
            Watchers: 4