Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2074

TaskRescheduledAfterVertexSuccessTransition may go back to RUNNING incorrectly

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

          public VertexState transition(VertexImpl vertex, VertexEvent event) {
            if (vertex.outputCommitters == null // no committer
                || vertex.outputCommitters.isEmpty() // no committer
                || !vertex.commitVertexOutputs) { // committer does not commit on vertex success
              LOG.info(vertex.getLogIdentifier() + " back to running due to rescheduling "
                  + ((VertexEventTaskReschedule)event).getTaskID());
              (new TaskRescheduledTransition()).transition(vertex, event);
              // inform the DAG that we are re-running
              vertex.eventHandler.handle(new DAGEventVertexReRunning(vertex.getVertexId()));
              return VertexState.RUNNING;
            }
            ...
          }
       

      The "// committer does not commit on vertex success" may be wrong because the DAG might have completed at this time and the overall commit might be in progress. If so, the Vertex and DAG should fail.

      Attachments

        Activity

          People

            Unassigned Unassigned
            bikassaha Bikas Saha
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: