Uploaded image for project: 'Falcon'
  1. Falcon
  2. FALCON-1719

Retry does not update the state of the instance in the database

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.9
    • 0.9
    • scheduler
    • None

    Description

      Falcon retries the instance if it gets killed. Although the same state is not reflected in database in case of native scheduler.

      Instance status

      dataqa@lda01:~$ falcon instance -type process -name ProcessWfTest-agregator-coord16-a7c562ac -status -start 2016-01-04T12:04Z -end 2016-01-04T12:05Z
      Consolidated Status: SUCCEEDED
      
      Instances:
      Instance		Cluster		SourceCluster		Status		Start		End		Details					Log
      -----------------------------------------------------------------------------------------------
      2016-01-04T12:04Z	ProcessWfTest-corp-bb2ecac1	-	SUCCEEDED	2016-01-04T12:13Z	2016-01-04T12:13Z	-	http://lda01:11000/oozie?job=0000089-160104115615658-oozie-oozi-W
      actions:
       hdfscommands	OK	-
       aggregator	SUCCEEDED	http://lda01:8088/proxy/application_1451903998946_0133/
      
      Additional Information:
      Response: ua1/STATUS
      Request Id: ua1/512026219@qtp-1816147548-5 - 0dc22236-75fc-4a0b-a699-60266c304329
      
      

      Oozie info :

      dataqa@lda01:~$ oozie job -info 0000089-160104115615658-oozie-oozi-W
      SLF4J: Class path contains multiple SLF4J bindings.
      SLF4J: Found binding in [jar:file:/mnt/oozie/lib/slf4j-simple-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
      SLF4J: Found binding in [jar:file:/mnt/oozie/lib/slf4j-log4j12-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
      SLF4J: Found binding in [jar:file:/mnt/oozie/libext/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
      SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
      SLF4J: Actual binding is of type [org.slf4j.impl.SimpleLoggerFactory]
      Job ID : 0000089-160104115615658-oozie-oozi-W
      ------------------------------------------------------------------------------------------------------------------------------------
      Workflow Name : FALCON_PROCESS_DEFAULT_ProcessWfTest-agregator-coord16-a7c562ac
      App Path      : hdfs://192.168.138.236:8020/tmp/falcon-regression-staging/falcon/workflows/process/ProcessWfTest-agregator-coord16-a7c562ac/11642dd2e8532508c97721c51dbb28ce_1451909324381
      Status        : SUCCEEDED
      Run           : 1
      User          : dataqa
      Group         : -
      Created       : 2016-01-04 12:09 GMT
      Started       : 2016-01-04 12:13 GMT
      Last Modified : 2016-01-04 12:13 GMT
      Ended         : 2016-01-04 12:13 GMT
      CoordAction ID: -
      
      Actions
      ------------------------------------------------------------------------------------------------------------------------------------
      ID                                                                            Status    Ext ID                 Ext Status Err Code  
      ------------------------------------------------------------------------------------------------------------------------------------
      0000089-160104115615658-oozie-oozi-W@:start:                                  OK        -                      OK         -         
      ------------------------------------------------------------------------------------------------------------------------------------
      0000089-160104115615658-oozie-oozi-W@failed-post-processing                   OK        job_1451903998946_0108 SUCCEEDED  -         
      ------------------------------------------------------------------------------------------------------------------------------------
      0000089-160104115615658-oozie-oozi-W@fail                                     OK        -                      OK         E0729     
      ------------------------------------------------------------------------------------------------------------------------------------
      0000089-160104115615658-oozie-oozi-W@user-action                              OK        0000090-160104115615658-oozie-oozi-WSUCCEEDED  -         
      ------------------------------------------------------------------------------------------------------------------------------------
      0000089-160104115615658-oozie-oozi-W@succeeded-post-processing                OK        job_1451903998946_0135 SUCCEEDED  -         
      ------------------------------------------------------------------------------------------------------------------------------------
      0000089-160104115615658-oozie-oozi-W@end                                      OK        -                      OK         -         
      ------------------------------------------------------------------------------------------------------------------------------------
      
      

      Database state :

      mysql> select id,current_state from INSTANCES where instance_time="2016-01-04T12:04Z";
      +-----------------------------------------------------------------------------------------------+---------------+
      | id                                                                                            | current_state |
      +-----------------------------------------------------------------------------------------------+---------------+
      | PROCESS/ProcessWfTest-agregator-coord16-a7c562ac/ProcessWfTest-corp-bb2ecac1/2016-01-04-12-04 | FAILED     
      

      Attachments

        1. FALCON-1719.patch
          9 kB
          pavan kumar kolamuri
        2. FALCON-1719-v1.patch
          9 kB
          pavan kumar kolamuri

        Issue Links

          Activity

            People

              pavan kumar pavan kumar kolamuri
              pragya.mittal Pragya Mittal
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: