Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-24594

/jobs endpoint returns same job id twice (suspended and running) until stopped

    XMLWordPrintableJSON

Details

    Description

      We observe this behavior since our migration to flink 1.13.2. 

      /jobs GET endpoint returns the same job id twice :

      {"jobs":[{"id":"58bde1f30e0ec30d511a996b74ab2e12","status":"RUNNING"},{"id":"58bde1f30e0ec30d511a996b74ab2e12","status":"SUSPENDED"}]}
      

      /jobs/overview GET returns :

      {"jobs":[{"jid":"58bde1f30e0ec30d511a996b74ab2e12","name":"Enrichissement Incidents HTA","state":"RUNNING","start-time":1634479775893,"end-time":-1,"duration":173551611,"last-modification":1634584098667,"tasks":{"total":1,"created":0,"scheduled":0,"deploying":0,"running":1,"finished":0,"canceling":0,"canceled":0,"failed":0,"reconciling":0,"initializing":0}},{"jid":"58bde1f30e0ec30d511a996b74ab2e12","name":"Enrichissement Incidents HTA","state":"SUSPENDED","start-time":1634300482098,"end-time":-1,"duration":179293293,"last-modification":0,"tasks":{"total":0,"created":0,"scheduled":0,"deploying":0,"running":0,"finished":0,"canceling":0,"canceled":0,"failed":0,"reconciling":0,"initializing":0}}]}
      

      Of course, this also shows up on the overview page of the job-manager web UI.
       
      /jobs/8bde1f30e0ec30d511a996b74ab2e12 returns details of the RUNNING job. 
       
      flink stop command stops the running job and both jobs disappear from /jobs and /jobs/overview
       
      We observed this several times on clusters we have migrated to flink 1.13.2, not sure yet how / when it happens.
       
      I noticed this issue : https://issues.apache.org/jira/browse/FLINK-20195 , it may be related but this is not exactly the same issue, I get this response until job is canceled / stopped.
       

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              azocankara Azocan Kara
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: